Raspberry-Pi Based Physical Media to Audio Conversion device for Visually Impaired Individuals
DOI:
https://doi.org/10.32628/IJSRSET24114127Keywords:
Visually Impaired People, Optical Character Recognition, Text-To-Speech ConvertersAbstract
The proposed product is a device for real-time scanning and conversion of text from physical media to audio for the aid of visually impaired individuals. The focus of the project is to make a device which brings the experience of visually impaired individuals as close to that of the ordinarily abled/educated as possible when it comes to access to resources, books, and physical reading material. This device is targeted towards libraries, reading rooms, and schools for visually impaired individuals. The prototype is developed using a FDM 3D printer with PLA material and using a laser cutting machine with MDF material to allow for maximum customisability to meet the end-user’s needs. The proposed device is equipped with a Raspberry Pi 4B+, a camera, two pushbuttons, two potentiometers and a head-phone. A variety of image processing techniques, bundled with open-source optical character recognition (OCR) software and text-to-speech libraries, are used to capture and process images of book pages and convert them to audio files, all while maintaining a physical user interface which can be navigated autonomously by the visually challenged. The product is capable of handling over 200 fonts from 8pt to 36pt size. The product is successfully tested on 15 users for approximately 4000 words.
Downloads
References
P. Vashist et al., “Blindness and visual impairment and their causes in India: Results of a nationally representative survey,” PLoS One, vol. 17, no. 7 July, Jul. 2022, doi: 10.1371/journal.pone.0271736. DOI: https://doi.org/10.1371/journal.pone.0271736
R. M. Peters and D. Goldreich, “Tactile spatial acuity in childhood: Effects of age and fingertip size,” PLoS One, vol. 8, no. 12, Dec. 2013, doi: 10.1371/journal.pone.0084650. DOI: https://doi.org/10.1371/journal.pone.0084650
K. M. Gopal, S. Kumar, and O. Garg, “Senior Care Reforms in India: Reimagining the senior care paradigm,” 2023. [Online]. Available: https://www.niti.gov.in/ DOI: https://doi.org/10.31219/osf.io/tnr98
P. Joy and R. Brunsdon, “Visual agnosia and prosopagnosia in childhood: A prospective case study,” Child Neuropsychol., vol. 8, no. 1, pp. 1–15, 2002, doi: 10.1076/chin.8.1.1.8721. DOI: https://doi.org/10.1076/chin.8.1.1.8721
K. Khanam, “Changing Problems of Elderly Persons of India Changing Problems of Elderly Persons of India,” no. May, 2022.
T. Smythe, M. Zuurmond, C. J. Tann, M. Gladstone, and H. Kuper, “Early intervention for children with developmental disabilities in low and middle-income countries - The case for action,” Int. Health, vol. 13, no. 3, pp. 222–231, 2021, doi: 10.1093/inthealth/ihaa044. DOI: https://doi.org/10.1093/inthealth/ihaa044
V. Argyropoulos et al., “Refreshable braille displays and reading fluency: A pilot study in individuals with blindness,” Educ. Inf. Technol., vol. 25, no. 5, pp. 3613–3630, Sep. 2020, doi: 10.1007/s10639-020-10126-2. DOI: https://doi.org/10.1007/s10639-020-10126-2
N. Coppins and F. Barlow-Brown, “Reading difficulties in blind, braille-reading children,” Br. J. Vis. Impair., vol. 24, no. 1, pp. 37–39, 2006, doi: 10.1177/0264619606060035. DOI: https://doi.org/10.1177/0264619606060035
American Foundation for the Blind, “What Is Braille?,” https://www.afb.org/blindness-and-low-vision/braille/what-braille.
S. Thiyagarajan, G. Saravana Kumar, E. Praveen Kumar, and G. Sakana 4 2 Professor, “Implementation of Optical Character Recognition Using Raspberry Pi for Visually Challenged Person,” 2018. [Online]. Available: https://raspberry.piaustralia.com.au/raspberry-pi-3-model-b DOI: https://doi.org/10.14419/ijet.v7i3.34.18718
O. Journal, M. Khorrami-nejad, A. Sarabandi, M.-R. Akbari, and F. Askarizadeh, “Medical Hypothesis, Discovery &Innovation The Impact of Visual Impairment on Quality of Life Correspondence to,” 2016.
Q. Ye and D. Doermann, “Text Detection and Recognition in Imagery: A Survey,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 37, no. 7, pp. 1480–1500, Jul. 2015, doi: 10.1109/TPAMI.2014.2366765. DOI: https://doi.org/10.1109/TPAMI.2014.2366765
N. Islam, Z. Islam, and N. Noor, “A Survey on Optical Character Recognition System,” 2016. [Online]. Available: https://www.researchgate.net/publication/320442536
O. Nazir and A. Malik, “Deep Learning End to End Speech Synthesis: A Review,” in ICSCCC 2021 - International Conference on Secure Cyber Computing and Communications, Institute of Electrical and Electronics Engineers Inc., May 2021, pp. 66–71. doi: 10.1109/ICSCCC51823.2021.9478125. DOI: https://doi.org/10.1109/ICSCCC51823.2021.9478125
G. R. Hemanth, M. Jayasree, S. K. Venii, P. Akshaya, and R. Saranya, “CNN-RNN BASED HANDWRITTEN TEXT RECOGNITION,” ICTACT J. SOFT Comput., p. 1, 2021, doi: 10.21917/ijsc.2021.0351.
Y. Ning, S. He, Z. Wu, C. Xing, and L. J. Zhang, “Review of deep learning based speech synthesis,” Appl. Sci., vol. 9, no. 19, pp. 1–16, 2019, doi: 10.3390/app9194050. DOI: https://doi.org/10.3390/app9194050
A. Mukhamadiyev, I. Khujayarov, O. Djuraev, and J. Cho, “Automatic Speech Recognition Method Based on Deep Learning Approaches for Uzbek Language,” Sensors, vol. 22, no. 10, 2022, doi: 10.3390/s22103683. DOI: https://doi.org/10.3390/s22103683
Jinto TJ, Lena EA, and Sudip Chakraborty, “Alexa Enabled Assistive System For Visually Impaired Persons Using AWS, Raspberry PI, And C#,” Int. Res. J. Mod. Eng. Technol. Sci., Dec. 2023, doi: 10.56726/irjmets47329. DOI: https://doi.org/10.56726/IRJMETS47329
Sushmitha, Geetha Nayana, G. Sree Varshitha, Bhoomika HR, Keerthi KS, and Geetha Nayana, “Raspberry Pi based Object Detection and Text Reader using Voice Assistance,” Int. J. Res. Appl. Sci. Eng. Technol., vol. 9, no. VIII, pp. 157–160, Aug. 2021, doi: 10.22214/ijraset.2021.37262. DOI: https://doi.org/10.22214/ijraset.2021.37262
R. Setiawan, R. A. Fadlurahman, and N. F. Hikmah, “ShareAlike 4.0 International License (CC BY-SA 4.0) How to cite: ‘Design of Low Vision Electronic Glasses with Image Processing Capabilities Using Raspberry Pi’ Design of Low Vision Electronic Glasses with Image Processing Capabilities Using Raspberry Pi,” J. Electron. Electromed. Eng. Med. Informatics, vol. 5, no. 2, pp. 89–98, 2023, doi: 10.35882/jeemi.v5i2.294. DOI: https://doi.org/10.35882/jeeemi.v5i2.294
J. Kane, M. N. Johnstone, and P. Szewczyk, “Voice Synthesis Improvement by Machine Learning of Natural Prosody,” Sensors, vol. 24, no. 5, pp. 1–22, 2024, doi: 10.3390/s24051624. DOI: https://doi.org/10.3390/s24051624
Seeing AI, “Seeing AI.” [Online]. Available: https://www.seeingai.com/
Envision Technologies B.V., “Envision AI.” [Online]. Available: https://apps.apple.com/us/app/envision-ai/id1268632314
Be my eyes, “See the world together.” [Online]. Available: https://www.bemyeyes.com/
National Federation of the blind, “KNFBreader.” [Online]. Available: https://knfbreader.nfb.org/knfbreader
Dolfin Computer Access, “Dolphin Easy Reader.” [Online]. Available: https://yourdolphin.com/EasyReader-App/Features
CZUR, “CZUR Fancy Series,” https://www.czur.com/product/fancyPro.
American Foundation for the blind, “Braille Printers,” https://www.afb.org/blindness-and-low-vision/using-technology/assistive-technology-products/braille-printers.
American Foundation for the Blind, “Refreshable Braille Displays,” https://www.afb.org/node/16207/refreshable-braille-displays.
American Foundation for the Blind, “Screen Readers,” https://www.afb.org/blindness-and-low-vision/using-technology/assistive-technology-products/screen-readers.
American Foundation for the Blind, “Speech Synthesizers,” https://www.afb.org/node/16207/speech-synthesizers.
Amazon, “Get your imagination going,” https://www.audible.in/.
P. Parikh, A. Sharma, R. Trivedi, D. Roy, and K. Joshi, “Performance evaluation of an indigenously-designed high performance dynamic feeding robotic structure using advanced additive manufacturing technology , machine learning and robot kinematics,” Int. J. Interact. Des. Manuf., 2023, doi: 10.1007/s12008-023-01513-3. DOI: https://doi.org/10.1007/s12008-023-01513-3
P. A. Parikh, R. Trivedi, and K. D. Joshi, “Optimising inverse kinematics algorithm for an indigenous vision-based feeding serial robot using particle swarm optimisation and hybrid genetic algorithm : a comparison,” Int. J. Adv. Mechatron. Syst., vol. 10, no. 2, pp. 88–101, 2023, doi: 10.1504/IJAMECHS.2023.131332. DOI: https://doi.org/10.1504/IJAMECHS.2023.131332
P. A. Parikh, K. D. Joshi, and R. Trivedi, “Face Detection-Based Depth Estimation by 2D and 3D Cameras: A Comparison,” in 2022 28th International Conference on Mechatronics and Machine Vision in Practice (M2VIP), IEEE, Nov. 2022, pp. 1–4. doi: 10.1109/M2VIP55626.2022.10041072. DOI: https://doi.org/10.1109/M2VIP55626.2022.10041072
P. Parikh, R. Trivedi, J. Dave, K. Joshi, and D. Adhyaru, “Design and Development of a Low-Cost Vision-Based 6 DoF Assistive Feeding Robot for the Aged and Specially-Abled People,” IETE J. Res., 2023, doi: 10.1080/03772063.2023.2173665. DOI: https://doi.org/10.1080/03772063.2023.2173665
P. Parikh, R. Trivedi, and K. Joshi, “Continuous trajectory planning of a 6 DoF feeding robotic arm using a novel multi-point LSPB algorithm,” in 2022 28th International Conference on Mechatronics and Machine Vision in Practice (M2VIP), IEEE, Nov. 2022, pp. 1–6. doi: 10.1109/M2VIP55626.2022.10041082. DOI: https://doi.org/10.1109/M2VIP55626.2022.10041082
Downloads
Published
Issue
Section
License
Copyright (c) 2024 International Journal of Scientific Research in Science, Engineering and Technology
This work is licensed under a Creative Commons Attribution 4.0 International License.