Advancing Accessibility and Social Inclusion through Generative AI and Machine Learning-Powered Multi-Modality on Mobile Platforms

Waseem Syed

doi:10.32628/CSEIT25111254

Authors

Waseem Syed JNTU, Hyderabad, India Author

DOI:

https://doi.org/10.32628/CSEIT25111254

Keywords:

Multi-Modal Accessibility, Generative AI, Digital Inclusion, Assistive Technology, Mobile Computing

Abstract

The advent of generative AI-powered multi-modality on mobile platforms is transforming how devices function as empowering tools, facilitating inclusive and accessible experiences for diverse populations. By integrating voice, text, and image interactions with advanced AI technologies such as Whisper, Gemini, alongside mobile-specific frameworks, multi-modality addresses significant accessibility challenges in mobile computing. This integration enables features like voice-to-text conversion, image-based form filling, and contextual conversation summarization, which enhance usability for individuals with disabilities, non-native speakers, and those managing complex digital workflows. Although acknowledging several advancements, the article also addresses challenges such as data privacy, algorithmic bias, and unequal distribution of AI technologies, exploring both societal and technical difficulties in their equitable deployment. Additionally, it discusses multi-modality application and benefits across healthcare, education, and public services, calling for a collaborative and inclusive approach in technology development to ensure that innovations benefit all users. This analysis highlights how generative AI and multi-modality are key to advancing digital inclusivity on mobile platforms.

Downloads

Download data is not yet available.

References

World Bank Group, "Digital Development Global Practice," World Bank, 2023. [Online]. Available: https://thedocs.worldbank.org/en/doc/b16e2ba1cb754ab47a2dd1b214dd374e-0400062023/original/DigitalDevelopmentBrochure.pdf

World Bank Group, "World Report on Disability," World Bank, 2011. [Online]. Available: https://documents1.worldbank.org/curated/ar/665131468331271288/pdf/627830WP0World00PUBLIC00BOX361491B0.pdf

UNICEF and WHO, "Global report on assistive technology," UNICEF, 2022. [Online]. Available: https://www.unicef.org/reports/global-report-assistive-technology

Ana Maria Carrillo Soubic, "Background Paper on Technologies and Older Persons," United Nations Department of Economic and Social Affairs, 2021. [Online]. Available: https://social.desa.un.org/sites/default/files/inline-files/Technologies-and-older-persons-by-Ana-Maria-Carrillo-Soubic.pdf

UNESCO, "Artificial intelligence and gender equality: key findings of UNESCO’s Global Dialogue," UNESCO Digital Library, 2020. [Online]. Available: https://unesdoc.unesco.org/ark:/48223/pf0000374174

International Telecommunication Union, "Measuring digital development: Facts and Figures 2024," ITU Publications, 2024. [Online]. Available: https://www.itu.int/en/ITU-D/Statistics/pages/facts/default.aspx

W3C Web Accessibility Initiative (WAI), "Making the Web Accessible," W3C. [Online]. Available: https://www.w3.org/WAI/

OECD, "State of implementation of the OECD AI Principles," OECD Digital Economy Papers, June 2021. [Online]. Available: https://www.oecd.org/en/publications/state-of-implementation-of-the-oecd-ai-principles_1cd40c44-en.html

Tim Springer, "The Fifth Annual State of Digital Accessibility Report," Level Access, 2023. [Online]. Available: https://www.levelaccess.com/wp-content/uploads/2023/12/The-Fifth-Annual-State-of-Digital-Accessibility-Report.pdf

N. Chumuang and M. Ketcham, "Model for Handwritten Recognition Based on Artificial Intelligence," 2018 International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP), Pattaya, Thailand, 2018, pp. 1-5, doi: 10.1109/iSAI-NLP.2018.8692958. DOI: https://doi.org/10.1109/iSAI-NLP.2018.8692958

FXM, "AI in Education: Enhancing Accessibility and Inclusivity for Individuals with Disabilities," FXM Web, Aug. 2024. [Online]. Available: https://www.fxmweb.com/insights/ai-in-education-enhancing-accessibility-and-inclusivity-for-individuals-with-disabilities.html

Daniel Gaspar-Figueiredo et al., "Reinforcement Learning-Based Framework for the Intelligent Adaptation of User Interfaces," arXiv:2405.09255v1 [cs.HC], May 2024. Available: https://arxiv.org/html/2405.09255v1

"IEEE Standard Adoption of Moving Picture, Audio and Data Coding by Artificial Intelligence (MPAI) Technical Specification Multimodal Conversion Version 1.2," in IEEE Std 3300-2022 , vol., no., pp.1-108, 28 April 2023, doi: 10.1109/IEEESTD.2023.10112603. DOI: https://doi.org/10.1109/IEEESTD.2023.10112603

Syed, W. (2025). AI-powered multi-modal form filling: Advancing accessibility through voice and image recognition. International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 11(1), 1–11. https://doi.org/10.32628/CSEIT2511120301. DOI: https://doi.org/10.32628/CSEIT25111203