Automated Framework for Communication Development in Autism Spectrum Disorder Using Whisper ASR and GPT-4o LLM

Authors

  • Naela Fauzul Muna Universitas Islam Indonesia, Yogyakarta, Indonesia
  • Mukhammad Andri Setiawan Universitas Islam Indonesia, Yogyakarta, Indonesia

DOI:

https://doi.org/10.21009/jtp.v27i1.54243

Keywords:

Autism Spectrum Disorder, Communication Assessment, Artificial Intelligence, Whisper Automatic Speech Recognition, GPT Large Language Model

Abstract

Autism Spectrum Disorder (ASD) is a developmental condition impacting communication, social interaction, and behavior. Communication assessments for children with ASD are often conducted manually, making the process time-consuming, which can lead to delays in developing educational programs and a lack of standardization due to subjective evaluations. This study introduces an automated framework using Whisper and GPT-4o to enhance the efficiency and accuracy of evaluating communication abilities and language patterns in children with ASD. The research adopts a Research and Development (RnD) approach with the ASET model (Analyze, System Design, Execution, Testing), engaging children with mild and moderate verbal ASD and teachers from four autism schools in Daerah Istimewa Yogyakarta, Indonesia. Data were collected through interviews, classroom observations, audio recordings, and a matrix-based evaluation. Whisper was employed for automated transcription, integrated with GPT-4o for speaker diarization and communication analysis. Results showed an 89.1% reduction in analysis time compared to manual methods. Whisper achieved a low Word Error Rate (WER) for mild autism (average 5%) and a higher rate for moderate autism (average 23%). GPT-4o contributed to the process with high speaker diarization accuracy (93.9% for mild autism and 89.2% for moderate autism). The framework identified detailed communication improvements through the matrix-based evaluation, including verbal, pragmatic, semantic, sentence structure, and echolalia aspects. It provided insights previously undetected by teachers, such as specific developmental patterns in each aspect. The future research should integrate intonation and emotional analysis, refine diarization accuracy, and validate the approach across diverse populations.

References

Desideri, L., Pérez-Fuster, P., & Herrera, G. (2021). Information and communication technologies to support early screening of autism spectrum disorder: A systematic review. Children, 8(2). https://doi.org/10.3390/children8020093

Dwi Pratiwi, R., Dwi Pranata, A., Ayuningtyas, G., Azzahra, P., Program Studi, D. S., Widya Dharma Husada Tangerang, Stik., & Program Studi, M. S. (2023). DETERMINAN KEJADIAN ANAK AUTIS BASED ON SYSTEMATIC REVIEW. In Nursing Science Journal (NSJ) (Vol. 4, Issue 2).

Fuller, E. A., & Kaiser, A. P. (2020). The Effects of Early Intervention on Social Communication Outcomes for Children with Autism Spectrum Disorder: A Meta-analysis. Journal of Autism and Developmental Disorders, 50(5), 1683–1700. https://doi.org/10.1007/s10803-019-03927-z

Grabrucker, A. M. (2021). Autism Spectrum Disorders. Exon Publications, Brisbane, Australia. https://doi.org/https://doi.org/10.36255/exonpublications.autismspectrumdisorders.2021

Harrison, J. E., Weber, S., Jakob, R., & Chute, C. G. (2021). ICD-11: an international classification of diseases for the twenty-first century. In BMC Medical Informatics and Decision Making (Vol. 21). BioMed Central Ltd. https://doi.org/10.1186/s12911-021-01534-6

Hodis, B., Mughal, S., & Saadabadi, A. (2025). Autism Spectrum Disorder.

Kasari, C., Brady, N., Lord, C., & Tager-Flusberg, H. (2013). Assessing the minimally verbal school-aged child with autism spectrum disorder. In Autism Research (Vol. 6, Issue 6, pp. 479–493). https://doi.org/10.1002/aur.1334

Mukherjee, P., Gokul, R. S., Sadhukhan, S., Godse, M., & Chakraborty, B. (2023). Detection of Autism Spectrum Disorder (ASD) from Natural Language Text using BERT and ChatGPT Models. IJACSA) International Journal of Advanced Computer Science and Applications, 14(10). https://doi.org/10.14569/IJACSA.2023.0141041

Okoye, C., Obialo-Ibeawuchi, C. M., Obajeun, O. A., Sarwar, S., Tawfik, C., Waleed, M. S., Wasim, A. U., Mohamoud, I., Afolayan, A. Y., & Mbaezue, R. N. (2023). Early Diagnosis of Autism Spectrum Disorder: A Review and Analysis of the Risks and Benefits. Cureus. https://doi.org/10.7759/cureus.43226

O’Sullivan, J., Bogaarts, G., Kosek, M., Ullmann, R., Schoenenberger, P., Chatham, C., Nobbs, D., Murtagh, L., Lindemann, M., Parish-Morris, J., Liberman, M., Aponte, E., Dorn, J., & Lipsmeier, F. (2023). Automatic Speech Recognition for ASD Using the Open-Source Whisper Model from OpenAI. International Society for Autism Research.

Rahmawati, S. (2024). Optimalisasi Fokus: “Strategi Pembelajaran untuk Meningkatkan Konsentrasi pada Anak dengan Gangguan Spektrum Autisme (GSA).” In Jurnal Kependidikan (Vol. 13, Issue 2). https://jurnaldidaktika.org

Reddy Ananthula Akash, T. (2023). Comprehensive Review of Autism Spectrum Disorder: Etiology, Early Signs, and Diagnostic Assessment. International Journal of Science and Research (IJSR), 12(8), 480–485. https://doi.org/10.21275/sr23803085129

Richards, J. A., Xu, D., & Gilkerson, J. (2010). Development and Performance of the LENA Automatic Autism Screen.

Schaeffer, J., Abd El-Raziq, M., Castroviejo, E., Durrleman, S., Ferré, S., Grama, I., Hendriks, P., Kissine, M., Manenti, M., Marinis, T., Meir, N., Novogrodsky, R., Perovic, A., Panzeri, F., Silleresi, S., Sukenik, N., Vicente, A., Zebib, R., Prévost, P., & Tuller, L. (2023). Language in autism: domains, profiles and co-occurring conditions. Journal of Neural Transmission, 130(3), 433–457. https://doi.org/10.1007/s00702-023-02592-y

Talantseva, O. I., Romanova, R. S., Shurdova, E. M., Dolgorukova, T. A., Sologub, P. S., Titova, O. S., Kleeva, D. F., & Grigorenko, E. L. (2023). The global prevalence of autism spectrum disorder: A three-level meta-analysis. In Frontiers in Psychiatry (Vol. 14). Frontiers Media S.A. https://doi.org/10.3389/fpsyt.2023.1071181

Trembath, D., Paynter, J., Sutherland, R., & Tager-Flusberg, H. (2019). Assessing Communication in Children with Autism Spectrum Disorder Who Are Minimally Verbal. In Current Developmental Disorders Reports (Vol. 6, Issue 3, pp. 103–110). Springer. https://doi.org/10.1007/s40474-019-00171-z

Vogindroukas, I., Stankova, M., Chelas, E. N., & Proedrou, A. (2022). Language and Speech Characteristics in Autism. In Neuropsychiatric Disease and Treatment (Vol. 18, pp. 2367–2377). Dove Medical Press Ltd. https://doi.org/10.2147/NDT.S331987

World Health Organization. (2023). Autism.

Zeidan, J., Fombonne, E., Scorah, J., Ibrahim, A., Durkin, M. S., Saxena, S., Yusuf, A., Shih, A., & Elsabbagh, M. (2022). Global prevalence of autism: A systematic review update. In Autism Research (Vol. 15, Issue 5, pp. 778–790). John Wiley and Sons Inc. https://doi.org/10.1002/aur.2696

Downloads

Published

2025-04-29

How to Cite

Muna, N. F., & Setiawan, M. A. (2025). Automated Framework for Communication Development in Autism Spectrum Disorder Using Whisper ASR and GPT-4o LLM. JTP - Jurnal Teknologi Pendidikan, 27(1), 137–149. https://doi.org/10.21009/jtp.v27i1.54243