Automated Framework for Communication Development in Autism Spectrum Disorder Using Whisper ASR and GPT-4o LLM
DOI:
https://doi.org/10.21009/jtp.v27i1.54243Keywords:
Autism Spectrum Disorder, Communication Assessment, Artificial Intelligence, Whisper Automatic Speech Recognition, GPT Large Language ModelAbstract
Autism Spectrum Disorder (ASD) is a developmental condition impacting communication, social interaction, and behavior. Communication assessments for children with ASD are often conducted manually, making the process time-consuming, which can lead to delays in developing educational programs and a lack of standardization due to subjective evaluations. This study introduces an automated framework using Whisper and GPT-4o to enhance the efficiency and accuracy of evaluating communication abilities and language patterns in children with ASD. The research adopts a Research and Development (RnD) approach with the ASET model (Analyze, System Design, Execution, Testing), engaging children with mild and moderate verbal ASD and teachers from four autism schools in Daerah Istimewa Yogyakarta, Indonesia. Data were collected through interviews, classroom observations, audio recordings, and a matrix-based evaluation. Whisper was employed for automated transcription, integrated with GPT-4o for speaker diarization and communication analysis. Results showed an 89.1% reduction in analysis time compared to manual methods. Whisper achieved a low Word Error Rate (WER) for mild autism (average 5%) and a higher rate for moderate autism (average 23%). GPT-4o contributed to the process with high speaker diarization accuracy (93.9% for mild autism and 89.2% for moderate autism). The framework identified detailed communication improvements through the matrix-based evaluation, including verbal, pragmatic, semantic, sentence structure, and echolalia aspects. It provided insights previously undetected by teachers, such as specific developmental patterns in each aspect. The future research should integrate intonation and emotional analysis, refine diarization accuracy, and validate the approach across diverse populations.
References
Desideri, L., Pérez-Fuster, P., & Herrera, G. (2021). Information and communication technologies to support early screening of autism spectrum disorder: A systematic review. Children, 8(2). https://doi.org/10.3390/children8020093
Dwi Pratiwi, R., Dwi Pranata, A., Ayuningtyas, G., Azzahra, P., Program Studi, D. S., Widya Dharma Husada Tangerang, Stik., & Program Studi, M. S. (2023). DETERMINAN KEJADIAN ANAK AUTIS BASED ON SYSTEMATIC REVIEW. In Nursing Science Journal (NSJ) (Vol. 4, Issue 2).
Fuller, E. A., & Kaiser, A. P. (2020). The Effects of Early Intervention on Social Communication Outcomes for Children with Autism Spectrum Disorder: A Meta-analysis. Journal of Autism and Developmental Disorders, 50(5), 1683–1700. https://doi.org/10.1007/s10803-019-03927-z
Grabrucker, A. M. (2021). Autism Spectrum Disorders. Exon Publications, Brisbane, Australia. https://doi.org/https://doi.org/10.36255/exonpublications.autismspectrumdisorders.2021
Harrison, J. E., Weber, S., Jakob, R., & Chute, C. G. (2021). ICD-11: an international classification of diseases for the twenty-first century. In BMC Medical Informatics and Decision Making (Vol. 21). BioMed Central Ltd. https://doi.org/10.1186/s12911-021-01534-6
Hodis, B., Mughal, S., & Saadabadi, A. (2025). Autism Spectrum Disorder.
Kasari, C., Brady, N., Lord, C., & Tager-Flusberg, H. (2013). Assessing the minimally verbal school-aged child with autism spectrum disorder. In Autism Research (Vol. 6, Issue 6, pp. 479–493). https://doi.org/10.1002/aur.1334
Mukherjee, P., Gokul, R. S., Sadhukhan, S., Godse, M., & Chakraborty, B. (2023). Detection of Autism Spectrum Disorder (ASD) from Natural Language Text using BERT and ChatGPT Models. IJACSA) International Journal of Advanced Computer Science and Applications, 14(10). https://doi.org/10.14569/IJACSA.2023.0141041
Okoye, C., Obialo-Ibeawuchi, C. M., Obajeun, O. A., Sarwar, S., Tawfik, C., Waleed, M. S., Wasim, A. U., Mohamoud, I., Afolayan, A. Y., & Mbaezue, R. N. (2023). Early Diagnosis of Autism Spectrum Disorder: A Review and Analysis of the Risks and Benefits. Cureus. https://doi.org/10.7759/cureus.43226
O’Sullivan, J., Bogaarts, G., Kosek, M., Ullmann, R., Schoenenberger, P., Chatham, C., Nobbs, D., Murtagh, L., Lindemann, M., Parish-Morris, J., Liberman, M., Aponte, E., Dorn, J., & Lipsmeier, F. (2023). Automatic Speech Recognition for ASD Using the Open-Source Whisper Model from OpenAI. International Society for Autism Research.
Rahmawati, S. (2024). Optimalisasi Fokus: “Strategi Pembelajaran untuk Meningkatkan Konsentrasi pada Anak dengan Gangguan Spektrum Autisme (GSA).” In Jurnal Kependidikan (Vol. 13, Issue 2). https://jurnaldidaktika.org
Reddy Ananthula Akash, T. (2023). Comprehensive Review of Autism Spectrum Disorder: Etiology, Early Signs, and Diagnostic Assessment. International Journal of Science and Research (IJSR), 12(8), 480–485. https://doi.org/10.21275/sr23803085129
Richards, J. A., Xu, D., & Gilkerson, J. (2010). Development and Performance of the LENA Automatic Autism Screen.
Schaeffer, J., Abd El-Raziq, M., Castroviejo, E., Durrleman, S., Ferré, S., Grama, I., Hendriks, P., Kissine, M., Manenti, M., Marinis, T., Meir, N., Novogrodsky, R., Perovic, A., Panzeri, F., Silleresi, S., Sukenik, N., Vicente, A., Zebib, R., Prévost, P., & Tuller, L. (2023). Language in autism: domains, profiles and co-occurring conditions. Journal of Neural Transmission, 130(3), 433–457. https://doi.org/10.1007/s00702-023-02592-y
Talantseva, O. I., Romanova, R. S., Shurdova, E. M., Dolgorukova, T. A., Sologub, P. S., Titova, O. S., Kleeva, D. F., & Grigorenko, E. L. (2023). The global prevalence of autism spectrum disorder: A three-level meta-analysis. In Frontiers in Psychiatry (Vol. 14). Frontiers Media S.A. https://doi.org/10.3389/fpsyt.2023.1071181
Trembath, D., Paynter, J., Sutherland, R., & Tager-Flusberg, H. (2019). Assessing Communication in Children with Autism Spectrum Disorder Who Are Minimally Verbal. In Current Developmental Disorders Reports (Vol. 6, Issue 3, pp. 103–110). Springer. https://doi.org/10.1007/s40474-019-00171-z
Vogindroukas, I., Stankova, M., Chelas, E. N., & Proedrou, A. (2022). Language and Speech Characteristics in Autism. In Neuropsychiatric Disease and Treatment (Vol. 18, pp. 2367–2377). Dove Medical Press Ltd. https://doi.org/10.2147/NDT.S331987
World Health Organization. (2023). Autism.
Zeidan, J., Fombonne, E., Scorah, J., Ibrahim, A., Durkin, M. S., Saxena, S., Yusuf, A., Shih, A., & Elsabbagh, M. (2022). Global prevalence of autism: A systematic review update. In Autism Research (Vol. 15, Issue 5, pp. 778–790). John Wiley and Sons Inc. https://doi.org/10.1002/aur.2696
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Naela Fauzul Muna, Mukhammad Andri Setiawan

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Jurnal Teknologi Pendidikan is an Open Access Journal. The authors who publish the manuscript in Jurnal Teknologi Pendidikan agree to the following terms.
Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
-
Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
-
ShareAlike — If you remix, transform, or build upon the material, you must distribute your contributions under the same license as the original.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
Notices:
- You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation.
- No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.




