Muhammad Dehan Al Kautsar

I am a Research Engineer at MBZUAI working on multilinguality and dialogue in NLP. I like tweaking and tinkering with tokenization and representations to understand how LLMs handle underrepresented languages. I am also interested in language code-mixing and code-switching in NLP. My goal is to make those models more inclusive, especially for languages across the Global South.

I also enjoy playing piano and football in my spare time. If you’re interested in collaborating or discussing the research (or anything), feel free to get in touch!

NLP: Multilinguality in LLM NLP: Dialogue System Data Science
Dehan
Easter Egg🥚

Yep... this whole site is rocking a Red Velvet–themed vibe🍰.

The 🟡🟣 accents and the 🐻 (bear) + 🦄 (unicorn) icons are tiny tributes to Seulgi and Yeri, Dehan’s faves.

That’s it. Just a fun small detail :)

Education
M.Sc. in Informatics Institut Teknologi Bandung
2023 - 2024
Bandung, Indonesia
Final GPA: 3.96 / 4.00
Thesis: “End-to-end Fused Dialogue System in Open-Source Large Language Model”.
Supervised by Ayu Purwarianti, Samuel Cahyawijaya, and Genta Indra Winata.
B.Sc. in Informatics Institut Teknologi Bandung
2019 – 2023
Bandung, Indonesia
Final GPA: 3.93 / 4.00
Final Task: “End-to-end Task-oritented Dialogue System in Indonesia”.
Supervised by Ayu Purwarianti, Samuel Cahyawijaya, and Genta Indra Winata.
Working Experiences
Nov 2024 - Present
Abu Dhabi, UAE
Dept: Natural Language Processing
AI Engineer Intern GLAIR
Nov 2022 - Aug 2023
Jakarta, Indonesia
Dept: Computer Vision
AI Engineer Intern Prosa.ai
May - Oct 2022
Bandung, Indonesia
Dept: Natural Language Processing
Academic & Laboratory Assistant Institut Teknologi Bandung
2021 – 2024
Bandung, Indonesia
Courses: Natural Language Processing, Programming Fundamentals, Introduction to Computation.
Publications
Selected papers & preprints (chronological order).
Vision Language Models are Confused Tourists
Patrick Amadeus Irawan, Ikhlasul Akmal Hanif, Muhammad Dehan Al Kautsar, Genta Indra Winata, Fajri Koto, Alham Fikri Aji. (2025).
Preprint — arXiv:2511.17004
SEADialogues: A Multilingual Culturally Grounded Multi-turn Dialogue Dataset on Southeast Asian Languages
Muhammad Dehan Al Kautsar, Aswin Candra, Muhammad Alif Al Hakim, Maxalmina Satria Kahfi, Fajri Koto, Alham Fikri Aji, Peerat Limkonchotiwat, Ekapol Chuangsuwanich, Genta Indra Winata. (2025).
Preprint — arXiv:2508.07069
Role-Aware Language Models for Secure and Contextualized Access Control in Organizations
Saeed Almheiri, Yerulan Kongrat, Adrian Santosh, Ruslan Tasmukhanov, Josemaria Loza Vera, Muhammad Dehan Al Kautsar, Fajri Koto. (2025).
In: AACL-IJCNLP 2025 Main
Evaluating Vision-Language and Large Language Models for Automated Student Assessment in Indonesian Classrooms
Nurul Aisyah, Muhammad Dehan Al Kautsar, Arif Hidayat, Raqib Chowdhury, Fajri Koto. (2025).
Preprint — arXiv:2506.04822
Simulating Training Data Leakage in Multiple-Choice Benchmarks for LLM Evaluation
Naila Shafirni Hidayat, Muhammad Dehan Al Kautsar, Alfan Wicaksono, Fajri Koto. (2025).
In: Eval4NLP 2025 Workshop (Co-located with AACL-IJCNLP 2025)
News
Recent updates & highlights.
Nov 2023
Our paper titled 'IndoToD: A multi-domain Indonesian benchmark for end-to-end task-oriented dialogue systems' is accepted and selected as the Best Paper on SEALP 2023 Workshop, co-located with AACL-IJCNLP 2023 in Bali, Indonesia.🏆
Oct 2023
I went to Tokyo, Japan to become a delegate of Institut Teknologi Bandung in a technology and cultural exchange program hosted by The University of Electro-Communications (UEC).🌸
Nov 2021
Became a finalist of Pusat Prestasi Nasional GEMASTIK - Smart City Division. We developed 'Virtual Hospital', an IoT-based technology to monitor the patient virtually because of the impact of COVID-19.
Contact
You can reach me via email or follow me on social media.
Location
Abu Dhabi, United Arab Emirates
Profiles