| CARVIEW |
Patrick Amadeus Irawan
About Me
-
I am a PhD student at MBZUAI π€, where I focus on multimodal NLP topics, particularly vision-language interaction. I also work on vision-language alignment, unified (any) multimodal models, and large-scale evaluations. I am advised by Alham Fikri Aji and Yova Kementchedjhieva.
-
Previously, I was a Research Engineer at SMU πΈπ¬ working at the intersection of multilingual and multimodal interpretation under Chong-Wah Ngo. Before that, I earned my bachelorβs degree in CS from Institut Teknologi Bandung, where I worked under Ayu Purwarianti on explainable multimodal synthetic data generation.
-
In addition to research, I also do AI engineering for various use cases. You can see my other experiences here.
-
Further, I plan to specialize my studies in developing methods to better align different modalities, with the goal of mitigating modality imbalance, especially in the avenue of unified multimodal models.
Research Interests
During my studies and prior experience, I have often worked on topics including, but not limited to, the following:
- Multimodal Imbalance: I believe that imbalanced learning is a significant bottleneck that prevents us from obtaining reliable multimodal models, as modality shortcuts and biases can harm both performance and the objectivity of evaluation. My work focuses on discovering its root causes and exploring methods to better align models to prevent such issues.
- LLM/VLM Alignment: I also work on both architectural and non-architectural adaptations (knowledge enrichment, data reformulation, RL) to address above issues and/or improve multimodal language modeling in general.
- Large-Scale Evaluations: I often question model robustness in scenarios with varying resource levels; however, probing this requires designing both broad and specific evaluation coverage. My work in this area aims to design benchmarks that assess the inclusivity of multimodal models, specifically by addressing concept underrepresentation through targeted data curation in multilingual and multicultural domains.
Updates
- [Nov. 2025] Our study exposing the confusion of VLMs in cultural-conflict visual scenario is up on arXiv!
- [Dec. 2025] M4-RAG is out on arXiv! We present an evaluation of how multimodal knowledge enrichment helps model in tackling multilingual query. Spoiler, it does not always helpβ¦ π€―
- [Oct. 2025] Entropy2Vec got accepted into MRL Workshop @ EMNLP 2025 ππ¨π³!
- [July 2025] Seeing Culture Benchmark is accepted to EMNLP 2025 π¨π³! On to the next one with SMU Multimedia team πͺ
- [May. 2025] DataRubrics is now on arXiv! We propose a unified scorecard to evaluate data quality on multi-faceted metrics.
- [Apr. 2025] WorldCuisines receives Best Theme Paper award at NAACL 2025! πππ½οΈ
- [Mar. 2025] Admitted to the Fall 2025 cohort of the MBZUAI PhD program in NLP! π
- [Jan. 2025] WorldCuisines and ProxyLM are accepted to NAACL 2025 πΊπΈ ποΈ
- [Nov. 2024] My first first-author paper, a VL synthetic data generation framework, is accepted to COLING 2025 π
- [Oct. 2024] WorldCuisines, the largest multicultural VL food benchmark, is released. Honored to co-lead the project π₯
- [Sep. 2024] SEACrowd is accepted to EMNLP 2024! πΊπΈ
Publications
2025
-
Patrick Amadeus Irawan, Ikhlasul Akmal Hanif, Muhammad Dehan Al Kautsar, Genta Indra Winata, Fajri Koto, Alham Fikri Aji -
David Anugraha, Patrick Amadeus Irawan, Anshul Singh, En-Shiun Annie Lee, Genta Indra Winata -
EMNLP
Burak Satar, Zhixin Ma, Patrick Amadeus Irawan, Wilfried A. Mulyawan, Jing Jiang, Ee-Peng Lim, Chong-Wah NgoConference on Empirical Methods in Natural Language Processing (EMNLP), 2025. -
MRL @ EMNLP
Entropy2Vec: Crosslingual Language Modeling Entropy as End-to-End Learnable Language RepresentationsPatrick Amadeus Irawan, Ryandito Diandaru, Belati Jagad Bintang Syuhada, Randy Zakya Suchrady, Alham Fikri Aji, Genta Indra Winata, Fajri Koto, Samuel CahyawijayaMultilingual Representation Learning (MRL) Workshop @ EMNLP 2025PDF Poster -
NAACL
Genta Indra Winata, Frederikus Hudi, Patrick Amadeus Irawan, David Anugraha, Rifki Afina Putri, Yutong Wang, Adam Nohejl, Ubaidillah Ariq Prathama, Nedjma Ousidhoum, and othersNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025. -
NAACL
David Anugraha, Genta Indra Winata, Chenyue Li, Patrick Amadeus Irawan, En-Shiun Annie LeeNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025. -
COLING
Patrick Amadeus Irawan, Genta Indra Winata, Samuel Cahyawijaya, Ayu PurwariantiInternational Conference on Computational Linguistics (COLING), 2025. -
Genta Indra Winata, David Anugraha, Emmy Liu, Alham Fikri Aji, Shou-Yi Hung, Aditya Parashar, Patrick Amadeus Irawan, and others
2024
-
EMNLP
Holy Lovenia, Rahmad Mahendra, Salsabil Maulana Akbar, Lester James V Miranda, Jennifer Santoso, Elyanah Aco, ..., Patrick Amadeus Irawan, and othersConference on Empirical Methods in Natural Language Processing (EMNLP), 2024. -
APSIPA ASC
Nana Sutisna, Aditya Prawira Nugroho, Christopher Jeffrey, Patrick Amadeus Irawan, Rizky Ramadhana, Ronggur Mahendra, Michael Jonathan, Infall Syafalni, Trio Adiono2024 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)PDF Main (Oral)
Misc.
Experience
- Research Engineer @ Singapore Management University (2025 - Now)
- Research Engineer Intern @ ai&you (2024 -2024)
- Data Scientist Intern @ Supertype (2023 - 2023)
- Software Engineer Intern @ Blibli (2022 - 2022)
- Software Engineer Intern @ Ruangguru (2022 - 2022)