About

Welcome! I’m Long (*), a first-year CS Ph.D. student at National University of Singapore (NUS) and Agency for Science, Technology and Research (A*STAR). I am advised by Prof. Kenji Kawaguchi, Prof. Kan Min Yen, Dr. Nancy Chen to study Human-AI Alignment.

I received my B.Sc. in Mathematics (Statistics) + Computer Science at Nanyang Technological University, Singapore (NTU) advised by Prof. Shafiq Joty. At NTU, I won the SPMS Outstanding Undergraduate Award 2023 (Outstanding Achievement).

My works are mostly published at top-tier NLP/ML conferences such as ACL, EMNLP, NAACL.

(*) my name means Dragon in Vietnamese context.

Education

Research Interests

  • Large Language Models
  • Prompt Design & Optimization
  • Human-AI Alignment

Preprints

(*) denotes equal contribution.

4. Prompt Optimization via Adversarial In-Context Learning

Xuan Long Do*, Yiran Zhao*, Hannah Brown*, Yuxi Xie, James Xu Zhao, Nancy F. Chen, Kenji Kawaguchi, Michael Qizhe Xie, Junxian He; arXiv preprint arXiv:2312.02614, [pdf].

3. ChOiRe: Characterizing and Predicting Human Opinions with Chain of Opinion Reasoning

Xuan Long Do, Kenji Kawaguchi, Min-Yen Kan, Nancy F. Chen; arXiv preprint arXiv:2311.08385, [pdf].

2. Do LLMs Work on Charts? Designing Few-Shot Prompts for Chart Question Answering and Summarization

Xuan Long Do, Mohammad Hassanpour, Ahmed Masry, Parsa Kavehzadeh, Enamul Hoque, Shafiq Joty; arXiv preprint arXiv:2312.10610, [pdf].

1. xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval

Mohammad Abdullah Matin Khan*, M Saiful Bari*, Xuan Long Do, Weishi Wang, Md Rizwan Parvez, Shafiq Joty; arXiv preprint arXiv:2303.03004, [pdf].

Publications

(*) denotes equal contribution.

9. ToXCL: A Unified Framework for Toxic Speech Detection and Explanation

Nhat M. Hoang*, Xuan Long Do*, Duc Anh Do, Duc Anh Vu, Anh Tuan Luu; Proceedings of the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024).

8. ChatGPT as a Math Questioner? Evaluating ChatGPT on Generating Pre-university Math Questions

Phuoc Pham Van Long*, Duc Anh Vu*, Nhat Minh Hoang*, Xuan Long Do*, Anh Tuan Luu; Proceedings of the 39th ACM/SIGAPP Symposium On Applied Computing, AI for Education Track (ACM/SIGAPP SAC 2024), [pdf].

7. UniChart: A Universal Vision-language Pretrained Model for Chart Comprehension and Reasoning

Ahmed Masry*, Parsa Kavehzadeh*, Xuan Long Do, Enamul Hoque, Shafiq Joty; Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), [pdf].

6. Retrieving Multimodal Information for Augmented Generation: A Survey

Ruochen Zhao, Hailin Chen, Weishi Wang, Fangkai Jiao, Xuan Long Do, Chengwei Qin, Bosheng Ding, Xiaobao Guo, Minzhi Li, Xingxuan Li, Shafiq Joty; Proceedings of Findings of 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), [pdf].

5. Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation

Xuan Long Do, Bowei Zou, Shafiq Joty, Anh Tai Tran, Liangming Pan, Nancy F. Chen, Ai Ti Aw; Proceedings of the 61st Annual Meeting of Association for Computational Lingustics (ACL 2023), [pdf].

4. CoHS-CQG: Context and History Selection for Conversational Question Generation

Xuan Long Do, Bowei Zou, Liangming Pan, Nancy F. Chen, Shafiq Joty, Ai Ti Aw; Proceedings of the 29th International Conference on Computational Lingustics (COLING 2022), [pdf].

3. OpenCQA: Open-ended Question Answering with Charts

Shankar Kantharaj, Xuan Long Do, Rixie Tiffany Leong, Jia Qing Tan, Enamul Hoque and Shafiq Joty; Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022), [pdf].

2. ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning

Ahmed Masry, Do Xuan Long, Jia Qing Tan, Shafiq Joty, Enamul Hoque; Proceedings of Findings of 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), [pdf].

1. A Deep Learning Platform for Language Education Research and Development

Kye Min Tan, Richeng Duan, Xin Huang, Bowei Zou, Xuan Long Do; Proceedings of 2022 Conference of the International Speech Communication Association (INTERSPEECH 2022), [pdf].

Research Experiences

Teaching

  • National University of Singapore, Teaching Assistant (SoC, NUS), Jan. 2024 - present
    • AY23-24 Sem 2: CS3244 Machine Learning.
  • Nanyang Technological University, Singapore, Teaching Assistant (SPMS, NTU), Aug. 2022 - May. 2023
    • AY22-23 Sem 2: MH3500: Statistics, PS0002: Introduction to Data Science and Artificial Intelligence.
    • AY22-23 Sem 1: PS0001: Introduction to Computational Thinking.

Awards

  • Undergraduate Awards
    • SPMS Outstanding Undergraduate Award 2023 (Outstanding Achievement)
    • A*STAR Computing and Information Science (ACIS) Scholarship, 2023-2027
    • NTU President Research Scholar, 2022
    • ACM-ICPC Jakarta Regional Contest 2021, 2022, team NTUDragons & WCRush, ICPCID
    • Second Place Award, ISC 2021 Student Cluster Competition
    • Second Prize, International Mathematics Competition for University Students 2020 (IMC)
    • Dean’s List AY2019-2020, School of Computer Science and Engineering (SCSE), NTU
  • High-school Awards
    • Gold Medal, Iranian Geometry Olympiad 2018, Open Section (IGO)
    • Honorable Prize, Vietnamese Mathematical Olympiads 2018 (VMO)

Services & Volunteers

  • Professional Membership:
    • Association for Computational Linguistics (ACL) Member, May 2021 - present
    • Association for Computing Machinery (ACM) Member Dec. 2023 - present
  • Program Committee/Reviewer:
    • Journals: IEEE/ACM TASLP (2023, 2022).
    • Conferences: ICLR (2024), ACL RR (Feb 2023, Dec 2022), COLING (2022).
  • Student Volunteer Award:
    • ACL (2023, 2022), EMNLP (2022).

Media Coverage