About
Thanks for visiting! I’m Long (*), a first-year CS Ph.D. student at National University of Singapore (NUS). I am advised by Prof. Kenji Kawaguchi, Prof. Kan Min Yen, Dr. Nancy Chen (A*STAR) to study Human-AI Alignment.
I received my B.Sc. in Mathematics (Statistics) + Computer Science at Nanyang Technological University, Singapore (NTU) advised by Prof. Shafiq Joty. At NTU, I won the SPMS Outstanding Undergraduate Award 2023 (Outstanding Achievement).
My works are mostly published at top-tier NLP/ML conferences such as ACL, EMNLP, NAACL.
(*) my name means Dragon in Vietnamese context.
Education
National University of Singapore (NUS), Doctor of Philosophy (Ph.D.) in Computer Science (Highest Distinction), Aug. 2023 - present
Nanyang Technological University, Singapore (NTU), Bachelor of Science (B.Sc.) in Mathematical and Computer Sciences (Double major) (Honours, Highest Distinction), Aug. 2019 - Jul. 2023
Research Interests
- Large Language Models
- Prompt Design & Optimization
- Human-AI Alignment
Preprints
(*) denotes equal contribution.
2. ChOiRe: Characterizing and Predicting Human Opinions with Chain of Opinion Reasoning
Xuan Long Do, Kenji Kawaguchi, Min-Yen Kan, Nancy F. Chen; arXiv preprint arXiv:2311.08385, [pdf].
1. Accelerating Greedy Coordinate Gradient via Probe Sampling
Yiran Zhao, Wenyue Zheng, Tianle Cai, Xuan Long Do, Kenji Kawaguchi, Anirudh Goyal, Michael Shieh; arXiv preprint arXiv:2403.01251, [pdf].
Publications
(*) denotes equal contribution.
11. Prompt Optimization via Adversarial In-Context Learning
Xuan Long Do*, Yiran Zhao*, Hannah Brown*, Yuxi Xie, James Xu Zhao, Nancy F. Chen, Kenji Kawaguchi, Michael Qizhe Xie, Junxian He; Proceedings of the 62nd Annual Meeting of Association for Computational Lingustics (ACL 2024, Oral), [pdf].
10. xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval
Mohammad Abdullah Matin Khan*, M Saiful Bari*, Xuan Long Do, Weishi Wang, Md Rizwan Parvez, Shafiq Joty; Proceedings of the 62nd Annual Meeting of Association for Computational Lingustics (ACL 2024), [pdf].
9. ToXCL: A Unified Framework for Toxic Speech Detection and Explanation
Nhat M. Hoang*, Xuan Long Do*, Duc Anh Do, Duc Anh Vu, Anh Tuan Luu; Proceedings of the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024).
8. ChatGPT as a Math Questioner? Evaluating ChatGPT on Generating Pre-university Math Questions
Phuoc Pham Van Long*, Duc Anh Vu*, Nhat Minh Hoang*, Xuan Long Do*, Anh Tuan Luu; Proceedings of the 39th ACM/SIGAPP Symposium On Applied Computing, AI for Education Track (ACM/SIGAPP SAC 2024), [pdf].
7. UniChart: A Universal Vision-language Pretrained Model for Chart Comprehension and Reasoning
Ahmed Masry*, Parsa Kavehzadeh*, Xuan Long Do, Enamul Hoque, Shafiq Joty; Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), [pdf].
6. Retrieving Multimodal Information for Augmented Generation: A Survey
Ruochen Zhao, Hailin Chen, Weishi Wang, Fangkai Jiao, Xuan Long Do, Chengwei Qin, Bosheng Ding, Xiaobao Guo, Minzhi Li, Xingxuan Li, Shafiq Joty; Proceedings of Findings of 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), [pdf].
5. Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation
Xuan Long Do, Bowei Zou, Shafiq Joty, Anh Tai Tran, Liangming Pan, Nancy F. Chen, Ai Ti Aw; Proceedings of the 61st Annual Meeting of Association for Computational Lingustics (ACL 2023), [pdf].
4. CoHS-CQG: Context and History Selection for Conversational Question Generation
Xuan Long Do, Bowei Zou, Liangming Pan, Nancy F. Chen, Shafiq Joty, Ai Ti Aw; Proceedings of the 29th International Conference on Computational Lingustics (COLING 2022), [pdf].
3. OpenCQA: Open-ended Question Answering with Charts
Shankar Kantharaj, Xuan Long Do, Rixie Tiffany Leong, Jia Qing Tan, Enamul Hoque and Shafiq Joty; Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022), [pdf].
2. ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning
Ahmed Masry, Do Xuan Long, Jia Qing Tan, Shafiq Joty, Enamul Hoque; Proceedings of Findings of 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), [pdf].
1. A Deep Learning Platform for Language Education Research and Development
Kye Min Tan, Richeng Duan, Xin Huang, Bowei Zou, Xuan Long Do; Proceedings of 2022 Conference of the International Speech Communication Association (INTERSPEECH 2022), [pdf].
Research Experiences
- Natural Language Processing group at NTU (NTU-NLP), NTU President Research Scholar under FYP-URECA programme at NTU, advised by Prof. Shafiq Rayhan Joty, Jul. 2021 - Jul. 2023
Institute for Infocomm Research, A*STAR, Singapore, NLP Research Intern, advised by Dr. Bowei Zou and Dr. Nancy F. Chen, Dec. 2021 - Jan. 2023
Nanyang Technological University, Singapore, Research Assistant at NAIL, advised by Prof. Luu Anh Tuan, Aug. 2022 - Dec. 2022
Eureka Robotics, Singapore, Computer Vision Intern under SGInnovate Summation Programme, supervised by Dr. Xu Zhang, May. 2021 - Aug. 2021
Earth Observatory of Singapore, Research Assistant, advised by Dr. Christina WIDIWIJAYANTI, Aug. 2020 - May. 2021
Panasonic R&D Center, Singapore, Video Algorithm Research Intern, advised by Han Boon Teo, Jun. 2020 - Aug. 2020
Teaching
- National University of Singapore, Teaching Assistant (SoC, NUS), Jan. 2024 - present
- AY23-24 Sem 2: CS3244 Machine Learning.
- Nanyang Technological University, Singapore, Teaching Assistant (SPMS, NTU), Aug. 2022 - May. 2023
- AY22-23 Sem 2: MH3500: Statistics, PS0002: Introduction to Data Science and Artificial Intelligence.
- AY22-23 Sem 1: PS0001: Introduction to Computational Thinking.
Awards
- Undergraduate Awards
- SPMS Outstanding Undergraduate Award 2023 (Outstanding Achievement)
- A*STAR Computing and Information Science (ACIS) Scholarship, 2023-2027
- NTU President Research Scholar, 2022
- ACM-ICPC Jakarta Regional Contest 2021, 2022, team NTUDragons & WCRush, ICPCID
- Second Place Award, ISC 2021 Student Cluster Competition
- Second Prize, International Mathematics Competition for University Students 2020 (IMC)
- Dean’s List AY2019-2020, School of Computer Science and Engineering (SCSE), NTU
- High-school Awards
- Gold Medal, Iranian Geometry Olympiad 2018, Open Section (IGO)
- Honorable Prize, Vietnamese Mathematical Olympiads 2018 (VMO)
Services & Volunteers
- Professional Membership:
- Association for Computational Linguistics (ACL) Member, May 2021 - present
- Association for Computing Machinery (ACM) Member Dec. 2023 - present
- Program Committee/Reviewer:
- Journals: IEEE/ACM TASLP (2023, 2022).
- Conferences: ICLR (2024), EMNLP (2024), ACL RR (2024, 2023, 2022), COLING (2022).
- Student Volunteer Award:
- ACL (2024, 2023, 2022), EMNLP (2022).