About
Welcome! I’m Long (*), a CS Ph.D. student at National University of Singapore (NUS) and Agency for Science, Technology and Research (A*STAR). I am fortunate to be advised by Prof. Kenji Kawaguchi, Prof. Kan Min Yen, Dr. Nancy Chen to study Human-AI Alignment.
I received my B.Sc. in Mathematics (Statistics) + Computer Science at Nanyang Technological University, Singapore (NTU) where I was fortunate to be advised by Prof. Shafiq Joty. At NTU, I won the SPMS Outstanding Undergraduate Award 2023 (Outstanding Achievement).
(*) my name means Dragon in Vietnamese context.
Education
National University of Singapore (NUS)
Doctor of Philosophy (Ph.D.) in Computer Science, Aug. 2023 - presentNanyang Technological University, Singapore (NTU)
Bachelor of Science (B.Sc.) in Mathematical and Computer Sciences (Double major), Aug. 2019 - Jul. 2023
Grade: Honours (Highest Distinction)
Experiences
Natural Language Processing group at NTU (NTU-NLP)
Research Assistant under FYP-URECA programme at NTU, supervised by Prof. Shafiq Rayhan Joty, Jul. 2021 - presentHuawei, Noah’s Ark Lab
Research Intern, under the supervision of Dr. Liu Yong, Dec. 2022 - Jul. 2023- Nanyang Technological University, Singapore
Teaching Assistant at SPMS, Aug. 2022 - May. 2023- AY22-23 Sem 2: MH3500: Statistics, PS0002: Introduction to Data Science and Artificial Intelligence
- AY22-23 Sem 1: PS0001: Introduction to Computational Thinking
Institute for Infocomm Research, A*STAR, Singapore
NLP Research Intern, supervised by Dr. Bowei Zou and Dr. Nancy F. Chen, Dec. 2021 - Jan. 2023Nanyang Technological University, Singapore
Research Assistant at NAIL, under the supervision of Prof. Luu Anh Tuan, Aug. 2022 - Dec. 2022Eureka Robotics, Singapore
Computer Vision Engineer Intern under SGInnovate Summation Programme, supervised by Dr. Xu Zhang, May. 2021 - Aug. 2021Earth Observatory of Singapore
Research Assistant (CV), supervised by Dr. Christina WIDIWIJAYANTI, Aug. 2020 - May. 2021- Panasonic R&D Center, Singapore
Video Algorithm Research Intern, supervised by Senior Software Engineer Han Boon Teo, Jun. 2020 - Aug. 2020
Research
The topics I have been working on include:
- Large Language Models
- Vision-Language
- Question Answering & Generation
- Text Classification
- Code Understanding and Generation
- AI Applications (Education, Recommender Systems)
Preprints
(*) denotes equal contribution.
2. ChOiRe: Characterizing and Predicting Human Opinions with Chain of Opinion Reasoning
Xuan Long Do, Kenji Kawaguchi, Min-Yen Kan, Nancy F. Chen
arXiv preprint arXiv:2311.08385, [pdf]
1. xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval
Mohammad Abdullah Matin Khan*, M Saiful Bari*, Xuan Long Do, Weishi Wang, Md Rizwan Parvez, Shafiq Joty
arXiv preprint arXiv:2303.03004, [pdf]
Publications
(*) denotes equal contribution.
8. ChatGPT as a Math Questioner? Evaluating ChatGPT on Generating Pre-university Math Questions
Phuoc Pham Van Long*, Duc Anh Vu*, Nhat Minh Hoang*, Xuan Long Do*, Anh Tuan Luu
To appear at the 39th ACM/SIGAPP Symposium On Applied Computing, AI for Education Track (ACM/SIGAPP SAC 2024)
7. UniChart: A Universal Vision-language Pretrained Model for Chart Comprehension and Reasoning
Ahmed Masry*, Parsa Kavehzadeh*, Xuan Long Do, Enamul Hoque, Shafiq Joty
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), [pdf]
6. Retrieving Multimodal Information for Augmented Generation: A Survey
Ruochen Zhao, Hailin Chen, Weishi Wang, Fangkai Jiao, Xuan Long Do, Chengwei Qin, Bosheng Ding, Xiaobao Guo, Minzhi Li, Xingxuan Li, Shafiq Joty
Proceedings of Findings of 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), [pdf]
5. Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation
Xuan Long Do, Bowei Zou, Shafiq Joty, Anh Tai Tran, Liangming Pan, Nancy F. Chen, Ai Ti Aw
Proceedings of the 61st Annual Meeting of Association for Computational Lingustics (ACL 2023), [pdf]
4. CoHS-CQG: Context and History Selection for Conversational Question Generation
Xuan Long Do, Bowei Zou, Liangming Pan, Nancy F. Chen, Shafiq Joty, Ai Ti Aw
Proceedings of the 29th International Conference on Computational Lingustics (COLING 2022), [pdf]
3. OpenCQA: Open-ended Question Answering with Charts
Shankar Kantharaj, Xuan Long Do, Rixie Tiffany Leong, Jia Qing Tan, Enamul Hoque and Shafiq Joty
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022), [pdf]
2. ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning
Ahmed Masry, Do Xuan Long, Jia Qing Tan, Shafiq Joty, Enamul Hoque
Proceedings of Findings of 60th Annual Meeting of the Association for Computational Linguistics (Findings of ACL 2022), [pdf]
1. A Deep Learning Platform for Language Education Research and Development
Kye Min Tan, Richeng Duan, Xin Huang, Bowei Zou, Xuan Long Do
Proceedings of 2022 Conference of the International Speech Communication Association (INTERSPEECH 2022), [pdf]
Awards
- Undergraduate Awards
- SPMS Outstanding Undergraduate Award 2023 (Outstanding Achievement)
- A*STAR Computing and Information Science (ACIS) Scholarship, 2023-2027
- NTU President Research Scholar, 2022
- ACM-ICPC Jakarta Regional Contest 2021, 2022, team NTUDragons & WCRush, ICPCID
- Second Place Award, ISC 2021 Student Cluster Competition
- Second Prize, International Mathematics Competition for University Students 2020 (IMC)
- Dean’s List AY2019-2020, School of Computer Science and Engineering (SCSE), NTU
- High-school Awards
- Gold Medal, Iranian Geometry Olympiad 2018, Open Section (IGO)
- Honorable Prize, Vietnamese Mathematical Olympiads 2018 (VMO)
Services & Volunteers
- Reviewer:
- Journals: IEEE/ACM TASLP (2023, 2022)
- Conferences: ICLR (2024), ACL RR (Feb 2023, Dec 2022), COLING (2022)
- Student Volunteer Award: ACL (2023, 2022), EMNLP (2022)