Thank you for visiting! I'm Long, a third-year CS Ph.D. student at the National University of Singapore (NUS), advised by Professors Min-Yen Kan, Kenji Kawaguchi, Nancy Chen, Shafiq Joty. I am also joining the Amazon Core Search team to intern as an Applied Scientist, and was previously a Student Researcher at the Google Cloud AI Research team. My research focuses on efficient adaptation of language and vision-language models:

  1. Prompt Analysis [NLPromptEval (ACL'25), FormatBias (NAACL'25)], Design [Multi-expert Prompting (EMNLP'24), Chain-of-Opinion (COLING'25)], and Optimization [adv-ICL (ACL'24)].
  2. Agents [VISTA (arxiv'25), Multi-expert Prompting (EMNLP'24)].
  3. Efficient Alignment [LongGuide (ACL'25), Probe sampling (NeurIPS'24)].
  4. Vision-Language [PromptChart (preprint'23), Unichart (EMNLP'23), OpenCQA (EMNLP'22), ChartQA (ACL'22)].

Previously, I received my B.Sc. in Mathematics (Statistics) and Computer Science at Nanyang Technological University, Singapore (NTU) advised by Prof. Shafiq Joty. At NTU, I won the SPMS Outstanding Undergraduate Award 2023 (Outstanding Achievement).

💥: I am open to opportunities for collaboration and new research ideas. If you are interested in working with me, please feel free to contact via my email.

News

  • [Oct 2025] VISTA has been featured across numerous platforms, including YouTube (e.g., 1, 2, 3), X (formerly Twitter; e.g., 1, 2, 3), Instagram (theartificialintelligence), Facebook, Reddit, and many others!
  • [Oct 2025] My intern work at Google has been released! Check out VISTA, a test-time self-improving video agent which improves Veo 3 by up to 60%!
  • [Aug 2025] I joined Amazon Core Search as an Applied Scientist intern!
  • [Jun 2025] Ngoc-Hai Nguyen has been admitted to Tufts University PhD programme in Computer Science and Nhat M. Hoang has been admitted to NTU PhD programme in Computer Science! Congratulation!
  • [May 2025] Do you know if you are polite to your LLMs, they can give you better outcomes? Check out NLPromptEval accepted by ACL 2025 Main Conference!
  • [May 2025] Do you know you LLMs can self-evolve to adapt to NLP generation tasks? Check out LongGuide accepted by Findings of ACL 2025!
  • [May 2025] I joined Google Cloud AI Research as a Student Researcher!
  • [Jan 2025] I have been awarded the Research Achievement Award by School of Computing, NUS!
  • [Jan 2025] Do you know your LLMs are biased towards output formats? Check out FormatBiasEval accepted by NAACL 2025 Main Conference!
  • [Dec 2024] I passed my Qualifying Exam (QE). I am now a PhD student -> PhD candidate!
  • [Dec 2024] Do you know Value-Belief-Norm Reasoning is indeed effective for LLMs? Check out Chain-of-Opinions accepted by COLING 2025!
  • [Sep 2024] Do you know simulating an LLM as multiple experts leading to more trustworthy outcomes? Check out Multi-Expert Prompting accepted by EMNLP 2024 Main Conference, and featured by elvis!
  • [May 2024] Do you know simulating an LLM as a Generative Adversarial Network can help you optimize your prompts? Check out adv-ICL accepted by ACL 2024 Main Conference!

Publications

(*) denotes equal contribution.

2025

19. VISTA: A Test-Time Self-Improving Video Generation Agent
Do Xuan Long, Xingchen Wan, Hootan Nakhost, Chen-Yu Lee, Tomas Pfister, Sercan Ö. Arık
arXiv preprint, 2025

18. What Makes a Good Natural Language Prompt?
Do Xuan Long, Duy Dinh, Ngoc-Hai Nguyen, Kenji Kawaguchi, Nancy F. Chen, Shafiq Joty, Min-Yen Kan
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)
Featured by multiple X (e.g., 1, 2, 3) accounts.

17. Beyond In-Context Learning: Aligning Long-form Generation of Large Language Models via Task-Inherent Attribute Guidelines
Do Xuan Long, Duong Ngoc Yen, Do Xuan Trong, Anh Tuan Luu, Kenji Kawaguchi, Shafiq Joty, Min-Yen Kan, Nancy F. Chen
Proceedings of Findings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025 Findings)

16. A Survey of Frontiers in LLM Reasoning: Inference Scaling, Learning to Reason, and Agentic Systems
Zixuan Ke, Fangkai Jiao, Yifei Ming, Xuan-Phi Nguyen, Austin Xu, Do Xuan Long, Minzhi Li, Chengwei Qin, Peifeng Wang, Silvio Savarese, Caiming Xiong, Shafiq Joty
Transactions on Machine Learning Research 2025 (TMLR, Survey Certification)

15. LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs
Do Xuan Long, Hai Nguyen Ngoc, Tiviatis Sim, Hieu Dao, Shafiq Joty, Kenji Kawaguchi, Nancy Chen, Min-Yen Kan
Proceedings of the 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL 2025)

14. Aligning Large Language Models with Human Opinions through Persona Selection and Value--Belief--Norm Reasoning
Do Xuan Long, Kenji Kawaguchi, Min-Yen Kan, Nancy F. Chen
Proceedings of the 31st International Conference on Computational Linguistics (COLING 2025)

2024

13. Multi-expert Prompting Improves Safety, Reliability, and Usefulness of Large Language Models
Do Xuan Long, Yen Duong, Anh Tuan Luu, Kenji Kawaguchi, Min-Yen Kan, Nancy Chen
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)
Featured by elvis X and other X, Reddit, Linkedin accounts.

12. Prompt Optimization via Adversarial In-Context Learning
Do Xuan Long*, Yiran Zhao*, Hannah Brown*, Yuxi Xie, James Xu Zhao, Nancy F. Chen, Kenji Kawaguchi, Michael Qizhe Xie, Junxian He
Proceedings of the 62nd Annual Meeting of Association for Computational Linguistics (ACL 2024, Oral)

11. Accelerating Greedy Coordinate Gradient and General Prompt Optimization via Probe Sampling
Yiran Zhao, Wenyue Zheng, Tianle Cai, Do Xuan Long, Kenji Kawaguchi, Anirudh Goyal, Michael Shieh
Proceedings of the Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024)

10. xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval
Mohammad Abdullah Matin Khan*, M Saiful Bari*, Xuan Long Do, Weishi Wang, Md Rizwan Parvez, Shafiq Joty
Proceedings of the 62nd Annual Meeting of Association for Computational Linguistics (ACL 2024)

9. ToXCL: A Unified Framework for Toxic Speech Detection and Explanation
Nhat M. Hoang*, Xuan Long Do*, Duc Anh Do, Duc Anh Vu, Anh Tuan Luu
Proceedings of the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024)

8. ChatGPT as a Math Questioner? Evaluating ChatGPT on Generating Pre-university Math Questions
Phuoc Pham Van Long*, Duc Anh Vu*, Nhat Minh Hoang*, Xuan Long Do*, Anh Tuan Luu
Proceedings of the 39th ACM/SIGAPP Symposium On Applied Computing, AI for Education Track (ACM/SIGAPP SAC 2024)

2022-2023

7. UniChart: A Universal Vision-language Pretrained Model for Chart Comprehension and Reasoning
Ahmed Masry*, Parsa Kavehzadeh*, Xuan Long Do, Enamul Hoque, Shafiq Joty
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

6. Retrieving Multimodal Information for Augmented Generation: A Survey
Ruochen Zhao, Hailin Chen, Weishi Wang, Fangkai Jiao, Xuan Long Do, Chengwei Qin, Bosheng Ding, Xiaobao Guo, Minzhi Li, Xingxuan Li, Shafiq Joty
Proceedings of Findings of 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023 Findings)

5. Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation
Xuan Long Do, Bowei Zou, Shafiq Joty, Anh Tai Tran, Liangming Pan, Nancy F. Chen, Ai Ti Aw
Proceedings of the 61st Annual Meeting of Association for Computational Linguistics (ACL 2023)

4. CoHS-CQG: Context and History Selection for Conversational Question Generation
Xuan Long Do, Bowei Zou, Liangming Pan, Nancy F. Chen, Shafiq Joty, Ai Ti Aw
Proceedings of the 29th International Conference on Computational Linguistics (COLING 2022)

3. OpenCQA: Open-ended Question Answering with Charts
Shankar Kantharaj, Xuan Long Do, Rixie Tiffany Leong, Jia Qing Tan, Enamul Hoque and Shafiq Joty
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022)

2. ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning
Ahmed Masry, Do Xuan Long, Jia Qing Tan, Shafiq Joty, Enamul Hoque
Proceedings of Findings of 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022 Findings)

1. A Deep Learning Platform for Language Education Research and Development
Kye Min Tan, Richeng Duan, Xin Huang, Bowei Zou, Xuan Long Do
Proceedings of 2022 Conference of the International Speech Communication Association (INTERSPEECH 2022)

Educations

  1. Ph.D. in Computer Science (Aug. 2023 - Present)
    National University of Singapore (NUS), 21 Lower Kent Ridge Rd, Singapore
    Advised by Min-Yen Kan, Kenji Kawaguchi, Nancy Chen, Shafiq Joty
  2. B.Sc. in Mathematical and Computer Sciences (Aug. 2019 - Jul. 2023)
    Nanyang Technological University, Singapore (NTU), 50 Nanyang Ave, Singapore
    Advised by Shafiq Joty, Double Major, Honours, Highest Distinction

Selected Research Experiences

  1. Applied Scientist Intern (Aug. 2025 - Present)
    Amazon Core Search, San Francisco Bay Area, California, United States
  2. Student Researcher (May 2025 - Jul. 2025)
    Google Cloud AI Research, San Francisco Bay Area, California, United States
    Hosted by Xingchen Wan and Sercan Ö. Arik
  3. NTU President Research Scholar (Jul. 2021 - Jul. 2023)
    Natural Language Processing group at NTU (NTU-NLP), Singapore
    Under FYP-URECA programme, advised by Prof. Shafiq Rayhan Joty
  4. Research Intern (Dec. 2021 - Jan. 2023)
    Institute for Infocomm Research, A*STAR, Singapore
    Advised by Bowei Zou and Nancy F. Chen
© 2025 Do Xuan Long. All rights reserved.