Ran Xu

Research Scientist, Google DeepMind

2000 N Shoreline Blvd

Mountain View, CA 94043

My name is Ran Xu. I’m a research scientist at Google DeepMind.

My research centers on LLM agents and post-training. In particular, I am interested in enabling language models to effectively use external tools, including search [COLM ‘25; NeurIPS ‘25], code [EMNLP ‘24a; ICLR ‘26a; ICLR ‘26b] as external tools. More broadly, I also work on general LLM post-training over different stages (SFT, DPO, RL) [NAACL ‘25; arxiv ‘26a; arXiv ‘26b]. Overall, my ultimate goal is to build more capable yet secure language models that can better reason, act, and interact with the world.

Before joining Google, I obtained my PhD degree at Department of Computer Science at Emory University in 2026, co-advised by Prof. Carl Yang and Prof. Joyce C. Ho. Prior to that, I obtained my bachelor’s degree (with Highest Honors) also from the Department of Computer Science, Emory University in 2021.


Educations

Emory University (2021 - 2026)
Ph.D. in Computational Science and Informatics
GPA: 3.98/4.00
Research Focus: Large Language Models, Retrieval-augmented Generation, Agents, Data Synthesis with applications in healthcare.
Advisor: Prof. Carl Yang & Prof. Joyce Ho

Emory University (2017 - 2021)
B.S. in Computer Science, Double Major in Applied Mathematics
GPA: 3.97/4.00
Research Focus: Natural Language Processing.
Advisor: Prof. Jinho Choi

Emory University logo

Industrial Experience

Google DeepMind (March 2026 - Present)
Research Scientist

Google DeepMind logo
Search Intelligence, Google DeepMind (Jun 2025 - Nov 2025)
Research Intern
Topic: Agentic Judge Training via Tool-Augmented RL [ICLR 2026].
Mentors: Jingjing Chen, Jiayu Ye, Yu Wu, Manager: Hongkun Yu.

Google DeepMind logo
AI Lab, Tencent America (Feb 2025 - May 2025)
Artificial General Intelligence Research Intern
Topic: Retrieval-augmented GUI Agents with Skill Generation [EMNLP 2025 Main Conference].
Mentors: Kaixin Ma, Wenhao Yu, Hongming Zhang, Manager: Dong Yu.

Tencent logo
Query Understanding Team, Amazon (May 2024 - Oct 2024)
Applied Scientist Intern
Topic: LLM Self-training for Retrieval-augmented Generation [NAACL 2025 Main Conference].
Mentor: Hui Liu, Manager: Qi He.

Amazon logo
Meta Platforms, Inc. (May 2020 - Aug 2020)
Enterprise Engineer Intern
Mentor: Zexi Zhang

Meta logo

News

Mar 9, 2026 I finished my PhD study and joined Google DeepMind as a research scientist.
Jan 26, 2026 Two papers on LLM Agents are accepted to ICLR 2026 with one as Oral (top 1.1%).
Sep 18, 2025 Our paper AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play is accepted to NeurIPS 2025 as Spotlight (top 3.2%). See you in San Diego!
Aug 20, 2025 Our paper on improving GUI Agents with tutorials is accepted to EMNLP 2025 Main Conference.
Sep 20, 2024 Three papers on LLMs for Text Retrieval, LLM Agents for Complex Tabular Reasoning and LLM Test-time Adaptation are accepted to EMNLP 2024.

Selected Publications

  1. MedAgentGym: A Scalable Agentic Training Environment for Code-Centric Reasoning in Biomedical Data Science
    Ran Xu*, Yuchen Zhuang*, Yishan Zhong, Yue Yu, Zifeng Wang, Xiangru Tang, Hang Wu, May Dongmei Wang, Peifeng Ruan, Donghan Yang, Tao Wang, Guanghua Xiao, Xin Liu, Carl Yang, Yang Xie, and Wenqi Shi
    Proceedings of ICLR, 2026. (Oral)
  2. Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning
    Ran Xu, Jingjing Chen, Jiayu Ye, Yu Wu, Jun Yan, Carl Yang, and Hongkun Yu
    Proceedings of ICLR, 2026.
  3. AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play
    Ran Xu, Yuchen Zhuang, Zihan Dong, Jonathan Wang, Yue Yu, Joyce C. Ho, Linjun Zhang, Haoyu Wang, Wenqi Shi, and Carl Yang
    Proceedings of NeurIPS, 2025. (Spotlight)
  4. SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains
    Ran Xu, Hui Liu, Sreyashi Nag, Zhenwei Dai, Yaochen Xie, Xianfeng Tang, Chen Luo, Yang Li, Joyce C. Ho, Carl Yang, and Qi He
    Proceedings of NAACL, 2025.
  5. Counterfactual and Factual Reasoning over Hypergraphs for Interpretable Clinical Predictions on EHR
    Ran Xu, Yue Yu, Chao Zhang, Mohammed K Ali, Joyce C Ho, and Carl Yang
    Proceedings of ML4H, 2022. (Best Paper Award)