Xingdi Yuan · 袁 星柢
Senior Researcher · Microsoft Research, Montréal
I aim to develop machines that can read, write, and use language as a tool, in both static datasets and interactive worlds.
Current Focus
My recent work focuses on teaching coding AI to better understand codebases and to debug using tools.
Find our recent work in Agentic Coding on the Froggy team's page
.
Selected Publications
Machines that can read
-
NewsQA: A Machine Comprehension DatasetRepL4NLP Workshop, ACL 2017
-
Building Dynamic Knowledge Graphs from Text using Machine Reading ComprehensionICLR 2019
-
Machine Comprehension by Text-to-Text Neural Question GenerationRepL4NLP Workshop, ACL 2017
-
Natural Language Comprehension with the EpiReaderEMNLP 2016
Machines that can write
-
Llama See, Llama Do: A Mechanistic Perspective on Contextual Entrainment and Distraction in LLMs Outstanding PaperACL 2025
-
General-to-Specific Transfer Labeling for Domain Adaptable Keyphrase GenerationACL Findings 2023
-
An Empirical Study on Neural Keyphrase GenerationNAACL 2021
-
One Size Does Not Fit All: Generating and Evaluating Variable Number of KeyphrasesACL 2020
Machines that can use language as a tool
-
debug-gym: A Text-Based Environment for Interactive Debugging2025
-
BugPilot: Complex Bug Generation for Efficient Learning of SWE Skills2025
-
Gistify! Codebase-Level Understanding via Runtime Execution2025
-
Enhancing Agent Learning through World Dynamics ModelingEMNLP 2024 Findings
-
Policy Improvement using Language Feedback ModelsNeurIPS 2024
-
Language-guided Skill Learning with Temporal Variational InferenceICML 2024
-
Can Language Models Serve as Text-Based World Simulators?ACL 2024
-
ByteSized32: A Corpus and Challenge Task for Generating Task-Specific World Models Expressed as Text GamesEMNLP 2023
-
Deep Language Networks: Joint Prompt Training of Stacked LLMs using Variational Inference2023
-
Augmenting Autotelic Agents with Large Language ModelsCoLLAs 2023
-
One-Shot Learning from a Demonstration with Hierarchical Latent LanguageAAMAS 2023
-
Asking for Knowledge (AFK): Training RL Agents to Query External Knowledge Using LanguageICML 2022
-
Interactive Machine Comprehension with Dynamic Knowledge GraphsEMNLP 2021
-
ALFWorld: Aligning Text and Embodied Environments for Interactive LearningICLR 2021
-
Learning Dynamic Knowledge Graphs to Generalize on Text-Based GamesNeurIPS 2020
-
Interactive Fiction Games: A Colossal AdventureAAAI 2020
-
Interactive Machine Comprehension with Information Seeking AgentsACL 2020
-
Interactive Language Learning by Question AnsweringEMNLP 2019
-
TextWorld: A Learning Environment for Text-based GamesComputer Games Workshop, ICML/IJCAI 2018
Experience
-
2017 – Present
Microsoft Research, Montréal
Senior Researcher -
2015 – 2017
Maluuba
(Acquired by Microsoft)
Education
-
2015
M.S. Computer Science
New York University -
2011
B.S. Communications Engineering
Beijing University of Technology
北京工业大学
Service
- Area Chair: ICML, NeurIPS, ARR
- Action Editor: TACL
- Outstanding Reviewer: EMNLP 2020, NeurIPS 2021, ICLR 2022, ICML 2022
- Workshop Organizer: Wordplay Workshop (NeurIPS 2020, NAACL 2022, ACL 2024, EMNLP 2025), KBRL Workshop (IJCAI 2020)
Fun Stuff
asciiko (アス子)
A deep ascii art generator. Check out the YouTube demo or the open-sourced code.