I am a Senior Applied Scientist at Microsoft, where I develop AI agents for enterprises.
Prior to joining Microsoft, I was a Postdoctoral Research Fellow of the Center for Human-Compatible Artificial Intelligence (CHAI) at the University of CaliforniaâBerkeley, where I was fortunate to be mentored by Prof. Stuart Russell, who co-wrote the best-selling Introduction textbook on AI and invented the mathematical foundations for human-AI alignment. Before that, I was a postdoc at the Princeton NLP group working with Prof. Karthik Narasimhan, a pioneer in AI agents. I obtained my PhD at the University of MarylandâCollege Park, advised by the great Hal DaumĂ© III.
My research statement summarizes my research accomplishments and vision. At a high level, I create artificial agents that have the communication skills and incentives to assist humans. Specifically, I explore the following questions:
- How to enable AI agents to learn from natural human feedback (listening skill): My [EMNLPâ17] paper demonstrated for the first time the feasibility of using only noisy, complete-output ratings to improve the performance of a neural text generator. This work was followed by studies that used real human ratings at eBay and OpenAI, ultimately leading to the development of InstructGPT that popularized RLHF.
More recently, I have been developing frameworks for learning from language feedback with theoretical guarantees [ICMLâ21, ACLâ24WS]. - How to identify and share with humans what AI agents know and do not know (speaking skill): I was an early explorer of calibration analysis for NLP models [EMNLPâ15â] and pioneered the development of robots that ask for help [CVPRâ19, EMNLPâ19, ICMLâ22]. Lately, I develop models that guide human navigation with language, improving their pragmatic reasoning capability [ACLâ23] and making them useful even when they generate inaccurate instructions [EMNLPâ24].
- How to drive AI agents toward efficient and beneficial communicative behavior (incentive): I create agents that learn with progressive efficiency [NeurIPSâ23WS], i.e. the more you talk to them, the less effort it will take to teach them. In an ongoing work, I characterize the limitations of the popular RLHF approach and propose a new alignment framework that emphasizes alignment with not only with the human principal but also with reality.
Some personal facts:
- My real name is Nguyá»
n XuĂąn KhĂĄnh đą. My first name (KhĂĄnh) means âjoyâ or âhappinessâ. Please do not confuse it with
KhanorKahn:( - I was born in Viá»t Nam :vietnam:, a peaceful country (click here for inspiration to visit us).
- I am also proud to be a PTNK (Phá» ThĂŽng NÄng Khiáșżu) alumnus.