My research aims to create AI agents that can reliably serve and collaborate with humans. To accomplish goal, I focus on enhancing human-AI communication, building agents that can and are willing to communicate effectively with humans.
I am currently pushing forward three directions:
Learning from human feedback: I conducted the first simulated study on using reinforcement learning to train text generators with noisy human feedback (RLHF) [EMNLP’17]. Since then, I don’t think teaching agents with rewards is a good idea because it is a terrible way of communication. I am developing frameworks that enable learning from rich, abstract language [ICML’21, ArXiv’23].
Learning to ask questions: It is a mistake to think that only humans should ask AI for help and not the reverse. By asking a question, an agent can: (i) express its uncertainties (not just uncertainty), and (ii) obtain information to expand its capabilities. So more safety and more utility! I have authored a series of papers to disseminate this message [EMNLP’15’, CVPR’19, EMNLP’19, ICML’22].
Modeling humans and the world: I show that vanilla language models implement a very primitive “model of thought” [ToM@ICML’23]. To become more reliable, they need to develop robust models of the world and the humans in it. I recently focus on improving this capbility for instruction-generation models [ACL’23].
More facts:
My real name is Nguyễn Xuân Khánh . My first name is usually confused with Khan or Kahn :(
I was born in Việt Nam , a peaceful country (click here for inspiration to visit us).
I am also proud to be a PTNK (Phổ Thông Năng Khiếu) alumnus.
New paper on task-oriented cognitive capabilities. TLDR; we found and improved the deficiency in the pragmatic capability of instruction generation models. Received outstanding paper award at the ToM workshop at ICML 2023.
Aug 17, 2022
I will be organizing InterNLP workshop at NeurIPS 2022. Please submit your papers if interested!
@inproceedings{nguyen2017banditnmt,title={Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback},author={Nguyen, Khanh and Daum{\'e} III, Hal and Boyd-Graber, Jordan},booktitle={EMNLP},year={2017},address={Copenhagen, Denmark},publisher={Association for Computational Linguistics},url={https://www.aclweb.org/anthology/D17-1153},doi={10.18653/v1/D17-1153},pages={1464--1474},summary={<ul><li>First simulated study on training text generators with reinforcement learning from noisy human feedback (RLHF)</li><li>Later upgrades of this approach are powering large language models</li></ul>}}
Posterior calibration and exploratory analysis for natural language processing models
Khanh Nguyen, and Brendan O’Connor
EMNLP, 2015
First paper on calibration for structured prediction
Inspires subsequent studies on calibration of neural networks, out-of-distribution detection methods, calibration theories, etc.
@inproceedings{nguyen15calibration,title={Posterior calibration and exploratory analysis for natural language processing models},author={Nguyen, Khanh and O{'}Connor, Brendan},booktitle={EMNLP},year={2015},address={Lisbon, Portugal},publisher={Association for Computational Linguistics},url={https://www.aclweb.org/anthology/D15-1182},doi={10.18653/v1/D15-1182},pages={1587--1598},summary={<ul><li>First paper on calibration for structured prediction</li><li>Inspires subsequent studies on calibration of neural networks, out-of-distribution detection methods, calibration theories, etc.</li></ul>}}
Interactive Learning from Activity Description
Khanh Nguyen, Dipendra Misra, Robert Schapire, Miro Dudı́k, and Patrick Shafto
ICML, 2021
One of the first frameworks for learning from language feedback with theoretical guanrantees
@inproceedings{nguyen2021iliad,title={Interactive Learning from Activity Description},author={Nguyen, Khanh and Misra, Dipendra and Schapire, Robert and Dud{\'\i}k, Miro and Shafto, Patrick},booktitle={ICML},year={2021},summary={<ul><li>One of the first frameworks for learning from language feedback with theoretical guanrantees</li></ul>}}
Help, Anna! Visual Navigation with Natural Multimodal Assistance via Retrospective Curiosity-Encouraging Imitation Learning
Khanh Nguyen, and Hal Daumé III
EMNLP, 2019
First paper that introduces the task of vision-language navigation with human assistance.
Evaluates collaborative capability rather than autonomous capability
@inproceedings{nguyen2019hanna,author={Nguyen, Khanh and Daum{\'e} III, Hal},title={Help, Anna! Visual Navigation with Natural Multimodal Assistance via Retrospective Curiosity-Encouraging Imitation Learning},booktitle={EMNLP},month={},year={2019},summary={<ul><li>First paper that introduces the task of vision-language navigation with human assistance.</li><li>Evaluates collaborative capability rather than autonomous capability</li>}}