Huaxiu Yao's Personal Website

About me
Short Bio. I am an Assistant Professor in the Department of Computer Science and the School of Data Science and Society at the University of North Carolina at Chapel Hill. My post-training research studies adaptive intelligence through alignment, interaction, and learning, and I am affiliated with the UNC NLP group. I was a Postdoctoral Scholar at Stanford University hosted by Chelsea Finn. I received my Ph.D. degree in 2021 from Pennsylvania State University, advised by Zhenhui (Jessie) Li. During my Ph.D., I also spent time visiting CMU, hosted by Eric P. Xing.

Lab Openings:
- We will recruit 3 Ph.D. students for Fall 2027 and multiple interns or visiting students all year. Please read THIS for detailed recruitment information.

Research Interests. My research focuses on building AI systems that perceive the world and continuously learn and optimize. Currently, my primary endeavors revolve around the following key directions:
Self-evolving AI agents that improve autonomously through closed-loop training and deployment.
Autoresearch agents that translate open-ended objectives into self-driven optimization.
Trustworthy evaluation of foundation models and agents, moving from benchmark accuracy to real-world actionability.
You can follow me on Twitter at @HuaxiuYaoML or 小红书 at Huaxiu Yao.
News and Travel
[2026-2027 Service] Senior Area Chair in EMNLP 2026; Area Chair in ICML 2026, NeurIPS 2026, ICLR 2026, AISTATS 2026; Action Editor: TMLR
[2026.05] Four papers were accepted by ACL 2026 (two main track, two findings), and four papers were accepted by ICML 2026
[2026.01] Six papers were accepted by ICLR 2026, two papers were accepted by MLSys 2026 and ICRA 2026, respectively.
[2025.09] Four papers were accepted by EMNLP 2025 (two main track, two findings)
[2025.07] Two papers were accepted by COLM 2025
[2025.05] Three papers were accepted by ICML 2025, four papers were accepted by ACL 2025 (two main track, two findings)
[2025.01] Six papers were accepted by ICLR 2025, two papers were accepted by findings of NAACL 2025, and one paper was accepted by ICRA
[2024.09] Five papers were accepted by NeurIPS 2024 (three main track, two D&B track), and one paper was accepted by EMNLP 2024
Honors & Awards
Provost’s AI Acceleration Program Fellowship Award
ICLR 2026 RSI Workshop Outstanding Paper Award
ICLR 2026 MemAgents Best Paper Award Runner-Up
Cisco Faculty Research Award, 2026
Amazon Research Awards, 2025
UNC Junior Faculty Development Award, 2025
PharmAlliance Early Career Researcher Award, 2025
KDD Health Day Distinguished Vision Award, 2025
TMLR Outstanding Paper Award, 2024
KDD Best Paper Award, 2024
Cisco Faculty Research Award, 2024
National AI Research Resource Pilot Award, 2024
Creativity Hubs Seed-funding Winner, 2024
NC TraCS Pilot Award, 2024
AAAI New Faculty Highlights, 2024
College of IST Ph.D. Award for Research Excellence, 2020
Selected Recent Publications
Please see the complete list in Google Scholar.
Underlined (co-)first authors are students mentored by me; †: equal advising

Foundation Model Algorithms & Evaluation
[1] Jiaqi Liu*, Shi Qiu*, ..., Huaxiu Yao, AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration, arXiv 2605.20025.
[AI Agent]
[2] Jiaqi Liu, Xinyu Ye, Peng Xia, Zeyu Zheng, Cihang Xie, Mingyu Ding, Huaxiu Yao, EvolveMem: Self-Evolving Memory Architecture via AutoResearch for LLM Agents, arXiv 2605.13941.
[AI Agent]
[3] Peng Xia*, Jianwen Chen*, Xinyu Yang*, Haoqin Tu*, Jiaqi Liu*, Kaiwen Xiong*, Siwei Han, Shi Qiu, Haonian Ji, Yuyin Zhou, Zeyu Zheng, Cihang Xie, Huaxiu Yao*, MetaClaw: Just Talk – An Agent That Meta-Learns and Evolves in the Wild, arXiv 2603.17187.
[AI Agent] [Transfer Learning]
[4] Peng Xia*, Jianwen Chen*, Hanyang Wang*, Jiaqi Liu, Kaide Zeng, Yu Wang, Siwei Han, Yiyang Zhou, Xujiang Zhao, Haifeng Chen, Zeyu Zheng, Cihang Xie, Huaxiu Yao, SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning, arXiv 2602.08234. ICLR 2026 MemAgents Best Paper Award Runner-Up
[AI Agent] [Transfer Learning]
[5] Jiaqi Liu, Zipeng Ling, Shi Qiu, Yanqing Liu, Siwei Han, Peng Xia, Haoqin Tu, Zeyu Zheng, Cihang Xie, Charles Fleming, Mingyu Ding, Huaxiu Yao, Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory, in Proceedings of the Forty-third International Conference on Machine Learning (ICML 2026), Seoul, South Korea, Jul 2026.
[AI Agent]
[6] Jiaqi Liu*, Yaofeng Su*, Peng Xia, Siwei Han, Zeyu Zheng, Cihang Xie, Mingyu Ding, Huaxiu Yao, SimpleMem: Efficient Lifelong Memory for LLM Agents, arXiv 2601.02553.
[AI Agent]
[7] Peng Xia, Kaide Zeng, Jiaqi Liu, Can Qin, Fang Wu, Yiyang Zhou, Caiming Xiong, Huaxiu Yao, Agent0: Unleashing self-evolving agents from zero data via tool-integrated reasoning, arXiv 2511.16043. ICLR 2026 RSI Workshop Outstanding Paper Award
[AI Agent]
[8] Zhaoyang Wang, Canwen Xu, Boyi Liu, Yite Wang, Siwei Han, Zhewei Yao, Huaxiu Yao†, Yuxiong He†, Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning, in Proceedings of the Forty-third International Conference on Machine Learning (ICML 2026), Seoul, South Korea, Jul 2026.
[AI Agent]
[9] Siwei Han, Kaiwen Xiong, Jiaqi Liu, Xinyu Ye, Yaofeng Su, Wenbo Duan, Xinyuan Liu, Cihang Xie, Mohit Bansal, Mingyu Ding, Linjun Zhang, Huaxiu Yao, Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails, arXiv 2510.04860.
[AI Agent]
[10] Yiyang Zhou*, Haoqin Tu*, Zijun Wang, Zeyu Wang, Niklas Muennighoff, Fan Nie, Yejin Choi, James Zou, Chaorui Deng, Shen Yan, Haoqi Fan, Cihang Xie, Huaxiu Yao†, Qinghao Ye†, When visualizing is the first step to reasoning: Mira, a benchmark for visual chain-of-thought, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2026 (CVPR 2026), Denver, CO, Jun 2026.
[Multimodal] [Benchmark & Evaluation]
[11] Zijian Zhang*, Kaiyuan Zheng*, Zhaorun Chen*, Joel Jang, Yi Li, Siwei Han, Chaoqi Wang, Mingyu Ding, Dieter Fox, Huaxiu Yao, Grape: Generalizing robot policy via preference alignment, in Proceedings of the 2026 IEEE International Conference on Robotics and Automation (ICRA 2026), Vienna, Austria, Jun 2026.
[Multimodal]
[12] Xinyu Geng*, Peng Xia*, Zhen Zhang, Xinyu Wang, Qiuchen Wang, Ruixue Ding, Chenxi Wang, Jialong Wu, Yida Zhao, Kuan Li, Yong Jiang, Pengjun Xie, Fei Huang, Huaxiu Yao, Yi R. Feng, Jingren Zhou, Webwatcher: Breaking new frontier of vision-language deep research agent, in Proceedings of the 14th International Conference on Learning Representations (ICLR 2026), Rio de Janeiro, Brazil, Apr 2026.
[AI Agent] [Multimodal]
[13] Yiyang Zhou*, Yangfan He*, Yaofeng Su, Siwei Han, Joel Jang, Gedas Bertasius, Mohit Bansal, Huaxiu Yao, ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding, in Proceedings of the Thirty-Ninth Conference on Neural Information Processing Systems (NeurIPS 2025), San Diego, CA, Dec 2025.
[AI Agent] [Multimodal]
[14] Zhaorun Chen*, Yichao Du*, Zichen Wen*, Yiyang Zhou*, Chenhang Cui, Zhenzhen Weng, Haoqin Tu, Chaoqi Wang, Zhengwei Tong, Qinglan Huang, Canyu Chen, Qinghao Ye, Zhihong Zhu, Yuqing Zhang, Jiawei Zhou, Zhuokai Zhao, Rafael Rafailov, Chelsea Finn, Huaxiu Yao, MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?, in Proceedings of the Thirty-Ninth Conference on Neural Information Processing Systems Track on Datasets & Benchmarks (NeurIPS 2025), San Diego, CA, Dec 2025.
[Multimodal] [Benchmark & Evaluation]
[15] Yiyang Zhou*, Zhiyuan Fan*, Dongjie Cheng*, Sihan Yang, Zhaorun Chen, Chenhang Cui, Xiyao Wang, Yun Li, Linjun Zhang, Huaxiu Yao, Calibrated Self-Rewarding Vision Language Models, in Proceedings of the Thirty-Eighth Conference on Neural Information Processing Systems (NeurIPS 2024), Vancouver, Canada, Dec 2024.
[Multimodal]
[16] Yiyang Zhou*, Chenhang Cui*, Rafael Rafailov, Chelsea Finn, Huaxiu Yao, Aligning modalities in vision large language models via preference fine-tuning, arXiv:2402.11411.
[Multimodal]
[17] Yiyang Zhou*, Chenhang Cui*, Jaehong Yoon, Linjun Zhang, Zhun Deng, Chelsea Finn, Mohit Bansal, Huaxiu Yao, Analyzing and Mitigating Object Hallucination in Large Vision-Language Models, in Proceedings of the 12th International Conference on Learning Representations (ICLR 2024), Vienna, Austria, May 2024.
[Multimodal]

[18] Percy Liang, Rishi Bommasani, Tony Lee, [and 47 others, including Huaxiu Yao], Holistic Evaluation of Language Models, Transactions on Machine Learning Research (TMLR, Featured), 2023. Outstanding Paper Award
[Benchmark & Evaluation]
Foundation Model Applications
[1] Jianwen Chen*, Xinyu Yang*, Peng Xia, Arian Azarang, Yueh Z Lee, Gang Li, Hongtu Zhu, Yun Li, Beidi Chen, Huaxiu Yao, MedVerse: Efficient and Reliable Medical Reasoning via DAG-Structured Parallel Execution, in Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), San Diego, CA, Jul 2026.
[FM for Health]
[2] Siwei Han, Peng Xia, Ruiyi Zhang, Tong Sun, Yun Li, Hongtu Zhu, Huaxiu Yao, MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding, arXiv 2503.13964.
[FM for Document Processing]
[3] Peng Xia*, Jinglu Wang*, Yibo Peng*, Kaide Zeng, Xian Wu, Xiangru Tang, Hongtu Zhu, Yun Li, Shujie Liu, Yan Lu†, Huaxiu Yao†, MMedAgent-RL: Optimizing Multi-Agent Collaboration for Multimodal Medical Reasoning, in Proceedings of the 14th International Conference on Learning Representations (ICLR 2026), Rio de Janeiro, Brazil, Apr 2026.
[FM for Health]
[4] Haonian Ji*, Shi Qiu*, Siyang Xin*, Siwei Han*, Zhaorun Chen, Hongyi Wang, Dake Zhang, Huaxiu Yao, From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Reasoning-Driven Pedagogical Visualization, in Proceedings of the 14th International Conference on Learning Representations (ICLR 2026), Rio de Janeiro, Brazil, Apr 2026.
[FM for Education]
[5] Kangyu Zhu*, Peng Xia*, Yun Li, Hongtu Zhu, Sheng Wang, Huaxiu Yao, MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization, in Proceedings of the Forty-second International Conference on Machine Learning (ICML 2025), Vancouver, Canada, Jul 2025. [arXiv] [Code]
[FM for Health]
[6] Peng Xia, Kangyu Zhu, Haoran Li, Tianze Wang, Weijia Shi, Sheng Wang, Linjun Zhang, James Zou, Huaxiu Yao, MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models, in Proceedings of the 13th International Conference on Learning Representations (ICLR 2025), Singapore, Apr 2025. [arXiv] [Code]
[FM for Health]
[7] Peng Xia, Ze Chen, Juanxi Tian, Yangrui Gong, Ruibo Hou, Yue Xu, Zhenbang Wu, Zhiyuan Fan, Yiyang Zhou, Kangyu Zhu, Wenhao Zheng, Zhaoyang Wang, Xiao Wang, Xuchao Zhang, Chetan Bansal, Marc Niethammer, Junzhou Huang, Hongtu Zhu, Yun Li, Jimeng Sun, Zongyuan Ge, Gang Li, James Zou, Huaxiu Yao, CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models, in Proceedings of the Thirty-Eighth Conference on Neural Information Processing Systems Track on Datasets & Benchmarks (NeurIPS 2024), Vancouver, Canada, Dec 2024.
[FM for Health]
[8] Peng Xia*, Kangyu Zhu*, Haoran Li, Hongtu Zhu, Yun Li, Gang Li, Linjun Zhang, Huaxiu Yao, RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models, in Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024), Miami, Nov. 2024.
[FM for Health]
Teaching
Lecture
DATA 523: Modeling and Data Mining For Artificial Intelligence, Spring 2026
CS 790-183: Transfer Learning, UNC-CH, Fall 2025
DATA 140: Introduction to Data Structures and Management, UNC-CH, Spring 2025
CS 590/790-183: Transfer Learning, UNC-CH, Spring 2024
CS 790-150: Reliable Machine Learning, UNC-CH, Fall 2023, Fall 2024
CS 330: Deep Multi-Task and Meta Learning (Domain Generalization), Stanford University, Fall 2022
Tutorial
Learning with Small Data. (KDD 2020 [Website] [Slides] [YouTube] [Bilibili]) (WSDM 2020 [Website]) (AAAI 2021)
Meta-learning and Automated Machine Learning: Approaches and Applications. (IJCAI 2020)
Service
Conference Area Chair
International Conference on Machine Learning (ICML), 2024
Conference on Neural Information Processing Systems (NeurIPS), 2024; D&B Track (2022 - 2024)
International Conference on Learning Representations (ICLR), 2025
International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Empirical Methods in Natural Language Processing (EMNLP), 2024
International Conference on Automated Machine Learning (AutoML-Conf), 2022 - 2024
Learning on Graphs Conference (LoG), 2022 - 2024
Workshop Organizer
Organizer, Workshop on Foundation Models in the Wild, ICML 2024 (Co-organizers: Xinyu Yang, Bilge Acun, Kamalika Chaudhuri, Beidi Chen, Giulia Fanti, Junlin Han, Lianhui Qin, Shengbang Tong, Philip Torr, Hao Wang, Cathy Wu, James Zou)
Lead Organizer, Workshop on Reliable and Responsible Foundation Models, ICLR 2024 (Co-organizers: Mohit Bansal, Zhun Deng, Chelsea Finn, Pavel Izmailov, He He, Pang Wei Koh, Eric Mitchell, Cihang Xie) [Website]
Lead Organizer, Sixth Workshop on Meta-Learning, NeurIPS 2022 (Co-organizers: Fábio Ferreira, Frank Hutter, Joaquin Vanschoren, Eleni Triantafillou, Qi Lei) [Website]
Lead Organizer, the First Workshop on Pre-training: Perspectives, Pitfalls, and Paths Forward, ICML 2022 (Co-organizers: Hugo Larochelle, Colin Raffel, Percy Liang, Jian Tang, Ying Wei, Saining Xie, Eric P. Xing, Chelsea Finn) [Website]
Organizer, Fifth Workshop on Meta-Learning, NeurIPS 2021 (Co-organizers: Fábio Ferreira, Erin Grant, Frank Hutter, Jonathan Schwarz, Joaquin Vanschoren) [Website]
Contact
Office: 254, Sitterson Hall, Chapel Hill, NC 27599

huaxiu@cs.unc.edu
LinkedIn
GitHub
HuaxiuYaoML
Huaxiu_Yao66

Huaxiu Yao (姚骅修)

Assistant Professor

Department of Computer Science | School of Data Science and Society

University of North Carolina at Chapel Hill

Email | Google Scholar | Twitter | LinkedIn | WeChat | 小红书

