I am currently a second-year student at The University of Hong Kong pursuing a Ph.D. in Computer Science, advised by Prof. Reynold Cheng. I am a member of HKU BIRD Team focusing on AI for Databases and text-to-SQL.
Before that, I received my M.S. from Institute of Information Engineering, Chinese Academy of Sciences, advised by Prof. Hongbo Xu.
I earned my B.S. in Automation from Beijing Institute of Technology, supervised by Prof.Yuan Li.
During my undergraduate study, I had an opportunity to participate in a summer research program on data mining at the Illinois Institute of Technology, mentored by Dr. Zelenberg.
Additionally, I gained valuable industry and research institute experience through internships at Microsoft, Baidu Inc., and Beijing Academy of Artificial Intelligence (BAAI), etc.
In addition, I feel grateful and honored to collaborate on multiple research projects with
Google
Cloud on text-to-SQL.
My research interests focus on Large Language Model with structured data, e.g. Database, Knowledge Graph (KG), etc..
BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation via Lens of Dynamic Interactions
Nan Huo‡, Xiaohan Xu‡, Jinyang Li‡, Per Jacobsson, Shipei Lin, Bowen Qin, Binyuan Hui, Xiaolong Li, Ge Qu, Shuzheng Si, Linheng Han, Edward Alexander, Xintong Zhu, Rui Qin, Ruihan Yu, Yiyao Jin, Feige Zhou, Weihao Zhong, Yun Chen, Hongyu Liu, Chenhao Ma, Fatma Ozcan, Yannis Papakonstantinou, Reynold Cheng
Arxiv 2025
SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications
Jinyang Li, Xiaolong Li, Ge Qu, Per Jacobsson, Bowen Qin, Binyuan Hui, Shuzheng Si, Nan Huo, Xiaohan Xu, Yue Zhang, Ziwei Tang, Yuanshuai Li, Florensia Widjaja, Xintong Zhu, Feige Zhou, Yongfeng Huang, Yannis Papakonstantinou, Fatma Ozcan, Chenhao Ma, Reynold Cheng
NeurIPS 2025
A survey on knowledge distillation of large language models
Xiaohan Xu, Ming Li, Chongyang Tao, Tao Shen, Reynold Cheng, Jinyang Li, Can Xu, Dacheng Tao, Tianyi Zhou
Arxiv 2024
Leveraging large language models for nlg evaluation: A survey
Zhen Li‡, Xiaohan Xu‡, Tao Shen, Can Xu, Jia-Chen Gu, Chongyang Tao
EMNLP 2024
Re-reading improves reasoning in language models
Xiaohan Xu, Chongyang Tao, Tao Shen, Can Xu, Hongbo Xu, Guodong Long, Jian-guang Lou, Shuai Ma
EMNLP 2024
Subgraph Neighboring Relations Infomax for Inductive Link Prediction on Knowledge Graphs
Xiaohan Xu, Peng Zhang, Yongquan He, Chengpeng Chao, Chaoyang Yan
IJCAI'22 Long Presentation
BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation via Lens of Dynamic Interactions
Nan Huo‡, Xiaohan Xu‡, Jinyang Li‡, Per Jacobsson, Shipei Lin, Bowen Qin, Binyuan Hui, Xiaolong Li, Ge Qu, Shuzheng Si, Linheng Han, Edward Alexander, Xintong Zhu, Rui Qin, Ruihan Yu, Yiyao Jin, Feige Zhou, Weihao Zhong, Yun Chen, Hongyu Liu, Chenhao Ma, Fatma Ozcan, Yannis Papakonstantinou, Reynold Cheng
Arxiv 2025
SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications
Jinyang Li, Xiaolong Li, Ge Qu, Per Jacobsson, Bowen Qin, Binyuan Hui, Shuzheng Si, Nan Huo, Xiaohan Xu, Yue Zhang, Ziwei Tang, Yuanshuai Li, Florensia Widjaja, Xintong Zhu, Feige Zhou, Yongfeng Huang, Yannis Papakonstantinou, Fatma Ozcan, Chenhao Ma, Reynold Cheng
NeurIPS 2025
A survey on knowledge distillation of large language models
Xiaohan Xu, Ming Li, Chongyang Tao, Tao Shen, Reynold Cheng, Jinyang Li, Can Xu, Dacheng Tao, Tianyi Zhou
Arxiv 2024
Leveraging large language models for nlg evaluation: A survey
Zhen Li‡, Xiaohan Xu‡, Tao Shen, Can Xu, Jia-Chen Gu, Chongyang Tao
EMNLP 2024
Re-reading improves reasoning in language models
Xiaohan Xu, Chongyang Tao, Tao Shen, Can Xu, Hongbo Xu, Guodong Long, Jian-guang Lou, Shuai Ma
EMNLP 2024
MMIDR: Teaching Large Language Model to Interpret Multimodal Misinformation via Knowledge Distillation
Longzheng Wang‡, Xiaohan Xu‡, Lei Zhang, Jiarui Lu, Yongxiu Xu, Hongbo Xu, Chuang Zhang
Arxiv 2024
Subgraph Neighboring Relations Infomax for Inductive Link Prediction on Knowledge Graphs
Xiaohan Xu, Peng Zhang, Yongquan He, Chengpeng Chao, Chaoyang Yan
IJCAI'22 Long Presentation
Cross-modal Contrastive Learning for Multimodal Fake News Detection
Longzheng Wang, Chuang Zhang, Hongbo Xu, Yongxiu Xu, Xiaohan Xu, Siqi Wang
ACMMM 2023
PoKE: Prior Knowledge Enhanced Emotional Support Conversation with Latent Variable
Xiaohan Xu, Xuying Meng, Yequan Wang
Preprint 2023
Full Resume in PDF.