Chen Zhang

About

I am currently an AI Researcher at the Huawei Singapore Search & Recommendation Lab. Previously, I was a research fellow at ECE, National University of Singapore (Sep 2023 – Oct 2024). My research interests include Agentic RAG, LLM Alignment, LLM Evaluation, and Dialogue Systems.

News

  • Mar 2025: Working on Pangu-Native DeepResearch Agent.
  • Nov 2024: Joined Search & Recommendation Lab, Huawei (Singapore).
  • Dec 2023: Area Chair of ACL ARR.
  • Sep 2023: Research Fellow at ECE, NUS.
  • Jun 2023: Ph.D. received from National University of Singapore.
  • Mar 2023: Organizing committee of EMNLP 2023.
  • Apr 2022: Workshop chair for DSTC11.
  • Apr 2021: Main organizer of DSTC10 Track 5.

Education

  • PhD, National University of Singapore (Jan 2019 – Jun 2023) — EDB NUS-Bosch IPP Scholarship
  • Double Degree, Business & Computer Engineering, Nanyang Technological University (Aug 2012 – Jun 2016)
  • GCE A Level, Victoria Junior College (2010 – 2011)
  • GCE O Level, Victoria School (2008 – 2009)

Selected Papers

2026

AudioRAG: A Challenging Benchmark for Audio Reasoning and Information Retrieval
Jingru Lin, Chen Zhang, Tianrui Wang, Haizhou Li
Audio-AAAI
SRR-Judge: Step-Level Rating and Refinement for Enhancing Search-Integrated Reasoning in Search Agents
Chen Zhang, Kuicai Dong, Dexun Li, Wenjun Li, Qu Yang, Wei Han, Yong Liu
arXiv
Doc-researcher: A unified system for multimodal document parsing and deep research
Kuicai Dong, Shurui Huang, Fangda Ye, Wei Han, Zhi Zhang, Dexun Li, Wenjun Li, Qu Yang, Gang Wang, Yichao Wang, Chen Zhang, Yong Liu
WWW-2026

2025

RAGCap-Bench: Benchmarking Capabilities of LLMs in Agentic Retrieval Augmented Generation Systems
Jingru Lin*, Chen Zhang*, Stephen Y. Liu, Haizhou Li
arXiv
DeepDiver: Adaptive Search Intensity Scaling via Open-Web Reinforcement Learning
Wenxuan Shi, Haochen Tan, Chuqiao Kuang, Xiaoguang Li, Xiaozhe Ren, Chen Zhang, Hanting Chen, Yasheng Wang, Lu Hou, Lifeng Shang
NeurIPS 2025
A Survey on Multi-Turn Interaction Capabilities of Large Language Models
Chen Zhang, Xinyi Dai, Yaxiong Wu, Qu Yang, Yasheng Wang, Ruiming Tang, Yong Liu
arXiv
Aligning Language Models Using Follow-up Likelihood as Reward Signal
Chen Zhang, Dading Chong, Feng Jiang, Chengguang Tang, Anningzhe Gao, Guohua Tang, Haizhou Li
AAAI 2025

2024

TS-Align: A Teacher-Student Collaborative Framework for Scalable Iterative Finetuning of Large Language Models
Chen Zhang, Chengguang Tang, Dading Chong, Ke Shi, Guohua Tang, Feng Jiang, Haizhou Li
Findings of EMNLP 2024
CrossTune: Black-Box Few-Shot Classification with Label Enhancement
Danqing Luo*, Chen Zhang*, Yan Zhang, Haizhou Li
LREC-COLING 2024
A Comprehensive Analysis of the Effectiveness of Large Language Models as Automatic Dialogue Evaluators
Chen Zhang, Luis Fernando D'Haro, Yiming Chen, Malu Zhang, Haizhou Li
AAAI 2024

2023

xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark
Chen Zhang, Luis Fernando D'Haro, Chengguang Tang, Ke Shi, Guohua Tang, Haizhou Li
Findings of EMNLP 2023
Self-Supervised Modeling for Open-Domain Dialogue Evaluation
Chen Zhang
Ph.D. Thesis
PoE: a Panel of Experts for Generalized Automatic Dialogue Assessment
Chen Zhang, Luis Fernando D'Haro, Qiquan Zhang, Thomas Friedrichs, Haizhou Li
IEEE/ACM TASLP

2022

FineD-Eval: Fine-grained Automatic Dialogue-Level Evaluation
Chen Zhang, Luis Fernando D'Haro, Qiquan Zhang, Thomas Friedrichs, Haizhou Li
EMNLP 2022
MDD-Eval: Self-Training on Augmented Data for Multi-Domain Dialogue Evaluation
Chen Zhang, Luis Fernando D'Haro, Thomas Friedrichs, Haizhou Li
AAAI 2022

2021

Automatic Evaluation and Moderation of Open-domain Dialogue Systems
Chen Zhang, João Sedoc, Luis Fernando D'Haro, Rafael Banchs, Alexander Rudnicky
DSTC10 Track 5
Investigating the Impact of Pre-trained Language Models on Dialog Evaluation
Chen Zhang, Luis Fernando D'Haro, Yiming Chen, Thomas Friedrichs, Haizhou Li
IWSDS 2021 (Best Paper)
DynaEval: Unifying Turn and Dialogue Level Evaluation
Chen Zhang, Yiming Chen, Luis Fernando D’Haro, Yan Zhang, Thomas Friedrichs, Grandee Lee, Haizhou Li
ACL 2021

2020

D-score: Holistic Dialogue Evaluation without Reference
Chen Zhang, Grandee Lee, Luis Fernando D’Haro, Haizhou Li
IEEE/ACM TASLP
Deep AM-FM: Toolkit for automatic dialogue evaluation
Chen Zhang, Luis Fernando D’Haro, Rafael E. Banchs, Thomas Friedrichs, Haizhou Li
Conversational Dialogue Systems for the Next Decade

Contact

The best way to reach me is by email: chen_zhang@u.nus.edu