Homepage - > Kaishuai Xu

Kaishuai Xu (许凯帅)

I am a last-year PhD student in PolyU NLP Group at The Hong Kong Polytechnic University, fortunately advised by Prof. Maggie, Wenjie Li.

Before that, I received my MSc and BMgt degrees from Huazhong University of Science and Technology.

Recently, I am interested in LLM-based reasoning and Medical AI. The specific topics include:

Math Reasoning—subtle errors in reasoning and optimization strategies with jumping-start initialization;
LLM Evaluation—robust multi-faceted evaluation and diverse test-scaling approaches;
Clinician-like Medical Dialogue Systems—structured clinical communication, comprehensive differential diagnosis, and prototypical clinical reasoning processes;

🎙️ An invited talk I gave at Stanford MedAI is available on Youtube.

🔥 I am available on the job market and actively looking for industry opportunities!

News

2025

🔥 One paper has been accepted by

NeurIPS 2025.

Sep 20

🔥 One paper has been accepted by EMNLP 2025 Findings.

Aug 20

One paper has been accepted by

npj Artificial Intelligence.

May 17

Three papers have been accepted by

ACL 2025.

May 15

One paper has been accepted by

ICLR 2025.

Jan 22

2024

Two papers have been accepted by EMNLP 2024.

Sep 20

I am honored to receive an invitation from

Stanford MedAI, and will give a talk about medical dialogue systems on May 20th (YouTube).

May 20

One paper has been accepted by

ACL 2024 Findings.

May 15

2023

Our paper RECAP has been accepted by EMNLP 2023 Findings.

Oct 06

Our paper ORGAN and DFMed have been accepted by

ACL 2023 Main proceedings and Findings.

May 01

Selected Publications (View all)

RAR^2: Retrieval-Augmented Medical Reasoning via Thought-Driven Retrieval

Kaishuai Xu, Wenjun Hou*, Yi Cheng*, Wenjie Li (* equal contribution)

EMNLP 2025 Findings Findings

In this work, we propose RAR^2, a joint learning framework that improves both Reasoning-Augmented Retrieval and Retrieval-Augmented Reasoning. Moreover, we design two test-time scaling strategies to explore the boundaries of our framework.

[Paper]

RAR^2: Retrieval-Augmented Medical Reasoning via Thought-Driven Retrieval

Kaishuai Xu, Wenjun Hou*, Yi Cheng*, Wenjie Li (* equal contribution)

EMNLP 2025 Findings Findings

[Paper]

Subtle Errors in Reasoning: Preference Learning via Error-injected Self-editing

Kaishuai Xu, Tiezheng Yu, Wenjun Hou, Yi Cheng, Chak Tou Leong, Liangyou Li, Xin Jiang, Lifeng Shang, Qun Liu, Wenjie Li

ACL 2025 Main Conference

In this work, we propose a novel preference learning framework called eRror-Injected Self-Editing (RISE), which injects predefined subtle errors into pivotal tokens in reasoning or computation steps to construct hard pairs for error mitigation. Compared with other preference learning methods, RISE further refines the training objective without requiring fine-grained sampling or preference annotation.

[Paper] [Code]

Subtle Errors in Reasoning: Preference Learning via Error-injected Self-editing

Kaishuai Xu, Tiezheng Yu, Wenjun Hou, Yi Cheng, Chak Tou Leong, Liangyou Li, Xin Jiang, Lifeng Shang, Qun Liu, Wenjie Li

ACL 2025 Main Conference

[Paper] [Code]

RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection

Wenjun Hou, Yi Cheng*, Kaishuai Xu*, Heng Li, Yan Hu, Wenjie Li, Jiang Liu (* equal contribution)

ACL 2025 Main Conference

We propose Radar, a framework for enhancing radiology report generation with supplementary knowledge injection. Radar improves report generation by systematically leveraging both the internal knowledge of an LLM and externally retrieved information.

[Paper] [Code]

RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection

Wenjun Hou, Yi Cheng*, Kaishuai Xu*, Heng Li, Yan Hu, Wenjie Li, Jiang Liu (* equal contribution)

ACL 2025 Main Conference

[Paper] [Code]

Reasoning Like a Doctor: Improving Medical Dialogue Systems via Diagnostic Reasoning Process Alignment

Kaishuai Xu, Yi Cheng*, Wenjun Hou*, Qiaoyu Tan, Wenjie Li (* equal contribution)

ACL 2024 Findings Findings

We propose a novel framework, Emulation, designed to generate an appropriate response that relies on abductive and deductive diagnostic reasoning analyses and aligns with clinician preferences through thought process modeling. Experimental results on two datasets confirm the efficacy of Emulation. Crucially, our framework furnishes clear explanations for the generated responses, enhancing its transparency in medical consultations.

[Paper] [Code]

Reasoning Like a Doctor: Improving Medical Dialogue Systems via Diagnostic Reasoning Process Alignment

Kaishuai Xu, Yi Cheng*, Wenjun Hou*, Qiaoyu Tan, Wenjie Li (* equal contribution)

ACL 2024 Findings Findings

[Paper] [Code]

Medical Dialogue Generation via Intuitive-then-Analytical Differential Diagnosis

Kaishuai Xu, Wenjun Hou, Yi Cheng, Jian Wang, Wenjie Li

arXiv preprint 2024 Preprint

We propose a medical dialogue generation framework with the Intuitive-then-Analytic Differential Diagnosis (IADDx). Our method starts with a differential diagnosis via retrieval-based intuitive association and subsequently refines it through a graph-enhanced analytic procedure. The resulting differential diagnosis is then used to retrieve medical knowledge and guide response generation.

[Paper]

Medical Dialogue Generation via Intuitive-then-Analytical Differential Diagnosis

Kaishuai Xu, Wenjun Hou, Yi Cheng, Jian Wang, Wenjie Li

arXiv preprint 2024 Preprint

[Paper]

ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning

Wenjun Hou, Kaishuai Xu*, Yi Cheng*, Wenjie Li, Jiang Liu (* equal contribution)

ACL 2023 Main Proceedings

In this paper, we propose an Observation-guided radiology Report Generation framework (ORGan). It first produces an observation plan and then feeds both the plan and radiographs for report generation, where an observation graph and a tree reasoning mechanism are adopted to precisely enrich the plan information by capturing the multi-formats of each observation. Experimental results demonstrate that our framework outperforms previous state-of-the-art methods regarding text quality and clinical efficacy.

[Paper] [Code]

ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning

Wenjun Hou, Kaishuai Xu*, Yi Cheng*, Wenjie Li, Jiang Liu (* equal contribution)

ACL 2023 Main Proceedings

[Paper] [Code]

Medical Dialogue Generation via Dual Flow Modeling

Kaishuai Xu, Wenjun Hou*, Yi Cheng*, Jian Wang, Wenjie Li (* equal contribution)

ACL 2023 Findings Findings

We propose a Dual Flow enhanced Medical (DFMed) dialogue generation framework. It extracts the medical entities and dialogue acts used in the dialogue history and models their transitions with an entity-centric graph flow and a sequential act flow, respectively. We employ two sequential models to encode them and devise an interweaving component to enhance their interactions. Experiments on two datasets demonstrate that our method exceeds baselines in both automatic and manual evaluations.

[Paper] [Code]

Medical Dialogue Generation via Dual Flow Modeling

Kaishuai Xu, Wenjun Hou*, Yi Cheng*, Jian Wang, Wenjie Li (* equal contribution)

ACL 2023 Findings Findings

[Paper] [Code]

Warning

Action required

News

Selected Publications (View all)

RAR^2: Retrieval-Augmented Medical Reasoning via Thought-Driven Retrieval

RAR^2: Retrieval-Augmented Medical Reasoning via Thought-Driven Retrieval

Subtle Errors in Reasoning: Preference Learning via Error-injected Self-editing

Subtle Errors in Reasoning: Preference Learning via Error-injected Self-editing

RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection

RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection

Reasoning Like a Doctor: Improving Medical Dialogue Systems via Diagnostic Reasoning Process Alignment

Reasoning Like a Doctor: Improving Medical Dialogue Systems via Diagnostic Reasoning Process Alignment

Medical Dialogue Generation via Intuitive-then-Analytical Differential Diagnosis

Medical Dialogue Generation via Intuitive-then-Analytical Differential Diagnosis

ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning

ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning

Medical Dialogue Generation via Dual Flow Modeling

Medical Dialogue Generation via Dual Flow Modeling