Publications

2026

LaDi-RL
Beyond Mode Elicitation: Diversity-Preserving Reinforcement Learning via Latent Diffusion Reasoner
Haoqiang Kang, Yizhe Zhang, Nikki Lijing Kuang, Yi-An Ma, and Lianhui Qin
arXiv, 2026.
[paper]
TSRBench
TSRBench: A Comprehensive Multi-task Multi-modal Time Series Reasoning Benchmark for Generalist Models
Fangxu Yu, Xingming Guo, Linjie Yuan, Haoqiang Kang, Hongyu Zhao, Lianhui Qin, Furong Huang, Bin Hu, and Tianyi Zhou
arXiv, 2026.
[paper]

2025

LaDiR
LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning
Haoqiang Kang, Yizhe Zhang, Nikki Lijing Kuang, Nicklas Majamaki, Navdeep Jaitly, Yi-An Ma, and Lianhui Qin
International Conference on Learning Representations (ICLR), 2026.
[paper]
PAN
PAN: A World Model for General, Interactable, and Long-Horizon World Simulation
Jiannan Xiang, Yi Gu, Zihan Liu, Zeyu Feng, Qiyue Gao, Yiyan Hu, Benhao Huang, Guangyi Liu, Yichi Yang, Kun Zhou, Davit Abrahamyan, Arif Ahmad, Ganesh Bannur, Junrong Chen, Kimi Chen, Mingkai Deng, Ruobing Han, Xinqi Huang, Haoqiang Kang, Zheqi Liu, Enze Ma, Hector Ren, Yashowardhan Shinde, Rohan Shingre, Ramsundar Tanikella, Kaiming Tao, Dequan Yang, Xinle Yu, Cong Zeng, Binglin Zhou, Zhengzhong Liu, Zhiting Hu, and Eric P. Xing
arXiv, 2025.
[paper]
VideoScience-Bench
Benchmarking Scientific Understanding and Reasoning for Video Generation using VideoScience-Bench
Lanxiang Hu, Abhilash Shankarampeta, Yixin Huang, Zilin Dai, Haoyang Yu, Yujie Zhao, Haoqiang Kang, Daniel Zhao, Tajana Rosing, and Hao Zhang
arXiv, 2025.
[paper]
GFlowVLM
GFlowVLM: Enhancing Multi-step Reasoning in Vision-Language Models with Generative Flow Networks
Haoqiang Kang, Enna Sachdeva, Piyush Gupta, Sangjae Bae, and Kwonjoon Lee
Conference on Computer Vision and Pattern Recognition (CVPR), 2025.
[paper]
Flow of Reasoning
Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples
Fangxu Yu, Lai Jiang, Haoqiang Kang, Shibo Hao, and Lianhui Qin
International Conference on Machine Learning (ICML), 2025.
[paper]

2024

FinBen
The FinBen: An Holistic Financial Benchmark for Large Language Models
Qianqian Xie, Weiguang Han, Zhengyu Chen, Ruoyu Xiang, Xiao Zhang, Yueru He, Mengxi Xiao, Dong Li, Yongfu Dai, Duanyu Feng, Yijing Xu, Haoqiang Kang, Ziyan Kuang, Chenhan Yuan, Kailai Yang, Zheheng Luo, Tianlin Zhang, Zhiwei Liu, Guojun Xiong, Zhiyang Deng, Yuechen Jiang, Zhiyuan Yao, Haohang Li, Yangyang Yu, Gang Hu, Jiajia Huang, Xiao-Yang Liu, Alejandro Lopez-Lira, Benyou Wang, Yanzhao Lai, Hao Wang, Min Peng, Sophia Ananiadou, and Jimin Huang
NeurIPS Datasets and Benchmarks Track, 2024.
[paper]
Multilingual Hallucination
Comparing Hallucination Detection Metrics for Multilingual Generation
Haoqiang Kang, Terra Blevins, and Luke Zettlemoyer
arXiv, 2024.
[paper]
Translate to Disambiguate
Translate to Disambiguate: Zero-shot Multilingual Word Sense Disambiguation with Pretrained Language Models
Haoqiang Kang*, Terra Blevins*, and Luke Zettlemoyer
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2024. (Oral)
[paper]

2023

EVER
EVER: Mitigating Hallucination in Large Language Models through Real-Time Verification and Rectification
Haoqiang Kang, Juntong Ni, and Huaxiu Yao
arXiv, 2023.
[paper]
Finance Hallucination
Hallucination of Large Language Models in Finance: An Empirical Examination
Haoqiang Kang and Xiao-Yang Liu
NeurIPS Workshop on I Can't Believe It's Not Better (ICBINB), 2023.
[paper]