Qixun Wang (王启迅)

About Me

I’m a third-year Ph.D. student in the School of Intelligence Science and Technology at Peking University. My research interests include:

  • Multimodal Large Language Models (MLLMs): improving the perception and reasoning capabilities of MLLMs, including latent reasoning, agentic reasoning, etc.
  • Out-of-distribution (OOD) generalization, including theoretical analysis and algorithm design across various models and scenarios, such as classical computer vision tasks, graph data, and the generalization behavior of LLMs.

🔥 News

  • 2026.05 One paper is released on Arxiv (ArtifactBench for assessing MLLMs in AI-generated video detection).
  • 2026.05 One paper is accepted by ICML 2026 (Semantic enriched latent visual reasoning).
  • 2026.03 I will be joining Kling Team (Kuaishou Technology) as a research intern.
  • 2026.02 Two papers are accepted by CVPR 2026 (Reasoning in latent visual space & Benchmark for unified multimodal models).
  • 2025.02 One paper is accepted by CVPR 2025 (Audio-Visual Instance Segmentation).

🎓 Education

  • B.S. in Intelligence Science and Technology, EECS — Peking University (2019~2023)
  • Ph.D. Candidate in ML & Computer Vision, SIST — Peking University (2023~present)

📚 Selected Publications

Full publication list

ArtifactBench teaser

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos
Yuqi Tang*, Yang Shi*, Zhuoran Zhang*, Qixun Wang*, Xuehai Bai, Yue Ding, Ruizhe Chen, Bohan Zeng, Xinlong Chen, Xuanyu Zhu, Bozhou Li, Yuran Wang, Yifan Dai, Chengzhuo Tong, Xinyu Liu, Yiyan Ji, Yujie Wei, Yuhao Dong, Shilin Yan, Fengxiang Wang, Yi-Fan Zhang†, Haotian Wang†, Yuanxing Zhang†, Pengfei Wan Arxiv Preprint 2026paper · code

SLVR teaser

Semantic-Enriched Latent Visual Reasoning
Tianrun Xu, Yue Sun, Qixun Wang, Jingyi Lu, Yuan Wang, Tianren Zhang, Longteng Guo, Fengyun Rao, Jing LYU, Feng Chen, Jing Liu
ICML 2026paper · code

Monet teaser

Monet: Reasoning in Latent Visual Space Beyond Images and Language
Qixun Wang, Yang Shi, Yifei Wang, Yuanxing Zhang, Pengfei Wan, Kun Gai, Xianghua Ying, Yisen Wang
CVPR 2026paper · code

ICL-OOD teaser

Can In-context Learning Really Generalize to Out-of-distribution Tasks?
Qixun Wang, Yifei Wang, Xianghua Ying, Yisen Wang
ICLR 2025paper · code

Invariant Learning on Graphs teaser

Dissecting the Failure of Invariant Learning on Graphs
Qixun Wang, Yifei Wang, Yisen Wang, Xianghua Ying
NeurIPS 2024paper · code

OOD robustness teaser

Improving Out-of-distribution Robustness by Adversarial Training with Structured Priors
Qixun Wang*, Yifei Wang*, Hong Zhu, Yisen Wang
NeurIPS 2022 Spotlightpaper · code

💼 Experiences

  • Kling Team, Kuaishou Technology (快手科技) — Research Intern (2026.03–Now)
    Working on multimodal agents

🏆 Awards

  • The Third-Class Scholarship of Peking University (2025)
  • Merit Student at Peking University (2025)
  • Outstanding Graduate of Peking University (2023)
  • Yanchuang Capital Scholarship (Top 6%) (2022)
  • Merit Student at Peking University (Top 6%) (2022)
  • Academic Innovation Award at Peking University (Top 1%) (2022)
  • Award for Academic Excellence (2021)