Junfei Wu
Ph.D. Student in MLLMs @ CASIA
I’m Junfei Wu, a Ph.D. student at the Institute of Automation, Chinese Academy of Sciences (State Key Laboratory of Multimodal Artificial Intelligence), advised by Prof. Tieniu Tan. I also collaborate closely with Prof. Qiang Liu, Shu Wu, and Liang Wang. My research centers on trustworthy and interpretable reasoning in multimodal large language models (MLLMs), with a focus on spatial reasoning, hallucination detection, and test-time adaptation.
I interned at Ant Group’s Multimodal LLM team, where I developed the “Drawing to Reason in Space” framework to enhance visual thinking in LVLMs—published at NeurIPS 2025. I also pioneered representation-based hallucination mitigation techniques (EMNLP 2025) and logic-consistency-based detection frameworks (ACL 2024 Findings).
When not debugging tensors, I enjoy playing basketball, badminton, table tennis, yo-yo, or solving a Rubik’s Cube.
Feel free to reach me at junfei.wu@cripac.ia.ac.cn.
news
| Sep 19, 2025 | Our paper “Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing” has been accepted to the NeurIPS 2025. See you in San Diego! |
|---|---|
| Aug 21, 2025 | Our paper “SHARP: Steering Hallucination in LVLMs via Representation Engineering” has been accepted to EMNLP 2025. See you in Suzhou! |
selected publications
-
Reinforcing spatial reasoning in vision-language models with interwoven thinking and visual drawingarXiv preprint arXiv:2506.09965, 2025 -
SHARP: Steering Hallucination in LVLMs via Representation EngineeringIn Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025 -
Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language ModelsIn Findings of the Association for Computational Linguistics ACL 2024, 2024 -
Adversarial contrastive learning for evidence-aware fake news detection with graph neural networksIEEE Transactions on Knowledge and Data Engineering, 2023 -
Bias mitigation for evidence-aware fake news detection by causal interventionIn Proceedings of the 45th International ACM SIGIR conference on research and development in information retrieval, 2022