Junfei Wu
Ph.D. Student in MLLMs @ CASIA
I’m Junfei Wu, a Ph.D. student at the Institute of Automation, Chinese Academy of Sciences (State Key Laboratory of Multimodal Artificial Intelligence), advised by Prof. Tieniu Tan and collaborating with Prof. Qiang Liu, Shu Wu, and Liang Wang. My research focuses on multimodal reasoning in Large Vision-Language Models (LVLMs), including spatial reasoning, hallucination mitigation.
I am currently interning at Qwen-VL, focusing on spatial intelligence. Previously, I interned at Ant NLP Research Group, working on visual reasoning and hallucination in vision-language models.
Outside of research, I enjoy basketball, badminton, table tennis, yo-yo, and Rubik’s Cube.
news
| Sep 19, 2025 | Our paper “Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing” has been accepted to the NeurIPS 2025. See you in San Diego! |
|---|---|
| Aug 21, 2025 | Our paper “SHARP: Steering Hallucination in LVLMs via Representation Engineering” has been accepted to EMNLP 2025. See you in Suzhou! |
selected publications
-
SHARP: Steering Hallucination in LVLMs via Representation EngineeringIn Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025 -
Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language ModelsIn Findings of the Association for Computational Linguistics ACL 2024, 2024 -
Adversarial contrastive learning for evidence-aware fake news detection with graph neural networksIEEE Transactions on Knowledge and Data Engineering, 2023 -
Bias mitigation for evidence-aware fake news detection by causal interventionIn Proceedings of the 45th International ACM SIGIR conference on research and development in information retrieval, 2022