I am a Ph.D. student at Fudan University, supervised by Prof. Wenqiang Zhang. I received my B.E. degree in Information Security from Fudan University in 2022.
My research interests include:
- Computer vision
- Multimodal Large Language Model
- Video understanding
- Image generation
- 3D vision
- Instance segmentation
- Multimodal
News
- 2025.01: One paper is accepted by ICLR 2025.
- 2024.09: One paper is accepted by NeurIPS 2024.
- 2024.08: Two papers are accepted by ACM MM 2024.
- 2024.07: Two papers are accepted by ECCV 2024.
- 2024.07: I'm organizing the 6th Large-Scale Video Object Segmentation (LSVOS) Challenge! Everyone is welcome to participate!
- 2024.04: LVOS V2 has been released! You are welcome to follow it!
- 2024.03: One paper is accepted by CVPR 2024 and is presented as a Highlight! Congratulations to all co-authors!
- 2023.09: One paper is accepted by NeurIPS 2023.
- 2023.08: Three papers are accepted by ACM MM 2023. Congratulations to all co-authors!
- 2023.07: LVOS has been accepted by ICCV 2023.
- 2022.11: LVOS (the first long-term video object segmentation benchmark) is now public!
Publications

General Compression Framework for Efficient Transformer Object Tracking
Lingyi Hong, Jinglun Li, Xinyu Zhou, Shilin Yan, Pinxue Guo, Kaixun Jiang, Zhaoyu Chen, Shuyong Gao, Wei Zhang, Hong Lu, Wenqiang Zhang
[Paper]
- General compression framework for efficient SOT.
- Supports any teacher and student structure, any input resolution, and any number of layers.
- Balances efficiency and effectiveness (2.17× speedup while retaining 96% of the accuracy).

LVOS: A Benchmark for Large-scale Long-term Video Object Segmentation
Lingyi Hong, Zhongying Liu, Wenchao Chen, Chenzhi Tan, Yuang Feng, Xinyu Zhou, Pinxue Guo, Jinglun Li, Zhaoyu Chen, Shuyong Gao, Wei Zhang, Wenqiang Zhang
LVOS: A Benchmark for Long-term Video Object Segmentation
Lingyi Hong, Wenchao Chen, Zhongying Liu, Wei Zhang, Pinxue Guo, Zhaoyu Chen, Wenqiang Zhang
[Paper V2] [Paper V1] [Home Page] [Github]
- The first long-term video object segmentation benchmark.

(Highlight) OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning
Lingyi Hong, Shilin Yan, Renrui Zhang, Wanyun Li, Xinyu Zhou, Pinxue Guo, Kaixun Jiang, Yiting Chen, Jinglun Li, Zhaoyu Chen, Wenqiang Zhang
[Paper]
- The first to unify RGB and RGB+X tracking in a general framework.
- Introduces foundation models and parameter-efficient tuning into object tracking, breaking with the traditional full fine-tuning strategy.
- SOTA performance on 11 benchmarks across 6 tracking tasks.
- ICLR 2025. DynaPrompt: Dynamic Test-Time Prompt Tuning, Zehao Xiao, Shilin Yan, Jack Hong, Jiayin Cai, Xiaolong Jiang, Yao Hu, Jiayi Shen, Qi Wang, Cees GM Snoek
- NeurIPS 2024. DeTrack: In-model Latent Denoising Learning for Visual Object Tracking, Xinyu Zhou, Jinglun Li, Lingyi Hong, Kaixun Jiang, Pinxue Guo, Weifeng Ge, Wenqiang Zhang
- ACM MM 2024. X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation, Pinxue Guo, Wanyun Li, Hao Huang, Lingyi Hong, Xinyu Zhou, Zhaoyu Chen, Jinglun Li, Kaixun Jiang, Wei Zhang, Wenqiang Zhang
- ACM MM 2024. TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning, Jinglun Li, Xinyu Zhou, Kaixun Jiang, Lingyi Hong, Pinxue Guo, Zhaoyu Chen, Weifeng Ge, Wenqiang Zhang
- ECCV 2024. OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework, Wanyun Li, Pinxue Guo, Xinyu Zhou, Lingyi Hong, Yangji He, Xiangyu Zheng, Wei Zhang, Wenqiang Zhang
- ECCV 2024. PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation, Shilin Yan, Xiaohao Xu, Renrui Zhang, Lingyi Hong, Wenchao Chen, Wenqiang Zhang, Wei Zhang
- NeurIPS 2023. Reading Relevant Feature from Global Representation Memory for Visual Object Tracking, Xinyu Zhou, Pinxue Guo, Lingyi Hong, Jinglun Li, Wei Zhang, Weifeng Ge, Wenqiang Zhang
- ACM MM 2023. SimulFlow: Simultaneously Extracting Feature and Identifying Target for Unsupervised Video Object Segmentation, Lingyi Hong, Wei Zhang, Shuyong Gao, Hong Lu, Wenqiang Zhang
- ACM MM 2023. Exploring the Adversarial Robustness of Video Object Segmentation via One-shot Adversarial Attacks, Kaixun Jiang, Lingyi Hong, Zhaoyu Chen, Pinxue Guo, Zeng Tao, Yan Wang, Wenqiang Zhang
- ACM MM 2023. (Oral) Towards Decision-based Sparse Attacks on Video Recognition, Kaixun Jiang, Zhaoyu Chen, Xinyu Zhou, Jingyu Zhang, Lingyi Hong, JiaFeng Wang, Bo Li, Yan Wang, Wenqiang Zhang
Organizations
Education
- 2022.09 - Now, Ph.D. candidate, School of Computer Science, Fudan University, Shanghai, China.
- 2018.09 - 2022.06, Undergraduate, School of Computer Science, Fudan University, Shanghai, China.
Contests
- 2024.08: 2nd Place, Global Multimedia Deepfake Detection Challenge.
- 2022.06: 2nd Place, The 4th Large-scale Video Object Segmentation Challenge. CVPRW 2022.
Services
- Reviewer for TPAMI, TIP, TCSVT, ICML 2025, NeurIPS 2024, ICLR 2025, CVPR 2024 - 2025, ICCV 2023, ECCV 2024, ACM MM 2023 - 2024.