Qi She (佘琪)

Qi She (佘琪)

ByteDance · Research Scientist

MLLMs · Agentic AI · AIGC

Selected Top-tier Publication

[Note] Selected peer-reviewed papers listed below. For the full and most up-to-date publication list, see Google Scholar: Qi She

★ Selected Highlights

MammothModa2: A Unified AR-Diffusion Framework for Multimodal Understanding and Generation
2025
MammothModa2: A Unified AR-Diffusion Framework for Multimodal Understanding and Generation
T Shen, X Wan, T Chen, R Zhang, J Pan, D Lu, F Lei, Z Lu, Y Yang, ...
Beyond text-visual attention: Exploiting visual cues for effective token pruning in vlms
ICCV 2025
Beyond text-visual attention: Exploiting visual cues for effective token pruning in vlms
Q Zhang, A Cheng, M Lu, R Zhang, Z Zhuo, J Cao, S Guo, Qi She, ...
Mammothmoda: Multi-modal large language model
2024
Mammothmoda: Multi-modal large language model
Qi She, J Pan, X Wan, R Zhang, D Lu, K Huang
Generative Adversarial Networks in Computer Vision: A Survey and Taxonomy
CSUR 2021
Generative Adversarial Networks in Computer Vision: A Survey and Taxonomy
Zhengwei Wang, Qi She , Tomas E Ward.
On Learning Contrastive Representations for Learning with Noisy Labels
CVPR 2022
On Learning Contrastive Representations for Learning with Noisy Labels
Li Yi, Sheng Liu, Qi She, Lei Zhu, A. Ian McLeod, Boyu Wang
Learning from Temporal Gradient for Semi-supervised Action Recognition
CVPR 2022
Learning from Temporal Gradient for Semi-supervised Action Recognition
Junfei Xiao, Longlong Jing, Lin Zhang, Ju He, Qi She, Zongwei Zhou, Alan Yuille, Yingwei Li
MINE: Towards Continuous Depth MPI with NeRF for Novel View Synthesis
ICCV 2021
MINE: Towards Continuous Depth MPI with NeRF for Novel View Synthesis
Jiaxin Li, Zijian Feng, Qi She, Henghui Ding, Changhu Wang, Gim Hee Lee.
Involution: Inverting the Inherence of Convolution for Visual Recognition
CVPR 2021
Involution: Inverting the Inherence of Convolution for Visual Recognition
Duo Li, Jie Hu, Changhu Wang, Xiangtai Li, Qi She, Lei Zhu, Tong Zhang, Qifeng Chen.
ACTION-Net: Multipath Excitation for Action Recognition
CVPR 2021
ACTION-Net: Multipath Excitation for Action Recognition
Zhengwei Wang, Qi She, Aljosa Smolic.
OpenLORIS-Object: A Robotic Vision Dataset and Benchmark for Lifelong Deep Learning
ICRA 2020
OpenLORIS-Object: A Robotic Vision Dataset and Benchmark for Lifelong Deep Learning
Qi She, Fan Feng, Xinyue Hao, Qihan Yang, Chuanlin Lan, Vincenzo Lomonaco, Xuesong Shi, Zhengwei Wang, Yao Guo, Yimin Zhang, Fei Qiao, Rosa H. M. Chan.
Are We Ready for Service Robots? The OpenLORIS-Scene Datasets for Lifelong SLAM
ICRA 2020
Are We Ready for Service Robots? The OpenLORIS-Scene Datasets for Lifelong SLAM
Xuesong Shi, Dongjiang Li, Pengpeng Zhao, Qinbin Tian, Yuxin Tian, Qiwei Long, Chunhao Zhu, Jingwei Song, Fei Qiao, Le Song, Yangquan Guo, Zhigang Wang, Yimin Zhang, Baoxing Qin, Wei Yang, Fangshi Wang, Rosa H. M. Chan, Qi She
Year:
Topic:

Journal 7

Conference 23

Workshop 3

Preprint 9

Patent 2

  • Object identification based on adaptive learning
    US Patent 12,511,887, 2025.
  • Trajectory prediction using directed graph and destination features
    US Patent 12,198,460, 2025.