Publications

Please also check the Google Scholar for a comprehensive list.

2025

  1. COLM
    True Multimodal In-Context Learning Needs Attention to the Visual Context
    Shuo Chen , Jianzhe Liu , Zhen Han , Yan Xia , Daniel Cremers , Philip Torr , Volker Tresp , and Jindong Gu
    Conference on Language Modeling (COLM), 2025
  2. EMNLP
    METok: Multi-Stage Event-based Token Compression for Efficient Long Video Understanding
    Mengyue Wang , Shuo Chen , Kristian Kersting , Volker Tresp , and Yunpu Ma
    Conference on Empirical Methods in Natural Language Processing (EMNLP) Main, 2025
  3. COLM
    Supposedly Equivalent Facts That Aren’t? Entity Frequency in Pre-training Induces Asymmetry in LLMs
    Yuan He , Bailan He , Zifeng Ding , Alisia Lupidi , Yuqicheng Zhu , Shuo Chen , Caiqi Zhang , Jiaoyan Chen , Yunpu Ma , and Volker Tresp
    COLM, 2025
  4. ACL
    Multimodal pragmatic jailbreak on text-to-image models
    Tong Liu , Zhixin Lai , Jiawen Wang , Gengyuan Zhang , Shuo Chen , Philip Torr , Vera Demberg , Volker Tresp , and Jindong Gu
    ACL, 2025
  5. WACV
    Can Multimodal Large Language Models Truly Perform Multimodal In-Context Learning?
    Shuo Chen , Zhen Han , Bailan He , Mark Buckley , Philip Torr , Volker Tresp , and Jindong Gu
    In IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) , 2025

2024

  1. EMNLP
    Visual question decomposition on multimodal large language models
    Haowei Zhang , Jianzhe Liu , Zhen Han , Shuo Chen , Bailan He , Volker Tresp , Zhiqiang Xu , and Jindong Gu
    Conference on Empirical Methods in Natural Language Processing (EMNLP) Findings, 2024
  2. COLM
    Stop Reasoning! When Multimodal LLMs with Chain-of-Thought Reasoning Meets Adversarial Images
    Zefeng Wang , Zhen Han , Shuo Chen , Fan Xue , Zifeng Ding , Xun Xiao , Volker Tresp , Philip Torr , and Jindong Gu
    In Conference on Language Modeling (COLM) 2024 , 2024
  3. SET LLM @ ICLR
    Red Teaming GPT-4V: Are GPT-4V Safe Against Uni/Multi-Modal Jailbreak Attacks?
    Shuo Chen , Zhen Han , Bailan He , Zifeng Ding , Wenqian Yu , Philip Torr , Volker Tresp , and Jindong Gu
    In ICLR 2024 Workshop on Secure and Trustworthy Large Language Models , 2024
  4. arXiv
    PERFT: Parameter-Efficient Routed Fine-Tuning for Mixture-of-Expert Model
    Yilun Liu , Yunpu Ma , Shuo Chen , Zifeng Ding , Bailan He , Zhen Han , and Volker Tresp
    arXiv preprint arXiv:2411.08212, 2024

2023

  1. NeurIPS
    Benchmarking robustness of adaptation methods on pre-trained vision-language models
    Shuo Chen , Jindong Gu , Zhen Han , Yunpu Ma , Philip Torr , and Volker Tresp
    In Conference on Neural Information Processing Systems (NeruIPS) , 2023
  2. arXiv
    A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models
    Jindong Gu , Zhen Han , Shuo Chen , Ahmad Beirami , Bailan He , Gengyuan Zhang , Ruotong Liao , Yao Qin , Volker Tresp , and Philip Torr
    arXiv preprint arXiv:2307.12980, 2023

2022

  1. arXiv
    Social Networks are Divulging Your Identity behind Crypto Addresses
    Shuo Chen , and Shaikh Muhammad Uzair Norman
    arXiv preprint, 2022