Zhibo Wang

Full Professor, Zhejiang University

1 paper at NeurIPS 2025

Homepage· OpenReview· Semantic Scholar· Google Scholar

Poster Session 5

1 paper

Friday, December 5, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

ICLScan: Detecting Backdoors in Black-Box Large Language Models via Targeted In-context Illumination

#1411 · Xiaoyi Pang, Xuanyi Hao, Song Guo, Qi Luo, Zhibo Wang

This paper reveals that backdoored LLM are more susceptible to new trigger injection during in-context learning (ICL) and proposes a targeted ICL-based lightweight backdoor detection framework for black-box LLMs used for generative tasks.