Poster Session 3 · Thursday, December 4, 2025 11:00 AM → 2:00 PM
#3914
TransferBench: Benchmarking Ensemble-based Black-box Transfer Attacks
Abstract
Ensemble-based black-box transfer attacks optimize adversarial examples on a set of surrogate models, claiming to reach high success rates by querying the (unknown) target model only a few times. In this work, we show that prior evaluations are systematically biased, as such methods are tested only under overly optimistic scenarios, without considering
- how the choice of surrogate models influences transferability,
- how they perform against robust target models, and
- whether querying the target to refine the attack is really required.
To address these gaps, we introduce TransferBench, a framework for evaluating ensemble-based black-box transfer attacks under more realistic and challenging scenarios than prior work. Our framework considers 17 distinct settings on CIFAR-10 and ImageNet, including diverse surrogate-target combinations, robust targets, and comparisons to baseline methods that do not use any query-based refinement mechanism.
Our findings reveal that existing methods fail to generalize to more challenging scenarios, and that query-based refinement offers little to no benefit, contradicting prior claims. These results highlight that building reliable and query-efficient black-box transfer attacks remains an open challenge.
We release our benchmark and evaluation code at: https://github.com/pralab/transfer-bench.