Omni-AD: A Large-scale and Versatile Benchmark for Industrial Anomaly Detection
Dahu Shi ⋅ Chengshen He ⋅ Shaochen Zhang ⋅ Bo Qian ⋅ Xiaochen Quan ⋅ Wencong Zhang ⋅ Xing Wei
Abstract
Industrial Anomaly Detection (IAD) has attracted significant attention and witnessed rapid development.However, the advancement in this field is hindered by two key issues: the performance saturation of existing benchmarks, limiting discriminative evaluation of different IAD methods, and the absence of benchmarks tailored to assess recent multi-modal large language models (MLLMs) in anomaly detection.To this end, we present Omni-AD, a comprehensive IAD benchmark featuring:\romannumeral1) \textbf{Large scale}:The dataset consists of approximately 35K images (6$\times$ larger than MVTec) with 150 product categories (10$\times$ larger than MVTec) spanning 16 industrial sectors, delivering unprecedented diversity in terms of both category and image scale compared with existing datasets.\romannumeral2) \textbf{Versatility}:The benchmark supports both conventional unsupervised and emerging MLLM-based IAD evaluation protocols. The latter is achieved by defining three subtasks of progressive difficulty, with two structured as visual question answering (VQA) and one as visual grounding.\romannumeral3) \textbf{Challenge}:Extensive experimental results of state-of-the-art methods reveal that the Omni-AD benchmark is more challenging than existing benchmarks, which can drive the future development of the IAD field.
Successful Page Load