IndustryBench-MIPU: Benchmarking Multi-Image Attribute Value Extraction for Industrial Products (opens in new tab)
Industrial products such as valves and circuit breakers are defined by dense technical specifications that govern procurement, compatibility, and safety across supply chains. These specifications are scattered across multiple heterogeneous product images, including specification tables, nameplates, and technical drawings, yet whether Multimodal Large Language Models (MLLMs) can reliably recover them remains underexplored. To fill this gap, we in...
Read the original article