Mistral OCR 4 brings multilingual structured document extraction and improved performance (opens in new tab)
Mistral OCR 4, advancing document understanding with support for bounding boxes, block classification, and inline confidence scores. Each extracted content block is now localized, classified by type, and accompanied by per-page and per-word confidence metrics, alongside the textual output. The model expands accessibility by supporting 170 languages across 10 language groups, including those that are rare or low-resource, addressing a gap in many existing solutions. Building on these enhanceme...
Read the original article