Skip to main content

πŸ‘οΈ Mistral OCR

warning

This tutorial is a community contribution and is not supported by the Open WebUI team. It serves only as a demonstration on how to customize Open WebUI for your specific use case. Want to contribute? Check out the contributing tutorial.

πŸ‘οΈ Mistral OCR​

This documentation provides a step-by-step guide to integrating Mistral OCR with Open WebUI. Mistral OCR is an optical character recognition library designed to extract text from a variety of image-based file formatsβ€”including scanned PDFs, images, and handwritten documentsβ€”into structured data such as JSON or plain text. With advanced support for multilingual text recognition, layout analysis, and handwriting interpretation, Mistral OCR simplifies the process of digitizing and processing documents for AI applications like search, summarization, and data extraction, all through a robust and customizable interface.

Prerequisites​

  • Open WebUI instance
  • Mistral AI account

Integration Steps​

Step 1: Sign Up or Login to Mistral AI console​

  • Go to https://console.mistral.ai
  • Follow the instructions as instructed on the process
  • After successful authorization, you should be welcomed to the Console Home

Step 2: Generate an API key​

  • Go to API Keys or https://console.mistral.ai/api-keys
  • Create a new key and make sure to copy it

Step 3: Configure Open WebUI to use Mistral OCR​

  • Log in to your Open WebUI instance.
  • Navigate to the Admin Panel settings menu.
  • Click on Settings.
  • Click on the Documents tab.
  • Change the Default content extraction engine dropdown to Mistral OCR.
  • Paste the API Key on the field
  • Save the Admin Panel.

Verifying Mistral OCR

To verify that Mistral OCR is working correctly in script, please refer to https://docs.mistral.ai/capabilities/document/

Conclusion​

Integrating Mistral OCR with Open WebUI is a simple and effective way to enhance document processing and content extraction capabilities. By following the steps in this guide, you can set up Mistral OCR as the default extraction engine and leverage its advanced text recognition features. Once configured, Mistral OCR enables powerful, multilingual document parsing with support for various formats, enhancing AI-driven document analysis capabilities in Open WebUI.