Multi Receipts Detector OCR

Mindee’s Multi Receipts Detector OCR API uses deep learning to automatically, accurately, and instantaneously parse your receipt coordinates. In under a second, the API extracts a set of data from your PDFs or photos of any document showing multiple receipts, including:

  • Receipt Position by Bounding Box

It can be easily combined with Mindee's Receipts and Financial Documents OCR API to provide an end to end workflow from detecting to extracting key information on receipts.

Set up the API

Create an API key

To begin using the Mindee V1 OCR API, your first step is to create your V1 API key.

  1. You'll need a document including one or multiple receipts. You can use the sample document provided below.

  1. Access your API by clicking on the Multi Receipts Detector card in the Utilities

  1. From the left navigation, go to documentation > API Reference, you'll find sample code in popular languages and command line.

  • Replace my-api-key-here with your new API key, or use the select an API key feature and it will be filled automatically.

  • Copy and paste the sample code of your desired choice in your application, code environment, terminal etc.

  • Replace /path/to/the/file.ext with the path to your input document.

API Response

Here is the full JSON response you get when you call the API:

You can find the prediction within the prediction key found in two locations:

  • In document > inference > prediction for document-level predictions: it contains the different fields extracted at the document level, meaning that for multi-pages PDFs, we reconstruct a single List of Receipts object using all the pages.

  • In document > inference > pages[ ] > prediction for page-level predictions: it gives the prediction for each page independently. With images, there is only one element on this array, but with PDFs, you can find the extracted data for each PDF page.

Each predicted field may contain one or several values:

  • a confidence score

  • a bounding_box highlighting the information location

  • a page_id where the information was found (document level only)

Extracted data

Using the above example the following are the basic fields that can be extracted.

List of Receipts

  • receipts: This field outputs the coordinates for each receipt detected.

Last updated

Was this helpful?