# Glossary

You will frequently encounter these concepts throughout this documentation as Mindee is centered around them.

#### Annotation

The values (labels) a human enters to train, test, or alter an [ML](#ml-machine-learning) processing pipeline.

Mindee uses annotations notably for the RAG feature.

#### API (Application Programming Interface)

A set of protocols and rules to define how applications or devices can connect to and communicate with each other.

In the context of Mindee, the API defines how to send files and receive results.

#### Catalog

A list of prebuilt [models](#model) you can browse, copy, and adapt to your needs.

#### Custom Model

A [model](#model) you create for your own [document](#document) type and use case.

#### Data Schema

A set of [fields](#field) and their [guidelines](#guideline) for data extraction. Associated to a [model](#model).

#### Document

A structured or semi-structured written or graphical representation of human thought, such as an invoice, receipt, ID card, W9 form, contract, train ticket, etc.

Mindee processes documents contained in various [file](#file) formats.

#### Extraction Model

A type of Mindee model designed to extract structured data from a [document](#document).

Includes a [Data Schema](#data-schema) and any optional features.

#### Field

A single data point to extract from a document. Part of a [Data Schema](#data-schema).

#### File

A computer file. In the context of Mindee, a file may contain one or several [documents](#document).

#### Guideline

A set of instructions provided by the user, to clarify or alter the behavior of a [ML model](#ml-machine-learning).

#### Inference

The processing step where a [ML model](#ml-machine-learning) performs an operation on a [file](#file).

By extension, this also refers to response or results that our API returns for a given file.

#### Job

The asynchronous processing task created when a file or URL is sent to the Mindee [API](#api-application-programming-interface) for processing.

#### ML (Machine Learning)

Algorithms that learn and generalize from data, and can perform tasks without explicit programming instructions.

This is the backbone of Mindee's technology.

#### Model

A set of instructions for processing [files](#file). There are several types of Mindee models that correspond to different uses.

A model can operate on a single document or multiple documents within the same file.

#### Multimodal Model

A type of deep-learning model that processes multiple types of data, such as text and vision.

Mindee uses multimodal models when processing documents.

#### OpenAPI

An industry specification that defines a language-agnostic interface for RESTful APIs.

#### Payload

A payload refers to data that is submitted to the Mindee server and the data returned by the server when an API request is made.

#### Polling

A result retrieval flow where your application repeatedly checks the processing status of a [job](#job).

#### SDK (Software Development Kit)

A set of software development tools developers can use to facilitate the creation of applications.

#### Utility Model

A [model](#model) used to preprocess or analyze documents, such as OCR, Split, Crop, or Classification.

#### Webhook

A result retrieval flow where Mindee sends the processing result directly to your Web server.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.mindee.com/getting-started/glossary.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
