
# Import Data: Model Training

This page explains how to import sample documents for training, including the file formats and document types to use.

**To import sample documents for training, follow these steps:**

* **Prepare the sample documents:** Make sure the sample documents are in a supported format, such as PDF, Word, or Excel. They should cover the variety of document types and layouts that the document processing system will encounter in production.
* **Navigate to the import function:** Log in to the administration area of the document processing system and navigate to the area where you can import new documents.

<figure><img src="/files/Ahe2TGXCaZ8P6SAGLegP" alt=""><figcaption><p>Global Settings</p></figcaption></figure>

<figure><img src="/files/5QrLZUJle44QecIbfaQJ" alt=""><figcaption></figcaption></figure>

* **Select the option to import documents:** Click the button or link for importing documents, for example one labeled "Import".

<figure><img src="/files/N6f1g4ugMyFqIxiGr4kD" alt=""><figcaption></figcaption></figure>

* **Select the amount and date format:**

<figure><img src="/files/DV3QCcSYrsBv7C0wXQNz" alt=""><figcaption></figcaption></figure>

#### Amount Format:

The amount format may vary by region, but in general there are some common conventions:

1. Currency symbol: The currency symbol is usually placed before the amount, e.g. "$" for US dollars, "€" for euros, "£" for British pounds, etc.
2. Thousands separator: In some countries, long numbers are grouped with a thousands separator for better readability. In the US, a comma is commonly used (e.g. 1,000), while in many European countries a period is used (e.g. 1.000).
3. Decimal separator: The decimal separator separates the integer part from the decimal places. Most English-speaking countries use a period (e.g. 10.99), while many European countries use a comma (e.g. 10,99).
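To illustrate why the amount format matters, here is a minimal Python sketch of how the same digits resolve to different values depending on the chosen separators. The function `parse_amount` and its parameters are hypothetical and are not part of the product; they only show the convention described above.

```python
from decimal import Decimal


def parse_amount(text: str, thousands_sep: str, decimal_sep: str) -> Decimal:
    """Convert a formatted amount string to a Decimal using explicit separators.

    Hypothetical helper for illustration only.
    """
    # Keep digits, the two separators, and a minus sign; this drops
    # currency symbols and whitespace.
    allowed = (thousands_sep, decimal_sep, "-")
    cleaned = "".join(ch for ch in text if ch.isdigit() or ch in allowed)
    # Remove the thousands separators, then normalise the decimal separator to ".".
    cleaned = cleaned.replace(thousands_sep, "").replace(decimal_sep, ".")
    return Decimal(cleaned)


# US convention: comma groups thousands, period marks decimals.
us_amount = parse_amount("$1,234.56", thousands_sep=",", decimal_sep=".")

# Common European convention: period groups thousands, comma marks decimals.
eu_amount = parse_amount("1.234,56 €", thousands_sep=".", decimal_sep=",")

# The same text read under both conventions gives different values.
read_as_us = parse_amount("1,000", thousands_sep=",", decimal_sep=".")
read_as_eu = parse_amount("1,000", thousands_sep=".", decimal_sep=",")
```

In this sketch both `us_amount` and `eu_amount` come out as 1234.56, while "1,000" is one thousand under the US convention and one under the European one. That is the kind of misreading the amount format setting is meant to prevent.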

#### Date Format:

The date format also varies by region, with different countries having different conventions. Here are the most common formats:

1. Day-Month-Year (DD-MM-YY or DD.MM.YY): In many European countries, the date is specified in day-month-year format. For example, "21.05.24" represents May 21, 2024.
2. Month-Day-Year (MM-DD-YY or MM/DD/YY): In the United States, the month-day-year format is often used. For example, "05/21/24" represents May 21, 2024.
3. Year-Month-Day (YY-MM-DD or YY/MM/DD): In some other countries, the year-month-day format is preferred. For example, "24/05/21" represents May 21, 2024.

It is important to note the specific format to avoid misunderstandings, especially in international communications or financial transactions.
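The three conventions above can be sketched with Python's standard `datetime.strptime`. The `DATE_PATTERNS` mapping and the `parse_date` helper are hypothetical names used only for illustration; they do not describe how the product parses dates internally.

```python
from datetime import date, datetime

# Hypothetical mapping from a convention label to a strptime pattern.
DATE_PATTERNS = {
    "DD.MM.YY": "%d.%m.%y",
    "MM/DD/YY": "%m/%d/%y",
    "YY/MM/DD": "%y/%m/%d",
}


def parse_date(text: str, convention: str) -> date:
    """Parse a date string according to an explicitly chosen convention."""
    return datetime.strptime(text, DATE_PATTERNS[convention]).date()


# The three examples from the list above all describe the same day.
european = parse_date("21.05.24", "DD.MM.YY")
american = parse_date("05/21/24", "MM/DD/YY")
year_first = parse_date("24/05/21", "YY/MM/DD")

# An ambiguous string gives a different date under each convention.
as_month_day = datetime.strptime("05/06/24", "%m/%d/%y").date()  # May 6, 2024
as_day_month = datetime.strptime("05/06/24", "%d/%m/%y").date()  # June 5, 2024
```

Because a string such as "05/06/24" parses without error under more than one pattern, the format cannot be detected reliably from the text alone and has to be selected explicitly.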

* **Select the sample documents:** Select the sample documents you want to import. This can be done by uploading the files from your local computer or by selecting documents from an already connected location.

<figure><img src="/files/KLpKx7fhB3e9x62I2QCz" alt=""><figcaption></figcaption></figure>

* **Configure the document types and subtypes (if required):** If your system supports different document types or subtypes, assign the appropriate type to each imported document. This will help the system to categorize and process the documents correctly.
* **Start the import process:** Confirm the selection of documents and start the import process. Depending on the size and number of documents, this process may take some time.

<figure><img src="/files/fY8TBpjMIifb7XfsNQDf" alt=""><figcaption></figcaption></figure>

* **Check the import status:** Check the status of the import process to confirm that all documents were imported without errors and processed correctly.
* **Train the model:** After the documents are imported, use them to train the document processing system model. Perform training according to the system's instructions to make sure it can process the sample data effectively.

<figure><img src="/files/CPUIzYKT2RswIyv79Lxn" alt=""><figcaption></figcaption></figure>

<figure><img src="/files/6edLm11fixtJvaDTf4Nc" alt="" width="284"><figcaption></figcaption></figure>

Regularly adding sample documents for training helps keep your document processing system up to date and its processing accurate and efficient.

