Advanced Settings

In some documents, table structures can be complex: they may span multiple lines, contain grouped information, or include unnecessary extra rows. The Advanced Settings in Training Mode let you fine-tune table extraction for such cases, improving accuracy and consistency.

To access these settings, activate Training Mode and click the Settings gear icon in the top action bar:

Header Row Count

Use this setting to define how many lines make up the table header.

Some tables have multi-line headers. For example, this table’s header spans two lines:

Set the Header row count to match:

Why is this important?

If you don’t set this, DocBits may treat the second line as data instead of part of the header, leading to extraction errors:

Before:

After:
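Conceptually, the header row count tells the extractor how many leading rows to fold into one header before data starts. The following is a minimal sketch of that idea only, not DocBits' actual implementation; the function name `merge_header_rows` and the sample table are hypothetical.

```python
from typing import List, Tuple


def merge_header_rows(
    rows: List[List[str]], header_row_count: int
) -> Tuple[List[str], List[List[str]]]:
    """Fold the first `header_row_count` rows into one header, column by column.

    Hypothetical helper for illustration; it only models the idea of a
    multi-line header and is not a DocBits API.
    """
    header_rows = rows[:header_row_count]
    data_rows = rows[header_row_count:]
    # zip(*header_rows) walks the header column by column; empty cells are skipped.
    header = [
        " ".join(cell.strip() for cell in column if cell.strip())
        for column in zip(*header_rows)
    ]
    return header, data_rows


# Sample table whose header spans two lines ("Unit" / "price").
table = [
    ["Pos", "Description", "Unit"],
    ["", "", "price"],
    ["1", "Widget", "4.50"],
]

# With a count of 1, the second header line leaks into the data rows.
header_wrong, data_wrong = merge_header_rows(table, header_row_count=1)

# With a count of 2, both header lines are merged and the data is clean.
header_ok, data_ok = merge_header_rows(table, header_row_count=2)
```

In this sketch, a count of 1 leaves the row `["", "", "price"]` in the data, which mirrors the extraction error described above; a count of 2 yields the header `Unit price` and a single data row.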

Move Extra Rows to Trash

Use this to discard unwanted multi-line entries, such as overflow descriptions.

In this example, the description spills into multiple rows, but only the first line is relevant:

Enable Move Extra Rows to Trash to remove the overflow:

Result after mapping:
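As a rough illustration of what discarding overflow rows means, the sketch below separates rows that carry a value in a key column from continuation rows that do not. How DocBits actually identifies extra rows is not specified here; the empty-key rule, the function name `drop_overflow_rows`, and the sample data are assumptions made for this example.

```python
from typing import Dict, List, Tuple

Row = Dict[str, str]


def drop_overflow_rows(rows: List[Row], key_column: str) -> Tuple[List[Row], List[Row]]:
    """Split rows into (kept, trashed).

    Assumption for this sketch: a row is an overflow row when its
    key column is empty. This is not a DocBits API.
    """
    kept: List[Row] = []
    trashed: List[Row] = []
    for row in rows:
        if row.get(key_column, "").strip():
            kept.append(row)
        else:
            trashed.append(row)
    return kept, trashed


# A description that spills over into two extra rows.
rows = [
    {"Position": "1", "Description": "Steel bolt M8"},
    {"Position": "", "Description": "galvanized, DIN 933"},
    {"Position": "", "Description": "packed in boxes of 100"},
    {"Position": "2", "Description": "Washer"},
]

kept, trashed = drop_overflow_rows(rows, key_column="Position")
```

Here `kept` holds the two line items with only the first line of each description, and `trashed` holds the two overflow rows.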

Minimum Grouped Rows

Use this when rows need to be grouped together under one main row (e.g. line items with multiple sub-lines).

Here, only three out of six rows are relevant. Two key columns are mapped (e.g. Position, Description), while others are treated as custom fields.

Start by setting Header row count and Minimum grouped rows:

Also enable Move Extra Rows to Trash to clean up irrelevant data:

Then define the grouping key column, e.g. Position:

Result:
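The grouping idea can be sketched as follows: a row with a value in the grouping key column starts a new group, and the rows that follow without a key value are attached to it. This is a conceptual sketch under that assumption, not DocBits' implementation; `group_rows` and the sample data are hypothetical, and the Minimum grouped rows setting itself is not modeled.

```python
from typing import Any, Dict, List

Row = Dict[str, str]


def group_rows(rows: List[Row], key_column: str) -> List[Dict[str, Any]]:
    """Attach sub-rows to the preceding row that has a value in `key_column`.

    Hypothetical helper for illustration only. Rows that appear before
    the first key row have no group to join and are skipped.
    """
    groups: List[Dict[str, Any]] = []
    for row in rows:
        if row.get(key_column, "").strip():
            # A filled key column marks the main row of a new group.
            groups.append({"main": row, "sub_rows": []})
        elif groups:
            groups[-1]["sub_rows"].append(row)
    return groups


rows = [
    {"Position": "10", "Description": "Steel bolt M8"},
    {"Position": "", "Description": "Batch 4711"},
    {"Position": "", "Description": "Origin: DE"},
    {"Position": "20", "Description": "Washer"},
    {"Position": "", "Description": "Batch 4712"},
]

groups = group_rows(rows, key_column="Position")
for group in groups:
    print(group["main"]["Position"], len(group["sub_rows"]))
```

With the sample data, position 10 collects two sub-rows and position 20 collects one.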

Reverse Grouping

Use this when the grouping row appears after the rows it should group.

If the rows to be grouped appear above the row that contains the grouping key, enable this option:

Enable Reverse grouping, group by a main column (e.g. Net amount), and use Move Extra Rows to Trash if needed:

Final result:
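Reverse grouping can be pictured as the mirror image of normal grouping: rows without a key value are collected first and attached to the next row that does have one. The sketch below illustrates that idea under this assumption; it is not DocBits' implementation, and `group_rows_reversed` and the sample data are hypothetical.

```python
from typing import Any, Dict, List

Row = Dict[str, str]


def group_rows_reversed(rows: List[Row], key_column: str) -> List[Dict[str, Any]]:
    """Attach preceding rows to the next row that has a value in `key_column`.

    Hypothetical helper for illustration only. Rows left over after the
    last key row have no group to join and are dropped.
    """
    groups: List[Dict[str, Any]] = []
    pending: List[Row] = []
    for row in rows:
        if row.get(key_column, "").strip():
            # The key row closes the group made of the rows collected so far.
            groups.append({"main": row, "sub_rows": pending})
            pending = []
        else:
            pending.append(row)
    return groups


rows = [
    {"Description": "Steel bolt M8", "Net amount": ""},
    {"Description": "Batch 4711", "Net amount": ""},
    {"Description": "", "Net amount": "120.00"},
    {"Description": "Washer", "Net amount": ""},
    {"Description": "", "Net amount": "15.50"},
]

groups = group_rows_reversed(rows, key_column="Net amount")
for group in groups:
    print(group["main"]["Net amount"], len(group["sub_rows"]))
```

In the sample, the row carrying the net amount 120.00 picks up the two rows above it, and the row carrying 15.50 picks up one.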

Summary

Use the Advanced Settings to teach DocBits how to accurately handle more complex or inconsistent table structures. These settings improve extraction precision by accounting for:

  • Multi-line headers

  • Multi-row descriptions

  • Grouped line items

  • Reverse order of grouped data

Enabling these options during training ensures DocBits remembers the correct layout for future documents from the same supplier.
