Advanced Settings

In the table extraction view, you will find the menu item Settings in the upper action bar (make sure that the training mode is activated). If you click on the gear icon, a window will open in which you will find the Advanced Settings.

Below functionalities are available in general settings:

Header row count

Here you can define the number of lines of a table header. For example, the table header line can be two lines:

Accordingly, the value in “Header row count” is set to two

Why is this needed? It might be that DocBits does not recognize the second line in the table header as part of the header line. In this case, it incorrectly inserts it into the table as an extracted value. This can be easily prevented with this function.

Example before

Example after

Move Extra Rows to Trash

In this example, the item description in the table spans several rows, but you only need the first one. To extract only this and include it in the Description column, select Move Extra Rows to Trash.

After naming the columns and mapping them to position, you get the following result

The functionalities below are available in the advanced settings:

Minimum grouped rows

Enter the minimum number of rows in your grouped column here.

In this table you see six rows of which only three are relevant for you. In the first two columns there are two criteria that have to be extracted separately. These will be your mapped columns all the other ones have to be trained as custom columns. And this is how it works step by step:

Select the two header rows as well as two minimum grouped rows as these should be grouped to one row.

Also select the Move extra rows to Trash option to be able to train all the other columns as custom columns.

Name the first column Position and group on that one.

After naming all the columns and training the values, this is your result:

Reverse grouping

If you want to combine all the rows above the grouped attribute, check the box here.

In this example, the table starts with a row that is above all other information but also needs to be extracted along with the information below it. It could be that DocBits (DOC²) extracts this row as an additional row and the grouping of the information, e.g. by position, does not work properly.

After grouping on net amount, checking the box, selecting the Move extra rows to Trash option

After naming all the columns, this is your result.

Last updated