In the table extraction view, you will find the menu item Settings in the upper action bar (make sure that the training mode is activated). If you click on the gear icon, a window will open in which you will find the Advanced Settings.
Below functionalities are available in general settings:
Here you can define the number of lines of a table header. For example, the table header line can be two lines:
Accordingly, the value in “Header row count” is set to two
Why is this needed? It might be that DocBits does not recognize the second line in the table header as part of the header line. In this case, it incorrectly inserts it into the table as an extracted value. This can be easily prevented with this function.
Example before
Example after
In this example, the item description in the table spans several rows, but you only need the first one. To extract only this and include it in the Description column, select Move Extra Rows to Trash.
After naming the columns and mapping them to position, you get the following result
The functionalities below are available in the advanced settings:
Enter the minimum number of rows in your grouped column here.
In this table you see six rows of which only three are relevant for you. In the first two columns there are two criteria that have to be extracted separately. These will be your mapped columns all the other ones have to be trained as custom columns. And this is how it works step by step:
Select the two header rows as well as two minimum grouped rows as these should be grouped to one row.
Also select the Move extra rows to Trash option to be able to train all the other columns as custom columns.
Name the first column Position and group on that one.
After naming all the columns and training the values, this is your result:
If you want to combine all the rows above the grouped attribute, check the box here.
In this example, the table starts with a row that is above all other information but also needs to be extracted along with the information below it. It could be that DocBits (DOC²) extracts this row as an additional row and the grouping of the information, e.g. by position, does not work properly.
After grouping on net amount, checking the box, selecting the Move extra rows to Trash option
After naming all the columns, this is your result.