Training data
Tabelle public.training_data
public.training_data
Spalten
id
: internalorg_id
: ID to organisationdoc_type
: DOC Typedoc_origin
: Doc Origindoc_locale
: Doc Localedoc_language
: Languagefirst_page_content
: textsupplier_name
: Supplier Namesupplier_vat
: internaldoc_created_on
: timestamp with time zoneis_training_sample
: booleansample_doc_id
: internalsample_file_name
: internalcreated_on
: timestamp with time zonelast_modified_on
: timestamp with time zoneis_deleted
: booleanis_table_training_sample
: boolean
Analyse & Beschreibung
Technical Documentation for public.training_data
Table
public.training_data
TableIntroduction
The public.training_data
table appears to be designed for storing various document-related data, likely used for training models or systems that process and analyze document contents. The table includes information about the document type, origin, language, supplier details, and timestamps related to the creation and modification of records, among other attributes. It seems particularly focused on invoices and medical receipts, which suggests its use in financial or healthcare data processing applications.
Column Descriptions
id
Meaning: Unique identifier for each record in the table.
Role: Primary key.
Typical Contents: A universally unique identifier (UUID) in character format.
Note: Essential for uniquely identifying each entry.
org_id
Meaning: Identifier for the organization associated with the document.
Role: Foreign key that links to an organization record.
Typical Contents: UUID representing an organization.
Note: Helps in associating documents with specific organizations.
doc_type
Meaning: Type of document stored in the record.
Role: Descriptive field.
Typical Contents: Values such as 'INVOICE' or 'MEDICAL_RECEIPT'.
Note: Useful for categorizing the documents.
doc_origin
Meaning: Origin or source country code of the document.
Role: Descriptive field.
Typical Contents: Country code like 'DE' for Germany.
Note: Indicates the geographical origin of the document.
doc_locale
Meaning: Locale information of the document.
Role: Descriptive field.
Typical Contents: Locale code such as 'en_DE'.
Note: Combines language and region for localization purposes.
doc_language
Meaning: Language in which the document is written.
Role: Descriptive field.
Typical Contents: Language code like 'en' or 'de'.
Note: Indicates the primary language used in the document.
first_page_content
Meaning: Content of the first page of the document.
Role: Descriptive field.
Typical Contents: Text data containing the document's first page content.
Note: May be used for text analysis or processing.
supplier_name
Meaning: Name of the supplier associated with the document.
Role: Descriptive field.
Typical Contents: Supplier's name in text format.
Note: Important for identifying the source or creator of the document.
supplier_vat
Meaning: VAT identification number of the supplier.
Role: Descriptive field.
Typical Contents: VAT ID, e.g., 'DE 129375678'.
Note: Useful for fiscal and tax-related processes.
doc_created_on
Meaning: Timestamp when the document was created.
Role: Descriptive field.
Typical Contents: Date and time in UTC format.
Note: Provides temporal context for the document creation.
is_training_sample
Meaning: Indicator of whether the record is used as a training sample.
Role: Technical field.
Typical Contents: Boolean value (
True
orFalse
).Note: Identifies records specifically marked for training purposes.
sample_doc_id
Meaning: Identifier for the sample document.
Role: Descriptive field or possible foreign key.
Typical Contents: UUID linking to another document.
Note: May reference a related or original document used for sampling.
sample_file_name
Meaning: Name of the file associated with the document sample.
Role: Descriptive field.
Typical Contents: File name in text format.
Note: Useful for file management or retrieval processes.
created_on
Meaning: Timestamp when the record was created.
Role: Technical field.
Typical Contents: Date and time in UTC format.
Note: Indicates when the record was added to the database.
last_modified_on
Meaning: Timestamp of the last modification of the record.
Role: Technical field.
Typical Contents: Date and time in UTC format.
Note: Tracks when the record was last updated.
is_deleted
Meaning: Indicator of whether the record has been marked as deleted.
Role: Technical field.
Typical Contents: Boolean value (
True
orFalse
).Note: Supports soft delete functionality.
is_table_training_sample
Meaning: Indicator of whether the record is a table-specific training sample.
Role: Technical field.
Typical Contents: Boolean value (
True
orFalse
).Note: Differentiates from general training samples, focusing on table data.
Was this helpful?