Overview
Extracting and validating data from documents or other digital content is an important component of a document capture solution. Extracted data can range from simple indexing information, such as form type, name, and SSN, to comprehensive extraction of all fields on a complex form. The data may appear as barcodes, handwriting, machine-printed text, or checkboxes. Regardless of the format, the capture solution must accurately locate and extract the information. When accuracy is in question, the solution must provide an efficient way for a subject matter expert to review and correct the results.
Data Extraction and Validation Activities
Data extraction and validation activities revolve around defining what information is to be extracted, which document it appears in, and where on that document it is expected. Business rules, OCR/ICR confidence thresholds, validation workflows, and user screens must then be established and configured. Specific tasks include:
1. Define extraction requirements
2. Evaluate sample extraction results
3. Develop and implement extraction rules
4. Map extraction data to output schema
5. Define validation business rules and OCR/ICR confidence thresholds
6. Configure validation workflows and user screens
7. Assess forms/document redesign to increase extraction automation levels
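As a rough illustration of tasks 5 and 6, the sketch below shows one way a confidence threshold and a validation business rule could decide which extracted fields are accepted automatically and which are sent to a subject matter expert for review. It is a minimal sketch under stated assumptions, not a description of any particular capture product: the threshold value, the field names, the SSN format rule, and the sample data are all hypothetical.

```python
import re
from dataclasses import dataclass

# Hypothetical threshold; in practice it would be tuned per field
# after evaluating sample extraction results.
CONFIDENCE_THRESHOLD = 0.90

# Assumed business rule: an SSN must look like 123-45-6789.
SSN_PATTERN = re.compile(r"\d{3}-\d{2}-\d{4}")


@dataclass
class ExtractedField:
    name: str
    value: str
    confidence: float  # OCR/ICR score, assumed to be in the range 0 to 1


def ssn_rule(value: str) -> bool:
    """Return True when the value matches the assumed SSN format."""
    return SSN_PATTERN.fullmatch(value) is not None


# Business rules keyed by field name (illustrative only).
BUSINESS_RULES = {"ssn": ssn_rule}


def needs_review(field: ExtractedField) -> bool:
    """A field goes to review if its confidence is low or a rule fails."""
    if field.confidence < CONFIDENCE_THRESHOLD:
        return True
    rule = BUSINESS_RULES.get(field.name)
    return rule is not None and not rule(field.value)


def route(fields):
    """Split fields into auto-accepted values and names needing review."""
    accepted = {}
    review = []
    for field in fields:
        if needs_review(field):
            review.append(field.name)
        else:
            accepted[field.name] = field.value
    return accepted, review


# Made-up sample data for illustration.
fields = [
    ExtractedField("form_type", "W-9", 0.99),   # confident, no rule
    ExtractedField("name", "Jane Doe", 0.72),   # low confidence
    ExtractedField("ssn", "123-45-678", 0.97),  # confident, fails format rule
]
accepted, review = route(fields)
```

In this example only the form type is accepted automatically; the name is flagged for low confidence and the SSN for failing the format rule, so both would appear on a validation screen.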
InfoCap Data Extraction and Validation Services
1. Efficiently evaluate forms and other content for extraction expectations
2. Evaluate extraction requirements and recommend appropriate extraction technologies
3. Develop extraction rules and validation properties
4. Translate extraction requirements into a schema for internal and external processing
5. Design and configure validation processes to meet accuracy goals