Which AWS machine-learning service is designed to automatically extract printed text, handwriting, and structured data from scanned documents and images?
Amazon Textract
What features does Amazon Textract provide?
Detection of printed and handwritten text (OCR), plus extraction of structured data such as key–value pairs from forms and tables
Which input formats does Amazon Textract accept?
JPEG, PNG, PDF, and TIFF
What output does Amazon Textract provide after a successful operation?
Structured extraction results that include the raw text, parsed key–value pairs and tables, semantic labels, confidence scores, and positional information
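As a rough illustration of how such results might be consumed, the sketch below walks a Textract-style response and collects each detected line with its confidence score and position. The `Blocks`, `BlockType`, `Text`, `Confidence`, and `Geometry.BoundingBox` fields follow the response shape Textract returns; the helper name `extract_lines`, the `min_confidence` parameter, and the hand-written `sample_response` are assumptions made for this example, not real service output.

```python
def extract_lines(response, min_confidence=0.0):
    """Collect LINE blocks from a Textract-style response dict.

    Hypothetical helper for illustration. Confidence is on a 0-100 scale.
    """
    lines = []
    for block in response.get("Blocks", []):
        # Skip PAGE, WORD, and other block types; keep whole lines only.
        if block.get("BlockType") != "LINE":
            continue
        confidence = block.get("Confidence", 0.0)
        if confidence < min_confidence:
            continue
        # Bounding-box values are ratios of the page width and height.
        box = block.get("Geometry", {}).get("BoundingBox", {})
        lines.append({
            "text": block.get("Text", ""),
            "confidence": confidence,
            "left": box.get("Left"),
            "top": box.get("Top"),
        })
    return lines


# Hand-written sample in the shape of a response (illustrative values only).
sample_response = {
    "Blocks": [
        {"BlockType": "PAGE"},
        {
            "BlockType": "LINE",
            "Text": "Invoice 1234",
            "Confidence": 99.2,
            "Geometry": {"BoundingBox": {"Left": 0.1, "Top": 0.05,
                                         "Width": 0.3, "Height": 0.02}},
        },
        {
            "BlockType": "LINE",
            "Text": "Total due",
            "Confidence": 62.5,
            "Geometry": {"BoundingBox": {"Left": 0.1, "Top": 0.8,
                                         "Width": 0.2, "Height": 0.02}},
        },
        {"BlockType": "WORD", "Text": "Invoice", "Confidence": 99.5},
    ]
}

for line in extract_lines(sample_response, min_confidence=90.0):
    print(line["text"], line["confidence"])
```

Filtering on the confidence score is one common way to decide which extracted values need human review; the threshold of 90 here is arbitrary.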
Can Amazon Textract process documents asynchronously?
Yes. Textract supports both modes:
- synchronous (real-time) operations for smaller documents
- asynchronous (batch) jobs for large or multi-page documents
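The two modes listed above can be sketched with the boto3 Textract client methods `detect_document_text` (synchronous) and `start_document_text_detection` / `get_document_text_detection` (asynchronous). This is a minimal sketch rather than a production pattern: the helper names, the polling interval, and the retry limit are assumptions, and the `client` argument is expected to be something like `boto3.client("textract")` created by the caller. Polling is used here for simplicity.

```python
import time


def detect_sync(client, image_bytes):
    """Synchronous call: send the document bytes and get the blocks back at once."""
    response = client.detect_document_text(Document={"Bytes": image_bytes})
    return response["Blocks"]


def detect_async(client, bucket, key, poll_seconds=5.0, max_polls=60):
    """Asynchronous job: start it on an S3 object, poll until done, then page results.

    poll_seconds and max_polls are illustrative defaults, not recommended values.
    """
    job = client.start_document_text_detection(
        DocumentLocation={"S3Object": {"Bucket": bucket, "Name": key}}
    )
    job_id = job["JobId"]

    for _ in range(max_polls):
        result = client.get_document_text_detection(JobId=job_id)
        status = result["JobStatus"]
        if status == "SUCCEEDED":
            break
        if status != "IN_PROGRESS":
            # This sketch treats FAILED and PARTIAL_SUCCESS alike as errors.
            raise RuntimeError(f"Textract job {job_id} ended with status {status}")
        time.sleep(poll_seconds)
    else:
        raise TimeoutError(f"Textract job {job_id} did not finish in time")

    # Large results are paginated; follow NextToken until it is absent.
    blocks = list(result["Blocks"])
    while "NextToken" in result:
        result = client.get_document_text_detection(
            JobId=job_id, NextToken=result["NextToken"]
        )
        blocks.extend(result["Blocks"])
    return blocks
```

A caller would typically pass `boto3.client("textract")` as `client`. Taking the client as a parameter keeps the helpers easy to exercise with a stub, without AWS credentials.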