Textract Flashcards

This deck aims to help retain concepts related to the Amazon Textract service. (5 cards)

1
Q

Which AWS machine-learning service is designed to automatically extract printed text, handwriting, and structured data from scanned documents and images?

A

Amazon Textract

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What features does Amazon Textract provide?

A
  • Text detection and extraction
  • Document analysis (names, addresses, dates)
  • Receipt analysis (prices, vendors, dates, line items)
  • Identity document analysis (abstract fields)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Which input formats does Amazon Textract accept?

A

JPEG, PNG, PDF, and TIFF

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What output does Amazon Textract provide after a successful operation?

A

Structured extraction results that include the raw text, parsed key–value pairs and tables, semantic labels, confidence scores, and positional information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Can Amazon Textract process documents asynchronously?

A

Textract supports:
- synchronous (real-time) operations for smaller documents
- asynchronous (batch) jobs for large or multi-page documents

How well did you know this?
1
Not at all
2
3
4
5
Perfectly