About Amazon Textract
Amazon Textract is a machine learning service from AWS that automatically extracts printed text, handwriting, tables, forms, key-value pairs, and structured data from scanned documents and images. It goes beyond simple OCR with specialized APIs for expense documents, identity documents, and mortgage lending packages. Pricing is pay-per-page with no upfront commitments, and a free tier is available for new customers.
AI Agent Use Cases
- Detect Document Text API for OCR of printed text and handwriting
- Analyze Document API with Forms, Tables, Queries, and Signatures detection
- Analyze Expense API for automated invoice and receipt parsing
- Analyze ID API for identity document (passport, driver's license) data extraction
- Analyze Lending API for mortgage document classification and extraction
- Asynchronous processing for multi-page PDF documents
Available Actions
These are the specific actions that AI agents can perform with this tool