Amazon Textract

by Solodev

Amazon Textract is a managed service that automatically extracts text and data from virtually any document. Scan, read, identify key information, and push it to services like Amazon S3 or Comprehend.

Product Features

Track your text with Amazon Textract! Now you can automatically extract printed text, handwriting, and data from any document and leverage it across your cloud services and applications with a fully managed solution. 

Amazon Textract uses machine learning to instantly “read” virtually any type of document to accurately extract text and data without the need for any manual effort or custom code. Amazon Textract goes beyond simple optical character recognition (OCR) to also identify the contents of fields in forms and information stored in tables.

  • Extract data quickly and accurately: Amazon Textract makes it easy to quickly and accurately extract data from documents, forms, and tables. Textract automatically detects a document’s layout and the key elements on the page, understands the data relationships in any embedded forms or tables, and extracts everything with its context intact. 
  • Powered by advanced AI: Extract text and structured data from documents using artificial intelligence (AI).
  • Designed for scalability: You can easily process millions of documents using Textract's text extraction APIs.
  • Go beyond simple optical character recognition (OCR): Unlike most OCR services, Textract lets you extract more than raw data by pulling the relationships, structure, and text from scanned documents.
  • No code or templates to maintain: Amazon Textract's pre-trained machine learning models eliminate the need to write code for data extraction, because they have already been trained on tens of millions of documents from across industries, including contracts, tax documents, sales orders, enrollment forms, benefit applications, insurance claims, policies, and more. 
  • Export data to your destination of choice: Once your text and data are captured, Textract can export them in JSON format or integrate with other AWS services including S3, Elasticsearch, Comprehend, or DynamoDB.
  • Improve security and compliance: With robust data privacy, encryption, security controls, and compliance support, Textract helps enhance your security posture. Meet rigorous standards including HIPAA, GDPR, and more.
  • Lower document processing costs: Amazon Textract provides data extraction at a very low cost – and you only pay for what you use. There are no upfront commitments or long-term contracts. 
  • Conduct human reviews: You can easily implement a layer of human oversight with Amazon Augmented AI (A2I) to manage nuanced or sensitive workflows and audit predictions. 
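To make the "relationships" idea concrete, here is a minimal Python sketch of how form fields might be read out of Textract-style JSON. It assumes the block layout returned by the AnalyzeDocument operation with the FORMS feature (KEY_VALUE_SET blocks linked to WORD blocks through CHILD and VALUE relationships). The sample response, its IDs and field values, and the helper names (related_ids, block_text, extract_form_fields) are all hypothetical and written by hand for illustration; no AWS call is made.

```python
from typing import Dict, List

# Hand-written sample shaped like an AnalyzeDocument (FORMS) response.
# IDs and field values are made up for illustration only.
SAMPLE_RESPONSE = {
    "Blocks": [
        {"Id": "k1", "BlockType": "KEY_VALUE_SET", "EntityTypes": ["KEY"],
         "Relationships": [{"Type": "VALUE", "Ids": ["v1"]},
                           {"Type": "CHILD", "Ids": ["w1", "w2"]}]},
        {"Id": "v1", "BlockType": "KEY_VALUE_SET", "EntityTypes": ["VALUE"],
         "Relationships": [{"Type": "CHILD", "Ids": ["w3", "w4"]}]},
        {"Id": "w1", "BlockType": "WORD", "Text": "Full"},
        {"Id": "w2", "BlockType": "WORD", "Text": "Name:"},
        {"Id": "w3", "BlockType": "WORD", "Text": "Jane"},
        {"Id": "w4", "BlockType": "WORD", "Text": "Doe"},
    ]
}


def related_ids(block: dict, rel_type: str) -> List[str]:
    """Collect the IDs a block points to through one relationship type."""
    ids: List[str] = []
    for rel in block.get("Relationships", []):
        if rel["Type"] == rel_type:
            ids.extend(rel["Ids"])
    return ids


def block_text(block: dict, by_id: Dict[str, dict]) -> str:
    """Join the text of the WORD blocks that are children of this block."""
    words = [
        by_id[i]["Text"]
        for i in related_ids(block, "CHILD")
        if by_id[i]["BlockType"] == "WORD"
    ]
    return " ".join(words)


def extract_form_fields(response: dict) -> Dict[str, str]:
    """Map each form key's text to the text of its linked value block(s)."""
    by_id = {b["Id"]: b for b in response["Blocks"]}
    fields: Dict[str, str] = {}
    for block in response["Blocks"]:
        is_key = (block["BlockType"] == "KEY_VALUE_SET"
                  and "KEY" in block.get("EntityTypes", []))
        if is_key:
            key = block_text(block, by_id)
            value = " ".join(
                block_text(by_id[v], by_id) for v in related_ids(block, "VALUE")
            )
            fields[key] = value
    return fields


if __name__ == "__main__":
    print(extract_form_fields(SAMPLE_RESPONSE))
```

In a real workflow the response would come from the Textract API rather than a hand-built dictionary, and the resulting field map could then be stored in S3 or DynamoDB or passed on to Comprehend.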

Product Details

Amazon Textract is a fully managed service for automatically extracting printed text, handwriting, and data from any document.

  • Extract data quickly and accurately
  • Powered by advanced AI 
  • Process millions of documents 
  • Capture text, data, and relationships
  • No code or templates to maintain
  • Improve security and compliance
  • Lower document processing costs
  • Conduct human reviews

Support

AWS provides online documentation and resources for Amazon Textract. For more information and help, visit the AWS Knowledge Center.

Instructions

Looking to deploy Amazon Textract? Want to create a custom solution for a specific use case? Contact us and speak to one of our AWS architects about your goals and how we can help. 

Pricing and installation

Amazon Textract

Amazon Textract Enterprise

  • Extract data quickly and accurately
  • Powered by advanced AI 
  • Process millions of documents 
  • Capture text, data, and relationships
  • No code or templates to maintain
  • Improve security and compliance
  • Lower document processing costs
  • Conduct human reviews

Amazon Textract is provided by a third party and is governed by separate terms of service, privacy policy, and support documentation.