Since basic OCRs typically spit out jumbled information when taking text from tables and forms, companies have to resort to manual data entry that could be both costly and time consuming. Textract can process millions of pages in just a few hours, which can lower document processing costs. Plus, customers can use it even though they don’t have previous machine learning experience.
Amazon says Textract can recognize information like names and social security numbers, allowing it to transfer table data from PDFs, for instance, into easily searchable spreadsheets. For much larger stacks of documents, the information it extracts could be used to build smart searches or could be loaded into databases. The bad news for some AWS customers is that the product is only available in some parts of the US (Ohio, N. Virginia, Oregon) and Ireland for now. It will, however, make its way to more regions over the next year.