We’re living in an era of transition from inventions of the Renaissance to the technological revolution of automation. Even so, a lot of valuable records and information are still locked inside unstructured documents. Transforming these documents into digital format to perform various operations requires complex AI techniques on top of simple optical character recognition.
Today, a number of industries including medicine, finance, law, and real-estate rely on human-intensive processes to convert forms and other documents into digital formats. Be it patients history locked in medical record files of hospitals, mortgage applications, or tax forms, these transcriptions all requires effort and human intervention on top of simple OCR techniques — that used to detect text without keeping the composition of document intact — to convert them into valuable digital assets.
Continue reading Amazon Textract — Going beyond optical character recognition (OCR)