Essays about: "OCR post-processing"

Found 3 essays containing the words OCR post-processing.

  1. 1. Post-processing of optical character recognition for Swedish addresses

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Moa Andersson; [2022]
    Keywords : OCR; OCR post-processing; OCR post-correction; NMT; Lexical model; OCR; efterbehandling för OCR; post-korrigering för OCR; NMT; lexikal modell;

    Abstract : ​​Optical character recognition (Optical Character Recognition (OCR)) has many applications, such as digitizing historical documents, automating processes, and helping visually impaired people read. However, extracting text from images into a digital format is not an easy problem to solve, and the outputs from the OCR frameworks often include errors. READ MORE

  2. 2. Exploring Machine Learning Solutions in the Context of OCR Post-Processing of Invoices

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Jacob Dwyer; Sara Bertse; [2022]
    Keywords : Machine learning; Optical character recognition; BERT; Error detection; Invoice; Maskininläsning; Optisk teckenläsning; BERT; Feldetektering; Faktura;

    Abstract : Large corporations receive and send large volumes of invoices containing various fields detailing a transaction. Such fields include VAT, due date, total amount, etc. One common way to automatize invoice processing is optical character recognition (OCR). This technology entails automatic reading of characters from scanned images. READ MORE

  3. 3. Form data enriching using a post OCR clustering process : Measuring accuracy of field names and field values clustering

    University essay from Mittuniversitetet/Institutionen för informationssystem och –teknologi

    Author : Adil Aboulkacim; [2022]
    Keywords : Optical Character Recognition; Form Processing; Data enrichment; Optisk teckenläsning; Formulärbearbetning; Databerikning;

    Abstract : Med OCR teknologier kan innehållet av ett formulär läsas in, positionen av varje ord och dess innehåll kan extraheras, dock kan relationen mellan orden ej förstås. Denna rapport siktar på att lösa problemet med att berika data från ett strukturerat formulär utan någon förinställd konfiguration genom användandet utav klustring. READ MORE