Essays about: "Text-extraction"

Showing result 1 - 5 of 15 essays containing the word Text-extraction.

  1. 1. Accurately extracting information from a finite set of different report categories and formats

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Jonatan Holmbäck; [2023]
    Keywords : Text Extraction; PDF; Excel; Text Parsing; Data Analysis; Text Extrahering; PDF; Excel; Text Parsing; Data Analys;

    Abstract : POC Sports (hereafter simply POC) is a company that manufactures gear and accessories for winter sports as well as cycling. Their mission is to “Protect lives and reduce the consequences of accidents for athletes and anyone inspired to be one”. READ MORE

  2. 2. Automated Image Pre-Processing for Optimized Text Extraction Using Reinforcement Learning and Genetic Algorithms

    University essay from

    Author : Rahmat Rohoullah; Månsson Joakim; [2023]
    Keywords : BRISK; YOLO; Reinforcement learning; Evolutionary algorithm; OCR; Image pre-processing; Computer vision; BRISK; YOLO; Förstärkningslärning; Evolutionär algorithm; OCR; Bildförbehandling; Datorseende;

    Abstract : This project aims to develop an automated image pre-processing chain to extract valuable information from appliance labels before recycling. The primary goal is to improve optical character recognition accuracy by addressing noise issues using reinforcement learning and an evolutionary algorithm. READ MORE

  3. 3. Generic Data Harvester

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : William Asp; Johannes Valck; [2022]
    Keywords : News; Articles; Newspapers; Web crawler; Web site parsing; Optimization; Web robot; Web spider; Web data extraction; HTML; Scrapy; Nyheter; Artiklar; Tidningar; Sökrobot; Analys av hemsida; Optimering; Webbrobot; Webbspindel; Data extrahering hemsidor; HTML; Scrapy;

    Abstract : This report goes through the process of developing a generic article scraper which shall extract relevant information from an arbitrary web article. The extraction is implemented by searching and examining the HTML of the article, by using Python and XPath. READ MORE

  4. 4. Computer Vision for Document Image Analysis and Text Extraction

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Omar Benchekroun; [2022]
    Keywords : Optical Character Recognition; Document Analysis; Text Extraction; Transformers; Convolutional Neural Networks; Optisk teckenigenkänning; dokumentanalys; textutvinning; transformatorer; konvolutionella neurala nätverk;

    Abstract : Automatic document processing has been a subject of interest in the industry for the past few years, especially with the recent technological advances in Machine Learning and Computer Vision. This project investigates in-depth a major component used in Document Image Processing known as Optical Character Recognition (OCR). READ MORE

  5. 5. "For the fifty-eleventh time" : Examining cross-linguistic properties of hyperbolic numerals and quasi-numeral expressions through parallel text extraction

    University essay from Stockholms universitet/Institutionen för lingvistik

    Author : Amanda Kann; [2022]
    Keywords : hyperbole; hyperbolic quantification; numeral typology; parallel texts; quasi-numerals; hyperbol; hyperbolisk kvantifiering; kvasinumeriska uttryck; parallelltexter; räkneordstypologi;

    Abstract : In some languages, vague and exaggerated quantities can be represented using certain conventionalised numeral expressions with cross-linguistically varying values, such as Danish hundredesytten "117". Hyperbolic quantities can also be expressed using other quantifier expressions (such as English zillion) which, while they do not denote a specific numerical value, have both structural and functional similarities with exact numerals. READ MORE