Essays about: "Textrensning"

Found 1 essay containing the word Textrensning.

  1. Neural Cleaning of Swedish Textual Data : Using BERT-based methods for Token Classification of Running and Non-Running Text

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Andreas Ericsson; [2023]
    Keywords : Natural Language Processing; Text Cleaning; Transformers; BERT; Token Classification; Deep Learning; Språkteknologi; Textrensning; Token-klassificering; Djupinlärning

    Abstract : Modern natural language processing methods require large textual datasets to function well. A common method is to scrape the internet to acquire the needed data. This does, however, come with the issue that some of the data may be unwanted – for instance, spam websites.