Essays about: "corpus size"

Showing result 1 - 5 of 32 essays containing the words corpus size.

  1. 1. A lightweight deep learning architecture for text embedding : Comparison between the usage of Transformers and Mixers for textual embedding

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Corentin Royer; [2023]
    Keywords : Deep Learning; Entity Retrieval; Mixer; Transformer;

    Abstract : Text embedding is a widely used method for comparing pieces of text together by mapping them to a compact vector space. One such application is deduplication which consists in finding textual records that refer to the same underlying idea in order to merge them or delete one of them. READ MORE

  2. 2. Applicability of GPT models to high-performance compute languages

    University essay from Uppsala universitet/Högenergifysik

    Author : Urlich Icimpaye; [2023]
    Keywords : Data Science; Machine Learning; Artificial Intelligence; Transformers;

    Abstract : This thesis aims to investigate the feasibility of generating code in high-performance computing languages such as C++ with neural networks. This has been investigated by transfer learning publicly available pretrained transformers on C++ code. READ MORE

  3. 3. Analysing CSR reporting over the years, company size, region, and sector through dictionary-based text mining

    University essay from Högskolan Dalarna/Institutionen för information och teknik

    Author : Anuj Singhvi; Dorna Jahangoshay Sarijlou; [2023]
    Keywords : Corporate Social Responsibility; Sustainability Reporting; Natural Language Processing; Text Mining; CSR Dictionaries;

    Abstract : As Corporate Social Responsibility (CSR) reports become more prevalent and systematised, there is a strong need to develop approaches that seek to analyse the contents of these reports. In this thesis, we present two valuable contributions. READ MORE

  4. 4. Exploring Automatic Synonym Generation for Lexical Simplification of Swedish Electronic Health Records

    University essay from Linköpings universitet/Institutionen för hälsa, medicin och vård

    Author : Anna Jänich; [2023]
    Keywords : Lexical Simplification; EHR; Electronic Health Records; Swedish EHRs; patient records; complex medical terminology; synonym generation; synonym replacement; Word2Vec; BERT; NLP; ML;

    Abstract : Electronic health records (EHRs) are used in Sweden's healthcare systems to store patients' medical information. Patients in Sweden have the right to access and read their health records. Unfortunately, the language used in EHRs is very complex and presents a challenge for readers who lack medical knowledge. READ MORE

  5. 5. Grammatical Error Correction for Learners of Swedish as a Second Language

    University essay from Uppsala universitet/Institutionen för lingvistik och filologi

    Author : Martina Nyberg; [2022]
    Keywords : grammatical error correction; swedish; machine translation; language modeling; machine learning;

    Abstract : Grammatical Error Correction refers to the task of automatically correcting errors in written text, typically with respect to texts written by learners of a second language. The work in this thesis implements and evaluates two methods to Grammatical Error Correction for Swedish. READ MORE