Essays about: "N-gram"

Showing result 1 - 5 of 27 essays containing the word N-gram.

  1. 1. IŻ SWÓJ JĘZYK MAJĄ! An exploration of the computational methods for identifying language variation in Polish

    University essay from Göteborgs universitet / Institutionen för filosofi, lingvistik och vetenskapsteori

    Author : Maria Irena Szawerna; [2023-06-19]
    Keywords : language variation; Polish; diachronic linguistics; part-of-speech tagging; lemmatization; corpus linguistics;

    Abstract : Computational approaches to language variation continue to contribute in a relevant way to various fields, including Natural Language Processing (NLP) and linguistics. Being able to accommodate variation within natural language increases the robustness of NLP models and their usefulness in real-life applications; simultaneously, detecting and describing variation and trends that govern it is one of the main goals of sociolinguistics and historical linguistics, meaning that some of the advances in NLP can contribute to these fields as well. READ MORE

  2. 2. Models, Keys, and Cryptanalysis: Evaluating historical statistical language models in cryptanalysis of homophonic substitution ciphers

    University essay from Göteborgs universitet/Institutionen för filosofi, lingvistik och vetenskapsteori

    Author : Filip Fornmark; [2023-01-19]
    Keywords : statistical language models; cryptanalysis; historical cryptology; homophonic substitution;

    Abstract : This thesis presents an empirical study connected to historical cryptography and especially within the framework of the research project DECRYPT. One of the research questions in the DECRYPT project relates to the use of language models for automatic cryptanalysis. READ MORE

  3. 3. Domain Adaptation with N-gram Language Models for Swedish Automatic Speech Recognition : Using text data augmentation to create domain-specific n-gram models for a Swedish open-source wav2vec 2.0 model

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Viktor Enzell; [2022]
    Keywords : Automatic Speech Recognition; Domain Adaptation; Language Models; Ngram Models; Wav2vec2; Taligenkänning; Domänanpassning; Språkmodeller; N-gramModeller; Wav2vec2;

    Abstract : Automatic Speech Recognition (ASR) enables a wide variety of practical applications. However, many applications have their own domain-specific words, creating a gap between training and test data when used in practice. READ MORE

  4. 4. Comparing state-of-the-art machine learning malware detection methods on Windows

    University essay from

    Author : Filip Ahlgren; [2021]
    Keywords : Malware; Machine Learning; Static Analysis;

    Abstract : Background. Malware has been a major issue for years and old signature scanning methods for detecting malware are outdated and can be bypassed by most advanced malware. With the help of machine learning, patterns of malware behavior and structure can be learned to detect the more advanced threats that are active today. Objectives. READ MORE

  5. 5. Transformer decoder as a method to predict diagnostic trouble codes in heavy commercial vehicles

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Haris Poljo; [2021]
    Keywords : Diagnostic Trouble Codes; Deep Learning; Machine Learning; Neural Networks; Transformer Decoder; Felkoder; Djupinlärning; Maskininlärning; Neurala Nätverk; Transformer Decoder;

    Abstract : Diagnostic trouble codes (DTC) have traditionally been used by mechanics to figure out what is wrong with a vehicle. A vehicle generates a DTC when a specific condition in the vehicle is met. This condition has been defined by an engineer and represents some fault that has happened. READ MORE