Essays about: "parallel corpora"

Showing result 1 - 5 of 13 essays containing the words parallel corpora.

  1. 1. Syntax-based Concept Alignment for Machine Translation

    University essay from Göteborgs universitet/Institutionen för data- och informationsteknik

    Author : Arianna Masciolini; [2023-03-30]
    Keywords : computational linguistic; machine translation; concept alignment; syntax; dependency parsing; Universal Dependencies; Grammatical Framework;

    Abstract : This thesis presents a syntax-based approach to Concept Alignment (CA), the task of finding semantical correspondences between parts of multilingual parallel texts, with a focus on Machine Translation (MT). Two variants of CA are taken into account: Concept Extraction (CE), whose aim is to identify new concepts by means of mere linguistic comparison, and Concept Propagation (CP), which consists in looking for the translation equivalents of a set of known concepts in a new language. READ MORE

  2. 2. Head-to-head Transfer Learning Comparisons made Possible : A Comparative Study of Transfer Learning Methods for Neural Machine Translation of the Baltic Languages

    University essay from Uppsala universitet/Institutionen för lingvistik och filologi

    Author : Mathias Stenlund; [2023]
    Keywords : machine translation; transfer learning; Latvian; Lithuanian; low-resource languages; transformers; parent language; child language; comparative study;

    Abstract : The struggle of training adequate MT models using data-hungry NMT frameworks for low-resource language pairs has created a need to alleviate the scarcity of sufficiently large parallel corpora. Different transfer learning methods have been introduced as possible solutions to this problem, where a new model for a target task is initialized using parameters learned from some other high-resource task. READ MORE

  3. 3. A Hybrid Approach to Hate Speech Detection

    University essay from Umeå universitet/Institutionen för datavetenskap

    Author : Simon Rickardsson; [2023]
    Keywords : ;

    Abstract : An interesting question is to what extent can background knowledge help in the context of text classification. To address this in more detail, can a traditional rulebased classifier help boost the accuracy of learned models? We explore this here for detecting hate speech and offensive language in online text. READ MORE

  4. 4. NLP methods for the automatic generation of exercises for second language learning from parallel corpus data

    University essay from Göteborgs universitet/Institutionen för filosofi, lingvistik och vetenskapsteori

    Author : Arianna Zanetti; [2020-09-25]
    Keywords : ICALL; language learning; parallel corpus; exercise generation;

    Abstract : Intelligent Computer Assisted Language Learning (ICALL), or Intelligent Computer Assisted Language Instruction (ICALI), is a field of research that combines Artificial Intelligence and Computer Assisted Language Learning (CALL) in order to produce tools that can aid second language learners without human intervention. The automatic generation of exercises for language learners from a corpus enables the students to self-pace learning activities and offers a theoretically infinite, un-mediated and un-biased content. READ MORE

  5. 5. Turning Back to Again Using Parallel Texts : Structuring the Semantic Domain of Repetition and Restitution

    University essay from Stockholms universitet/Institutionen för lingvistik

    Author : Althea Löfgren; [2020]
    Keywords : typology; semantics; repetition; restitution; semantic maps; Multi-Dimensional Scaling; parallel texts;

    Abstract : This study investigates expressions akin to ‘again’, which inhabit the semantic domain of repetition and restitution, from a cross-linguistic perspective. Using massively parallel corpora as the primary source of data the aim of this study is to investigate whether the encoding of repetitive and restitutive meaning is a cross-linguistically valid difference and if there are any patterns in the language specific variation of the repetitive and restitutive domain. READ MORE