Essays about: "parallel corpora"
Showing result 1 - 5 of 13 essays containing the words parallel corpora.
-
1. Syntax-based Concept Alignment for Machine Translation
University essay from Göteborgs universitet/Institutionen för data- och informationsteknikAbstract : This thesis presents a syntax-based approach to Concept Alignment (CA), the task of finding semantical correspondences between parts of multilingual parallel texts, with a focus on Machine Translation (MT). Two variants of CA are taken into account: Concept Extraction (CE), whose aim is to identify new concepts by means of mere linguistic comparison, and Concept Propagation (CP), which consists in looking for the translation equivalents of a set of known concepts in a new language. READ MORE
-
2. Head-to-head Transfer Learning Comparisons made Possible : A Comparative Study of Transfer Learning Methods for Neural Machine Translation of the Baltic Languages
University essay from Uppsala universitet/Institutionen för lingvistik och filologiAbstract : The struggle of training adequate MT models using data-hungry NMT frameworks for low-resource language pairs has created a need to alleviate the scarcity of sufficiently large parallel corpora. Different transfer learning methods have been introduced as possible solutions to this problem, where a new model for a target task is initialized using parameters learned from some other high-resource task. READ MORE
-
3. A Hybrid Approach to Hate Speech Detection
University essay from Umeå universitet/Institutionen för datavetenskapAbstract : An interesting question is to what extent can background knowledge help in the context of text classification. To address this in more detail, can a traditional rulebased classifier help boost the accuracy of learned models? We explore this here for detecting hate speech and offensive language in online text. READ MORE
-
4. NLP methods for the automatic generation of exercises for second language learning from parallel corpus data
University essay from Göteborgs universitet/Institutionen för filosofi, lingvistik och vetenskapsteoriAbstract : Intelligent Computer Assisted Language Learning (ICALL), or Intelligent Computer Assisted Language Instruction (ICALI), is a field of research that combines Artificial Intelligence and Computer Assisted Language Learning (CALL) in order to produce tools that can aid second language learners without human intervention. The automatic generation of exercises for language learners from a corpus enables the students to self-pace learning activities and offers a theoretically infinite, un-mediated and un-biased content. READ MORE
-
5. Turning Back to Again Using Parallel Texts : Structuring the Semantic Domain of Repetition and Restitution
University essay from Stockholms universitet/Institutionen för lingvistikAbstract : This study investigates expressions akin to ‘again’, which inhabit the semantic domain of repetition and restitution, from a cross-linguistic perspective. Using massively parallel corpora as the primary source of data the aim of this study is to investigate whether the encoding of repetitive and restitutive meaning is a cross-linguistically valid difference and if there are any patterns in the language specific variation of the repetitive and restitutive domain. READ MORE