Essays about: "cross-lingual word embeddings"

Found 5 essays containing the words cross-lingual word embeddings.

  1. 1. Cross-Lingual and Genre-Supervised Parsing and Tagging for Low-Resource Spoken Data

    University essay from Uppsala universitet/Institutionen för lingvistik och filologi

    Author : Iliana Fosteri; [2023]
    Keywords : dependency parsing; part-of-speech tagging; low-resource languages; transcribed speech; large language models; cross-lingual learning; transfer learning; multi-task learning; Universal Dependencies;

    Abstract : Dealing with low-resource languages is a challenging task, because of the absence of sufficient data to train machine-learning models to make predictions on these languages. One way to deal with this problem is to use data from higher-resource languages, which enables the transfer of learning from these languages to the low-resource target ones. READ MORE

  2. 2. Zero-shot cross-lingual transfer learning for sentiment analysis on Swedish chat conversations

    University essay from Uppsala universitet/Institutionen för informationsteknologi

    Author : Siri Ann Göhl; [2022]
    Keywords : ;

    Abstract : As the field of machine learning grows, so do the publicly available datasets. However, in the field of natural language processing, datasets within specific languages and tasks can be scarce. READ MORE

  3. 3. Cross-lingual Word Embeddings Beyond Zero-shot Machine Translation

    University essay from Uppsala universitet/Institutionen för lingvistik och filologi

    Author : Chen Shifei; [2020]
    Keywords : word embeddings; cross-lingual word embeddings; machine translation; multilingual machine translation; neural machine translation; zero-shot; zero-shot machine translation;

    Abstract : Zero-shot translation is a transfer learning setup that refers to the ability of neural machine translation to generalize translation information into unseen language pairs. It provides an appealing solution to the lack of available materials for low-resource languages by transferring knowledge from high-resource languages. READ MORE

  4. 4. Exploring Cross-lingual Sublanguage Classification with Multi-lingual Word Embeddings

    University essay from Linköpings universitet/Statistik och maskininlärning

    Author : Min-Chun Shih; [2020]
    Keywords : ;

    Abstract : Cross-lingual text classification is an important task due to the globalization and the increased availability of multilingual data. This thesis explores the method of implementing cross-lingual classification on Swedish and English medical corpora. READ MORE

  5. 5. Low Supervision, Low Corpus size, Low Similarity! Challenges in cross-lingual alignment of word embeddings : An exploration of the limitations of cross-lingual word embedding alignment in truly low resource scenarios

    University essay from Uppsala universitet/Institutionen för lingvistik och filologi

    Author : Andrew Dyer; [2019]
    Keywords : word embeddings; cross-lingual; multilingual; low-resource; corpus size; Vecmap; FastText; alignment; orthogonal; eigenvalues; Laplacian; isospectral; isomorphic; bilingual lexicon induction;

    Abstract : Cross-lingual word embeddings are an increasingly important reseource in cross-lingual methods for NLP, particularly for their role in transfer learning and unsupervised machine translation, purportedly opening up the opportunity for NLP applications for low-resource languages.  However, most research in this area implicitly expects the availablility of vast monolingual corpora for training embeddings, a scenario which is not realistic for many of the world's languages. READ MORE