Essays about: "LaBSE"
Found 4 essays containing the word LaBSE.
-
1. Improving BERTScore for Machine Translation Evaluation Through Contrastive Learning
University essay from Uppsala universitet/Institutionen för lingvistik och filologiAbstract : Since the advent of automatic evaluation, tasks within Natural Language Processing (NLP), including Machine Translation, have been able to better utilize both time and labor resources. Later, multilingual pre-trained models (MLMs)have uplifted many languages’ capacity to participate in NLP research. READ MORE
-
2. Comparing Text Classification Libraries in Scala and Python : A comparison of precision and recall
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)Abstract : In today’s internet era, more text than ever is being uploaded online. The text comes in many forms, such as social media posts, business reviews, and many more. For various reasons, there is an interest in analyzing the uploaded text. For instance, an airline business could ask their customers to review the service they have received. READ MORE
-
3. QPLaBSE: Quantized and Pruned Language-Agnostic BERT Sentence Embedding Model : Production-ready compression for multilingual transformers
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)Abstract : Transformer models perform well on Natural Language Processing and Natural Language Understanding tasks. Training and fine-tuning of these models consume a large amount of data and computing resources. Fast inference also requires high-end hardware for user-facing products. READ MORE
-
4. DistillaBSE: Task-agnostic distillation of multilingual sentence embeddings : Exploring deep self-attention distillation with switch transformers
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)Abstract : The recent development of massive multilingual transformer networks has resulted in drastic improvements in model performance. These models, however, are so large they suffer from large inference latency and consume vast computing resources. Such features hinder widespread adoption of the models in industry and some academic settings. READ MORE