Essays about: "tf-idf"

Showing result 21 - 25 of 74 essays containing the word tf-idf.

  1. 21. Evaluating semantic similarity using sentence embeddings

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Jacob Malmberg; [2021]
    Keywords : ;

    Abstract : Semantic similarity search is the task of searching for documents or sentences which contain semantically similar content to a user-submitted search term. This task is often carried out, for instance when searching for information on the internet. READ MORE

  2. 22. Semantic Topic Modeling and Trend Analysis

    University essay from Linköpings universitet/Statistik och maskininlärning

    Author : Jasleen Kaur Mann; [2021]
    Keywords : NLP; unsupervised topic modelling; trend analysis; LDA; BERT; Sentence-BERT; TF-IDF; transformer based language models; document clustering;

    Abstract : This thesis focuses on finding an end-to-end unsupervised solution to solve a two-step problem of extracting semantically meaningful topics and trend analysis of these topics from a large temporal text corpus. To achieve this, the focus is on using the latest develop- ments in Natural Language Processing (NLP) related to pre-trained language models like Google’s Bidirectional Encoder Representations for Transformers (BERT) and other BERT based models. READ MORE

  3. 23. Categorization of Swedish e-mails using Supervised Machine Learning

    University essay from KTH/Hälsoinformatik och logistik

    Author : Anna Mann; Olivia Höft; [2021]
    Keywords : Classification; categorization; e-mails; preprocessing; TF-IDF; machine learning; supervised learning; Naïve Bayes; Support Vector Machine; Decision Tree; Klassificering; kategorisering; e-postmeddelanden; förbehandling av data; TF-IDF; maskininlärning; övervakad inlärning; Naïve Bayes; Support Vector Machine; Decision Tree;

    Abstract : Society today is becoming more digitalized, and a common way of communication is to send e-mails. Currently, the company Auranest has a filtering method for categorizing e-mails, but the method is a few years old. The filter provides a classification of valuable e-mails for jobseekers, where employers can make contact. READ MORE

  4. 24. Performance comparison of different machine learningmodels in detecting fake news

    University essay from Högskolan Dalarna/Institutionen för information och teknik

    Author : Zhibin Wan; Huatai Xu; [2021]
    Keywords : Text classification; Fake news detection; Machine learning; Feature ex-traction;

    Abstract : The phenomenon of fake news has a significant impact on our social life, especially in the political world. Fake news detection is an emerging area of research. The sharing of infor-mation on the Web, primarily through Web-based online media, is increasing. The ability to identify, evaluate, and process this information is of great importance. READ MORE

  5. 25. Automated error matching system using machine learning and data clustering : Evaluating unsupervised learning methods for categorizing error types, capturing bugs, and detecting outliers.

    University essay from Linköpings universitet/Programvara och system

    Author : Jonatan Bjurenfalk; August Johnson; [2021]
    Keywords : Unsupervised learning; machine learning; clustering; DBSCAN; HDBSCAN; X-Means; outlier detection; error log clustering;

    Abstract : For large and complex software systems, it is a time-consuming process to manually inspect error logs produced from the test suites of such systems. Whether it is for identifyingabnormal faults, or finding bugs; it is a process that limits development progress, and requires experience. READ MORE