Essays about: "word clustering"

Showing result 1 - 5 of 25 essays containing the words word clustering.

  1. 1. Form data enriching using a post OCR clustering process : Measuring accuracy of field names and field values clustering

    University essay from Mittuniversitetet/Institutionen för informationssystem och –teknologi

    Author : Adil Aboulkacim; [2022]
    Keywords : Optical Character Recognition; Form Processing; Data enrichment; Optisk teckenläsning; Formulärbearbetning; Databerikning;

    Abstract : Med OCR teknologier kan innehållet av ett formulär läsas in, positionen av varje ord och dess innehåll kan extraheras, dock kan relationen mellan orden ej förstås. Denna rapport siktar på att lösa problemet med att berika data från ett strukturerat formulär utan någon förinställd konfiguration genom användandet utav klustring. READ MORE

  2. 2. Discover patterns within train log data using unsupervised learning and network analysis

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Zehua Guo; [2022]
    Keywords : Log analysis; Natural language processing; Unsupervised learning; Clustering; Network analysis; Logganalys; Bearbetning av naturligt språk; Oövervakat lärande; Clustering; Nätverksanalys;

    Abstract : With the development of information technology in recent years, log analysis has gradually become a hot research topic. However, manual log analysis requires specialized knowledge and is a time-consuming task. Therefore, more and more researchers are searching for ways to automate log analysis. READ MORE

  3. 3. Cluster selection for Clustered Federated Learning using Min-wise Independent Permutations and Word Embeddings

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Pulasthi Raveen Bandara Harasgama; [2022]
    Keywords : Federated learning; Distributed machine learning; Clustering; Word Embeddings; Federerad inlärning; Distribuerad maskininlärning; Klustring; Ordinbäddningar;

    Abstract : Federated learning is a widely established modern machine learning methodology where training is done directly on the client device with local client data and the local training results are shared to compute a global model. Federated learning emerged as a result of data ownership and the privacy concerns of traditional machine learning methodologies where data is collected and trained at a central location. READ MORE

  4. 4. Semantic Topic Modeling and Trend Analysis

    University essay from Linköpings universitet/Statistik och maskininlärning

    Author : Jasleen Kaur Mann; [2021]
    Keywords : NLP; unsupervised topic modelling; trend analysis; LDA; BERT; Sentence-BERT; TF-IDF; transformer based language models; document clustering;

    Abstract : This thesis focuses on finding an end-to-end unsupervised solution to solve a two-step problem of extracting semantically meaningful topics and trend analysis of these topics from a large temporal text corpus. To achieve this, the focus is on using the latest develop- ments in Natural Language Processing (NLP) related to pre-trained language models like Google’s Bidirectional Encoder Representations for Transformers (BERT) and other BERT based models. READ MORE

  5. 5. Clustering and Summarization of Chat Dialogues : To understand a company’s customer base

    University essay from Linköpings universitet/Artificiell intelligens och integrerade datorsystem

    Author : Oskar Hidén; David Björelind; [2021]
    Keywords : Machine Learning; NLP; Text Representations; Clustering; Extractive summarization; TFIDF; S-BERT; FastText; K-means; DBSCAN; HDBSCAN; LSA; TextRank; Word Mover s Distance WMD ;

    Abstract : The Customer Success department at Visma handles about 200 000 customer chats each year, the chat dialogues are stored and contain both questions and answers. In order to get an idea of what customers ask about, the Customer Success department has to read a random sample of the chat dialogues manually. READ MORE