Essays about: "algorithm kernels"

Showing result 1 - 5 of 18 essays containing the words algorithm kernels.

  1. 1. A Conjugate Residual Solver with Kernel Fusion for massive MIMO Detection

    University essay from Högskolan i Halmstad/Centrum för forskning om tillämpade intelligenta system (CAISR)

    Author : Ioannis Broumas; [2023]
    Keywords : MIMO; massive MIMO; GPU; CUDA; Software Defined Radio; SDR; MMSE; ZF; zero-forcing; parallel detection; iterative methods; conjugate residual; parallel computing; kernel fusion;

    Abstract : This thesis presents a comparison of a GPU implementation of the Conjugate Residual method as a sequence of generic library kernels against implementations ofthe method with custom kernels to expose the performance gains of a keyoptimization strategy, kernel fusion, for memory-bound operations which is to makeefficient reuse of the processed data. For massive MIMO the iterative solver is to be employed at the linear detection stageto overcome the computational bottleneck of the matrix inversion required in theequalization process, which is 𝒪(𝑛3) for direct solvers. READ MORE

  2. 2. Efficient multiple hypothesis tracking using a purely functional array language

    University essay from Linköpings universitet/Datorseende

    Author : Marcus Nolkrantz; [2022]
    Keywords : multiple object tracking; multiple hypothesis tracking; tracking-by-detection; GPGPU; GPU code abstraction; functional programming; Futhark;

    Abstract : An autonomous vehicle is a complex system that requires a good perception of the surrounding environment to operate safely. One part of that is multiple object tracking, which is an essential component in camera-based perception whose responsibility is to estimate object motion from a sequence of images. READ MORE

  3. 3. AXI-PACK : Near-memory Bus Packing for Bandwidth-Efficient Irregular Workloads

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Chi Zhang; [2022]
    Keywords : General propose processor; on-chip bus protocol; irregular memory access; ASIC digital circuit design.; Generellt förslag på processor; on-chip-bussprotokoll; oregelbunden minnesåtkomst; digital ASIC-kretsdesign.;

    Abstract : General propose processor (GPP) are demanded high performance in dataintensive applications, such as deep learning, high performance computation (HPC), where algorithm kernels like GEMM (general matrix-matrix multiply) and SPMV (sparse matrix-vector multiply) kernels are intensively used. The performance of these data-intensive applications are bounded with memory bandwidth, which is limited by computing & memory access coupling and memory wall effect. READ MORE

  4. 4. Real-time Scheduling in Datacentre Clusters

    University essay from Lunds universitet/Institutionen för elektro- och informationsteknik

    Author : Fabian Frankel; Sepehr Tayari; [2021]
    Keywords : Kubernetes; Realtime; Realtime Scheduling; Linux; Industry 4.0; Docker; Cloud; Technology and Engineering;

    Abstract : Industry 4.0 can be described as the next generation-factories that is characterised by putting a high demand for automation and flexible production lines. The proposed way to achieve this goal is through a large number of Industrial IoT devices(IIoT) in the factory, some having high availability- and low-latency requirements. READ MORE

  5. 5. Automatic Pronoun Resolution for Swedish

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Camilla Ahlenius; [2020]
    Keywords : Pronoun resolution; Mitkov’s algorithm; Support Vector Machine; Supervised learning; SVM-Light-TK; Tree kernels; Dependency trees; Word embeddings; Pronomenbestämning; Mitkovs algoritm; Stödvektormaskin; Övervakad inlärning; SVM-Light-TK; Trädkärnor; Dependensträd; Ordvektorer;

    Abstract : This report describes a quantitative analysis performed to compare two different methods on the task of pronoun resolution for Swedish. The first method, an implementation of Mitkov’s algorithm, is a heuristic-based method — meaning that the resolution is determined by a number of manually engineered rules regarding both syntactic and semantic information. READ MORE