Essays about: "parallel computing"

Showing result 16 - 20 of 191 essays containing the words parallel computing.

  1. 16. Register Caching for Energy Efficient GPGPU Tensor Core Computing

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Qiran Qian; [2023]
    Keywords : Computer Architecture; GPGPU; Tensor Core; GEMM; Energy Efficiency; Register File; Cache; Instruction Scheduling; Datorarkitektur; GPGPU; Tensor Core; GEMM; energieffektivitet; registerfil; cache; instruktionsschemaläggning;

    Abstract : The General-Purpose GPU (GPGPU) has emerged as the predominant computing device for extensive parallel workloads in the fields of Artificial Intelligence (AI) and Scientific Computing, primarily owing to its adoption of the Single Instruction Multiple Thread architecture, which not only provides a wealth of thread context but also effectively hide the latencies exposed in the single threads executions. As computational demands have evolved, modern GPGPUs have incorporated specialized matrix engines, e. READ MORE

  2. 17. Implementation of Bolt Detection and Visual-Inertial Localization Algorithm for Tightening Tool on SoC FPGA

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Muhammad Ihsan Al Hafiz; [2023]
    Keywords : Bolt detection; Visual-Inertial localization; System-on-Chip SoC ; Field-Programmable Gate Array FPGA ; Machine learning; Perspective-n-Points; Error-State Extended Kalman Filter ESEKF ; High-Level Synthesis HLS ; YOLO; Tightening tool; Bultdetektering; visuell-tröghetslokalisering; System-on-Chip SoC ; Field-Programmable Gate Array FPGA ; Machine Learning; Perspective-n-Points; Error-State Extended Kalman Filter ESEKF ; High-Level Synthesis HLS ; YOLO; åtdragningsverktyg;

    Abstract : With the emergence of Industry 4.0, there is a pronounced emphasis on the necessity for enhanced flexibility in assembly processes. In the domain of bolt-tightening, this transition is evident. Tools are now required to navigate a variety of bolts and unpredictable tightening methodologies. READ MORE

  3. 18. Towards a tunable, wide-band acoustic transducer operating in the quantum regime.

    University essay from KTH/Tillämpad fysik

    Author : Abel Hugot; [2022]
    Keywords : Hybrid quantum systems; Acoustics; Impedance matching; SQUIDs; Hybrida kvantsystem; Akustik; Impedansanpassning; SQUIDs.;

    Abstract : In the past decade we have seen fast development of new quantum technologies that promise to revolutionise communications and computing. Many different routes are explored to physically implement such quantum technologies. Among others, we can mention superconducting circuits, spin-based devices and photonic devices. READ MORE

  4. 19. Mapping DNNs onto the NoC Platform

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Hanbo Xu; [2022]
    Keywords : ;

    Abstract : This thesis uses an existing NoC simulation platform to construct a Network on Chip-based many-core system. The network is an 8_8 mesh topology. This thesis chooses LeNet5, ResNet, VGGNet, and AlexNet as the computing load, and tries to obtain a deep neural network mapping algorithm based on a NoC design method that can be widely used. READ MORE

  5. 20. Predictability of Optimal Core Distribution Based on Weight and Speedup

    University essay from Umeå universitet/Institutionen för datavetenskap

    Author : Rasmus Eriksson; [2022]
    Keywords : Resource distribution; Core distribution; Shared memory; High performance computing; DGEMM; OpenMP;

    Abstract : Efficient use of hardware resources is a vital part of getting good results within high performance computing. This thesis explores the predictability of optimal CPU-core distribution between two tasks running in parallel on a shared-memory machine, with the intent to reach the shortest total runtime possible. READ MORE