Essays about: "CUDA."

Showing result 1 - 5 of 118 essays containing the word CUDA.

  1. ML implementation for analyzing and estimating product prices

    University essay from Karlstads universitet/Institutionen för matematik och datavetenskap (from 2013)

    Author : Abel Getachew Kenea; Gabriel Fagerslett; [2024]
    Keywords : Machine Learning; ML; Regression; Deep Learning; Artificial Neural Network; ANN; TensorFlow; ScikitLearn; CUDA; cuDNN; Estimation; Prediction; AI; Artificial Intelligence; Price Tracking; Price Logging; Price Estimation; Supervised Learning; Random Forest; Decision Trees; Batch Learning; Hyperparameter Tuning; Linear Regression; Multiple Linear Regression;

    Abstract : Efficient price management is crucial for companies with many different products to keep track of, leading to the common practice of price logging. Today, these prices are often adjusted manually, but setting prices manually can be labor-intensive and prone to human error.

  2. A Conjugate Residual Solver with Kernel Fusion for massive MIMO Detection

    University essay from Högskolan i Halmstad/Centrum för forskning om tillämpade intelligenta system (CAISR)

    Author : Ioannis Broumas; [2023]
    Keywords : MIMO; massive MIMO; GPU; CUDA; Software Defined Radio; SDR; MMSE; ZF; zero-forcing; parallel detection; iterative methods; conjugate residual; parallel computing; kernel fusion;

    Abstract : This thesis presents a comparison of a GPU implementation of the Conjugate Residual method as a sequence of generic library kernels against implementations of the method with custom kernels to expose the performance gains of a key optimization strategy, kernel fusion, for memory-bound operations which is to make efficient reuse of the processed data. For massive MIMO the iterative solver is to be employed at the linear detection stage to overcome the computational bottleneck of the matrix inversion required in the equalization process, which is O(n³) for direct solvers.

  3. Evaluating the performance of CUDA Graphs in common GPGPU programming patterns

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Mert Demirsü; Axel Lervik; [2023]
    Keywords : ;

    Abstract : CUDA Graphs is a relatively new and unexplored feature of CUDA that presents an alternative for submitting work to the GPU. This report aims to evaluate the performance benefits of CUDA Graphs. The means to this end was a benchmark suite containing CUDA applications that do not utilise CUDA Graphs.

  4. Performance Benefits of Extending JavaScript Through WASM and CUDA : A Comparative Performance Investigation

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Hampus Hallkvist; Edvin Nordqvist; [2023]
    Keywords : ;

    Abstract : JavaScript is today the largest language based on usage with regard to web development. There is, therefore, a great incentive to make the language perform at its best and allow applications to take advantage of this.

  5. Design and evaluation of a plain MPI-based cluster execution backend for the SkePU 3 skeleton programming framework

    University essay from Linköpings universitet/Programvara och system

    Author : Alexander Zeijlon; [2023]
    Keywords : SkePU; skeleton programming; algorithmic skeletons; HPC; cluster; parallel programming; MPI; OpenMP; CUDA; Hybrid; NUMA;

    Abstract : SkePU 3 is a framework for parallel program execution that uses higher order functions called skeletons, which provide a layer of abstraction between user code and the parallel implementation it provides through its backends. The backend that enables SkePU to run on an HPC cluster has a slowdown of a factor two.