Essays about: "fault tolerance"

Showing result 16 - 20 of 96 essays containing the words fault tolerance.

  1. 16. A Study on Fault-tolerance of Deep Neural Networks for Embedded Systems

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Elaheh Malekzadeh; [2021]
    Keywords : Embedded systems; Fault tolerance; Deep Learning; Single event upset; Edge AI.; Inbyggda system; Feltolerans; Djupinlärning; Edge AI.;

    Abstract : Deep learning is replacing many traditional data processing methods in computer vision, speech recognition, natural language processing and many more diverse end applications. Until only a few years ago, using deep learning networks for inference required large amount of computational resources such as memory, processing power and energy. READ MORE

  2. 17. Distributed Robust Learning

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Akhil Yerrapragada; [2021]
    Keywords : Byzantine resilient decentralized training; Gradient aggregation rules; α; f Byzantine resilience; Fault tolerance; Ring all-reduce.; Byzantinsk motståndskraftig decentraliserad träning; Gradientaggregeringsregler; α; f Byzantinsk motståndskraft; Feltolerans; Ring allreducera.;

    Abstract : Accuracy obtained when training deep learning models with large amounts of data is high, however, training a model with such huge amounts of data on a single node is not feasible due to various reasons. For example, it might not be possible to fit the entire data set in the memory of a single node, training times can significantly increase since the dataset is huge. READ MORE

  3. 18. Efficient serverless resource scheduling for distributed deep learning.

    University essay from Umeå universitet/Institutionen för datavetenskap

    Author : Johan Sundkvist; [2021]
    Keywords : Serverless; distributed; deep learning; scheduling; regression;

    Abstract : Stemming from the growth and increased complexity of computer vision, natural language processing, and speech recognition algorithms; the need for scalability and fault tolerance of machine learning systems has risen. In order to comply with these demands many have turned their focus towards implementing machine learning on distributed systems. READ MORE

  4. 19. Validation of theoretical cost model for Power and Reliability : Case study of a reliable Central Direct Memory Access system

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Sonal Shrivastava; [2021]
    Keywords : Single event upsets; Extra-functional properties; System on Chip; Mean Time Between Failure; Power consumption; Enstaka händelse störs; Extra funktionella egenskaper; System på chip; Medeltid mellan misslyckande; Energiförbrukning;

    Abstract : Safety-critical applications employed in automotive, avionics and aerospace domains are placed under strict demands for performance, power efficiency and fault tolerance. Development of system hardware and software satisfying all criteria is challenging and time-consuming. READ MORE

  5. 20. A COMPARISON OF DATA INGESTION PLATFORMS IN REAL-TIME STREAM PROCESSING PIPELINES

    University essay from Mälardalens högskola/Akademin för innovation, design och teknik

    Author : Sebastian Tallberg; [2020]
    Keywords : stream processing; data ingestion; Redis Streams; Apache Kafka; Apache Pulsar; performance benchmark; real-time streaming;

    Abstract : In recent years there has been an increasing demand for real-time streaming applications that handle large volumes of data with low latency. Examples of such applications include real-time monitoring and analytics, electronic trading, advertising, fraud detection, and more. READ MORE