Real-time Audio Classification onan Edge Device : Using YAMNet and TensorFlow Lite

University essay from Linnéuniversitetet/Institutionen för datavetenskap och medieteknik (DM)

Abstract: Edge computing is the idea of moving computations away from the cloud andinstead perform them at the edge of the network. The benefits of edge computing arereduced latency, increased integrity, and less strain on networks. Edge AI is the practiceof deploying machine learning algorithms to perform computations on the edge.In this project, a pre-trained model YAMNet is retrained and used to perform audioclassification in real-time to detect gunshots, glass shattering, and speech. The modelis deployed onto the edge device both as a full TensorFlow model and as TensorFlowLite models. Comparing results of accuracy, inference time, and memory allocationfor full TensorFlow and TensorFlow Lite models with and without optimization. Resultsfrom this research were that it was a valid option to use both TensorFlow andTensorFlow Lite but there was a lot of performance to gain by using TensorFlow Litewith little downside.

  AT THIS PAGE YOU CAN DOWNLOAD THE WHOLE ESSAY. (follow the link to the next page)