Real-time sound classification of live audio stream using TensorFlow?

Hi there,

I’m building a VR app that involves real-time sound classification of a live audio stream from the microphone using TensorFlow. Basically, I’m trying to detect the presence and intensity of a certain sound (specifically, a bell-like sound) in the audio captured by the microphone, and then use that intensity value downstream in the app.
While there’s already a TensorFlow tutorial on sound classification, it only works on pre-recorded audio clips, not live audio streams.
So I’m wondering if anybody can give me some pointers on doing this with a live audio stream? (Pardon my ignorance, as I’m still new to TensorFlow :slight_smile:)

Many thanks!
Melvin Eng.

Have you tried asking Gemini about this? The basic approach: capture live audio with PyAudio, preprocess it into features such as MFCCs, and feed those into a TensorFlow model for real-time predictions. Process the incoming audio continuously to get the sound intensity, then integrate this setup into your VR app to trigger events based on the detected intensity.
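To make the "continuous processing" part concrete, here is a minimal sketch of the streaming side: a ring buffer accumulates incoming microphone chunks, and once a full analysis window is available you compute an intensity value (plain RMS here) and hand the window to your classifier. The chunk size, window length, sample rate, and the `classify` hook are all assumptions, not anything from the TensorFlow tutorial; in a real app PyAudio's stream callback would supply the chunks, and `classify` would wrap your TensorFlow model (with MFCC extraction inside it).

```python
import numpy as np
from collections import deque

SAMPLE_RATE = 16000        # assumed microphone sample rate
CHUNK = 1024               # samples delivered per callback (assumption)
WINDOW = SAMPLE_RATE * 1   # analyze the most recent 1 s of audio

# Ring buffer holding the most recent WINDOW samples.
buffer = deque(maxlen=WINDOW)

def on_audio_chunk(chunk, classify=None):
    """Append a new chunk; once the window is full, return
    (intensity, prediction). `classify` is a hypothetical hook
    where your TensorFlow model call would go; intensity here
    is simple RMS loudness of the window."""
    buffer.extend(chunk)
    if len(buffer) < WINDOW:
        return None  # not enough audio buffered yet
    window = np.fromiter(buffer, dtype=np.float32, count=WINDOW)
    intensity = float(np.sqrt(np.mean(window ** 2)))  # RMS
    prediction = classify(window) if classify else None
    return intensity, prediction

# Simulated stream: in the real app, PyAudio's stream callback
# would deliver microphone chunks instead of random noise.
rng = np.random.default_rng(0)
result = None
for _ in range(WINDOW // CHUNK + 1):
    result = on_audio_chunk(rng.standard_normal(CHUNK).astype(np.float32))
```

One design note: because the deque overlaps successive windows (each new chunk slides the window forward by CHUNK samples), you get a fresh intensity/prediction every ~64 ms at these settings rather than once per second, which is what you want for responsive VR triggers.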