How can I classify the language of voice data?

ruorch · August 30, 2022, 10:49pm

How can I classify the language of voice data? Specifically, I have voice recordings in English, Japanese, French, Italian, and Russian. I want to train a model on these recordings so that it can classify which language is being spoken in new voice data. What kind of preprocessing, feature extraction, and model selection should I use?

Kiran_Sai_Ramineni · October 11, 2024, 4:38am

Hi @ruorch, the use case you are trying to implement comes under the audio classification task. Please refer to this tutorial to learn how to implement image classification using CNNs. Thank you.
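To make the link between audio and image classification concrete: a common approach is to convert each clip into a spectrogram, which is a 2D array that a CNN can treat like a single-channel image. Below is a minimal NumPy sketch of that preprocessing step, not the tutorial's code. The 16 kHz sample rate, the one-second clip length, the frame sizes, and the helper names `to_fixed_length` and `log_spectrogram` are all assumptions made for illustration.

```python
import numpy as np

# The five target classes from the question; the model would have one output per language.
LANGUAGES = ["English", "Japanese", "French", "Italian", "Russian"]


def to_fixed_length(waveform, num_samples=16000):
    """Trim or zero-pad a mono waveform so every clip has the same length.

    num_samples=16000 assumes one second of audio at 16 kHz (an assumption).
    """
    waveform = np.asarray(waveform, dtype=np.float32)[:num_samples]
    return np.pad(waveform, (0, num_samples - waveform.size))


def log_spectrogram(waveform, frame_length=400, frame_step=160):
    """Return a (frames, frequency_bins) log-magnitude spectrogram.

    frame_length=400 and frame_step=160 correspond to 25 ms windows with a
    10 ms hop at 16 kHz (assumed values, tune them for your data).
    """
    waveform = np.asarray(waveform, dtype=np.float32)
    if waveform.size < frame_length:
        waveform = np.pad(waveform, (0, frame_length - waveform.size))
    num_frames = 1 + (waveform.size - frame_length) // frame_step
    # Build an index matrix so each row selects one overlapping frame.
    idx = np.arange(frame_length)[None, :] + frame_step * np.arange(num_frames)[:, None]
    frames = waveform[idx] * np.hanning(frame_length)
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    # Small constant avoids log(0) on silent frames.
    return np.log(magnitude + 1e-6)


if __name__ == "__main__":
    # Synthetic half-second 440 Hz tone standing in for a real recording.
    tone = np.sin(2 * np.pi * 440 * np.arange(8000) / 16000)
    features = log_spectrogram(to_fixed_length(tone))
    print(features.shape)
```

Each spectrogram can then be given a trailing channel dimension and fed to a small CNN whose final layer has five outputs, one per entry in `LANGUAGES`. Mel-scaled spectrograms or MFCCs are frequent alternatives for the feature step.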
