PyData Global 2022

Implementation and analysis of deep learning models for codeswitched speech classification
12-03, 10:00–10:30 (UTC), Talk Track I

Automatic Speech Recognition (ASR) is used in many devices that need to handle bilingual speech data. Bilingual speech, or in more scientific terms code-switched speech, mixes two or more languages within a single speech utterance. In this presentation, learn about different deep learning techniques that can be used to classify such utterances. If you are a beginner in this field and don't know where to start, join me to explore this use case and learn something new!


Motivation: Code-switching occurs when a speaker alternates between two or more languages, or language varieties, in the context of a single conversation or situation. In Automatic Speech Recognition (ASR), which is used in many virtual assistants like Alexa & Siri, code-switching is an important challenge due to globalization. Recent research in multilingual ASR shows potential improvement over monolingual systems. Though all of this looks good in news feeds and tech newsletters, it is important to dive deep and understand how it works at ground level by implementing such use cases, at a basic stage, through personal projects.

Problem Statement:
Automatic Speech Recognition (ASR) is widely used in mobile and personal devices. In countries like India, the content from a provider can be in many languages (Hindi, Tamil, Gujarati, etc.). Indian speakers tend to code-switch (CS), that is, change languages, most of the time while speaking. The goal here is to discuss two deep learning techniques, one using a Convolutional Neural Network and one using a Recurrent Neural Network, to classify speech as English, Hindi, or code-switched at the utterance level.
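
To make the three-class target concrete, here is a minimal sketch of how an utterance-level label could be derived from per-segment language tags. It is an illustration only, not the models discussed in the talk: the function name `utterance_label`, the tag strings, and the idea of pre-tagged segments are all assumptions introduced for this example.

```python
from typing import Sequence

# Hypothetical tag names, chosen for this example only.
ENGLISH = "english"
HINDI = "hindi"
CODE_SWITCHED = "code-switched"


def utterance_label(segment_languages: Sequence[str]) -> str:
    """Collapse per-segment language tags into one utterance-level class.

    An utterance is code-switched when it contains both languages;
    otherwise it takes the single language that all segments share.
    """
    languages = set(segment_languages)
    if not languages:
        raise ValueError("utterance has no segments")
    unknown = languages - {ENGLISH, HINDI}
    if unknown:
        raise ValueError(f"unsupported language tags: {sorted(unknown)}")
    if len(languages) > 1:
        return CODE_SWITCHED
    return next(iter(languages))


if __name__ == "__main__":
    print(utterance_label(["hindi", "english", "hindi"]))
```

A classifier such as a CNN or RNN would predict this single label directly from the audio of the whole utterance, without needing the per-segment tags at inference time.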

Results/Conclusion:
Attendees will gain an understanding of the two main deep learning approaches used for code-switched speech classification. The session will first help them understand the problem at hand and then dive deep into the solutions, giving them wider visibility into the topic.


Prior Knowledge Expected

Previous knowledge expected

I am Yashasvi Misra, a recent computer science engineering graduate currently working as an Associate Data Scientist - 1 at ABInBev India. I am enthusiastic about exploring and implementing new tech stacks, have a good background in research projects, and received an Excellence Award from Samsung Research India. I am extremely passionate about engaging in open source communities and contributing to diversity & inclusion.