Speech Recognition

AI Speech and Voice

Creating an 86,000 Hour Speech Dataset with Apache Spark and TPUs

As part of its machine learning benchmarking efforts, MLCommons has built The People’s Speech, an 86,000-hour open supervised speech recognition dataset with a commercial-use license. The dataset incorporates subtitled videos and audio in the public domain scraped from the Internet.

Research Speech and Voice

Sound Capture and Speech Enhancement for Communication and Distant Speech Recognition

Microsoft Research discusses the general architecture of speech enhancement pipelines for hands-free telecommunication and distant speech recognition. The talk covers both classical approaches based on statistical signal processing and deep learning approaches based on neural networks. It is illustrated with real-life examples from the speech enhancement audio pipelines in Kinect, HoloLens, and Teams.

AI Alexa

Designing for Speech

Here’s a great talk from Build 2019 about the importance of design in creating voice and chat virtual assistants. Designing a natural language interface can be difficult. Is the interface supposed to be able to interpret every single nuance of speech? And how about slang? Or should we aim more towards forced language and make […]

AI Python Speech and Voice

Speech Recognition and Speech to Text Translation in Python

This Edureka live stream on ‘Speech Recognition in Python’ covers the concepts of the SpeechRecognition module in Python, with a program that uses speech recognition to translate speech into text. The topics discussed include: how speech recognition works, how to install SpeechRecognition in Python, working with microphones, and how to install PyAudio in Python.

AI Natural Language Processing

What’s New with Speech Services

“This week in Cognitive” is focused on providing developers with a weekly look at what’s new and exciting in the area of Cognitive Services and Pre-Built AI. This week Noelle LaCharite takes a look at the newly Generally Available service called Unified Speech as well as some preview services in speech that are fun to […]

Azure Natural Language Processing Uncategorized

Five Things About NLP

With assistants and devices like Siri, Cortana, Amazon Alexa and Google Home, natural language processing is more relevant than ever. In this episode of Five Things, John and Brian talk about what it takes to get started with Natural Language Processing (NLP). What is it, and what can be done with it?
