Speech

Data Driven Speech and Voice

Veronika Kolesnikova on Making your applications interactive with Speech Services

Here’s another bonus episode that BAILey has put together. Who’s BAILey? Glad you asked. In the intro, she has a thing or two to say. In this session from the Azure Global Data Fest, Veronika Kolesnikova tells us how to make your applications interactive with Speech Services. Original YouTube video: https://www.youtube.com/watch?v=b4RSZ7aIKgE Press the play button below to […]

AI Speech and Voice

Creating an 86,000 Hour Speech Dataset with Apache Spark and TPUs

As part of its machine learning benchmarking efforts, MLCommons has built The People’s Speech, an 86,000-hour open supervised speech recognition dataset with a commercial-use license. It incorporates subtitled videos and public-domain audio scraped from the Internet.

Research Speech and Voice

Sound Capture and Speech Enhancement for Communication and Distant Speech Recognition

Microsoft Research discusses the general architecture of speech enhancement pipelines for hands-free telecommunication and distant speech recognition. The talk covers both classical approaches based on statistical signal processing and deep learning approaches based on neural networks, illustrated with real-life examples from the speech enhancement audio pipelines in Kinect, HoloLens, and Teams.

AI

End-to-End Adversarial Text-to-Speech

Text-to-speech engines are usually multi-stage pipelines that transform the signal into many intermediate representations and require supervision at each step. When trying to train TTS end-to-end, the alignment problem arises: Which text corresponds to which piece of sound? This paper uses an alignment module to tackle this problem and produces astonishingly good sound. Paper: https://arxiv.org/abs/2006.03575 […]

AI Speech and Voice

Demoing Custom Speech and Language Pre-built AI Models

Noelle shares this demo from the VOICE Summit, showing off Custom Speech and Custom Language Pre-built AI Models.

AI

Bring the Power of Speech Recognition and Speech Synthesis to Your Apps with Microsoft Speech APIs

Watch Panos Periorelles, PM on the Cognitive Services team, to learn about the latest advancements in speech recognition and speech synthesis, including how to create your own custom model.
