Speech and Voice

Speech and Voice

Cool things to do with Voice Interfaces (and lots of things you shouldn’t)

Many voice interface applications (Alexa, Siri, Google Home) now have simple javascript APIs, allowing you to develop your own voice-activated application. But voice interface design is a skill in itself, and only suitable for certain scenarios. Let’s have a look at what we can do with voice apps, and see what works well and why. […]

Read More
AI Python Speech and Voice

Speech Recognition in Python Tutorial – Full Course for Beginners

Learn how to implement speech recognition in Python by building five projects. You will learn how to use the AssemblyAI API for speech recognition. Code: https://github.com/AssemblyAI-Examples/python-speech-recognition-course

Read More
Speech and Voice

Noelle Silver at #Voice21

Here’s a snippet from Noelle Silver’s recent talk at Voice 21 in Arlington, VA.

Read More
AI Speech and Voice

Using Postman to Interact with Azure Speech API: Convert audio to text

This video will walk you through the step-by-step process of how you can make a call to Azure Speech API, which is part of Azure Cognitive Services. It includes resource deployment in Azure, access token generation and then making a call to REST API.

Read More
Databricks Speech and Voice

How Comcast Uses Voice, Data, and AI in Home Entertainment

Here’s an interesting look at how Comcast uses data to enrich customer experience. Comcast’s Data Team is bringing together voice, data, and AI to make home entertainment more accessible to everyone– regardless of age, language proficiency, or ability. With every voice prompt, the Xfinity remote control turns billions of spoken words into actionable insights, personalizing […]

Read More
Data Driven Speech and Voice

Veronika Kolesnikova on Making your applications interactive with Speech Services

Here’s another bonus episode that BAILey has put together. Who’s BAILey? Glad you asked. In the intro, she has a thing or two to say.In this session from the Azure Global Data Fest, Veronika Kolesnikova tell us how make your applications interactive with Speech Services. Original YouTube video: https://www.youtube.com/watch?v=b4RSZ7aIKgE Press the play button below to […]

Read More
AI Speech and Voice

Creating an 86,000 Hour Speech Dataset with Apache Spark and TPUs

As part of its machine learning benchmarking efforts, MLCommons has built an 86,000 hour open supervised speech recognition dataset with a commercial-use license known as The People’s Speech, incorporating subtitled videos and audio in the public domain scraped from the Internet.

Read More
AI Speech and Voice

Make Your Applications Interactive with Speech Services

Another great session from the Global Azure Data & AI Fest on who to make your applications interactive with Speech Services by Veronika Kolesnikova.

Read More
Research Speech and Voice

Sound Capture and Speech Enhancement for Communication and Distant Speech Recognition

Microsoft Research discusses the general architecture of speech enhancement pipelines for the needs of hands-free telecommunication and distant speech recognition. The talk will discuss both classical approaches using statistical signal processing and deep learning using neural networks. It will be illustrated with real-life examples from the speech enhancement audio pipelines in Kinect, HoloLens, and Teams.

Read More