Ivan J. Tashev

Research Speech and Voice

Sound Capture and Speech Enhancement for Communication and Distant Speech Recognition

Microsoft Research discusses the general architecture of speech enhancement pipelines for the needs of hands-free telecommunication and distant speech recognition. The talk will discuss both classical approaches using statistical signal processing and deep learning using neural networks. It will be illustrated with real-life examples from the speech enhancement audio pipelines in Kinect, HoloLens, and Teams.

Read More