Microsoft touts AI, circular microphone advances in overlapped speech recognition work

Microsoft researchers are making advances in overlapped speech recognition, in part, thanks to a new conical microphone array.

At the Interspeech 2018 conference in Hyderabad, India, this week, Microsoft researchers will be talking up advances in overlapped speech recognition that they've achieved. Part of the solution they'll be outlining involves a new circular microphone array -- seemingly the one that attendees of Microsoft's Build 2018 conference saw in a demonstration, but about which Microsoft has declined to reveal specifics.

Also: Windows 10 how-to: Ed Bott's free tech support guide

Microsoft and others working in the speech recognition field have been attempting to address the "cocktail party problem," i.e., the situation where speakers overlap in a noisy environment. Systems need to be able to identify a varying number of speakers with unknown identities, speech patterns and extraneous noise.

Also: Microsoft Windows U-turn removes warning about installing Chrome, Firefox CNET

In a new research paper, "Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks," Microsoft researchers explain how they've tackled overlap detection and speech separation. To do so, they've used both a neural network and traditional signal-processing techniques using an unmixing transducer that can receive microphone signals and generate a number of time-synchronous audio streams.

circularmicarrayspeachresearch.jpg
Credit: Microsoft

From an image that accompanies the September 5 blog post about the research paper (which I've embedded in my post above), it looks like Microsoft researchers have built a seven-channel conical mic array for meeting transcription as part of their solution. The system handles dereverberation, speech separation and automatic speech recognition, the research paper says.

smartmeetingdemobuild.jpg
Credit: Ben Thompson, YouTube

The image of this microphone definitely looks like it matches the mystery device that Microsoft featured at Build 2018 in its demo of the possibilities of meetings in the future. (An image from that demo is embedded above.)

I asked Microsoft if this is, indeed, the same device and if the company has considered turning the mic into a marketable product (by either Microsoft itself or its OEMs) at some point. No word back so far.

Also: Microsoft 365: A cheat sheet TechRepublic

To Microsoft researchers knowledge, according to the blog post, this system "represents the first overlapped speech recognition system that has been demonstrated to work well for actual meetings with no prior assumptions."

Microsoft has used work from its researchers in the automatic speech recognition area in a number of its products, including Cortana, Skype Translator, Office Dictation, HoloLens and Azure Cognitive Services.

Previous and related coverage:

Here's how you can still get a free Windows 10 upgrade

Microsoft's much-hyped free upgrade offer for Windows 10 ended in 2016, right? Not exactly. The GWX tool may be gone, but all the other upgrade tools still work. The end result is an apparently valid digital license, and there's no evidence that the free upgrades will end any time soon.

How to install, reinstall, upgrade and activate Windows 10

Here's everything you need to know before you repair, reinstall, or upgrade Windows 10, including details about activation and product keys.

After Windows 10 upgrade, do these seven things immediately

You've just upgraded to the most recent version of Windows 10. Before you get back to work, use this checklist to ensure that your privacy and security settings are correct and that you've cut annoyances to a bare minimum.

How to upgrade from Windows 10 Home to Pro for free

You've got a new PC running Windows 10 Home. You want to upgrade to Windows 10 Pro. Here's how to get that upgrade for free. All you need is a Pro/Ultimate product key from an older version of Windows.

Related stories: