Never miss a story

Get subscribed to our newsletter


×
Google AI to identify speakers from crowd. Wikimedia Commons

Just as most smartphone cameras now allow users to focus on a single object among many, it may soon be possible to pick out individual voices in a crowd by suppressing all other sounds, thanks to a new Artificial Intelligence (AI) system developed by Google researchers.

This is an important development as computers as not as good as humans at focusing their attention on a particular person in a noisy environment. Known as the cocktail party effect, the capability to mentally “mute” all other voices and sounds comes natural to us humans.



Google AI will identify individual speakers now. Wikimedia Commons

However, automatic speech separation — separating an audio signal into its individual speech sources — remains a significant challenge for computers, Inbar Mosseri and Oran Lang, software engineers at Google Research, wrote in a blog post this week. In a new paper, the researchers presented a deep learning audio-visual model for isolating a single speech signal from a mixture of sounds such as other voices and background noise.

“In this work, we are able to computationally produce videos in which speech of specific people is enhanced while all other sounds are suppressed,” Mosseri and Lang said. The method works on ordinary videos with a single audio track, and all that is required from the user is to select the face of the person in the video they want to hear, or to have such a person be selected algorithmically based on context.

Also Read: Want To Know What Facebook, Google Know About You?

The researchers believe this capability can have a wide range of applications, from speech enhancement and recognition in videos, through video conferencing, to improved hearing aids, especially in situations where there are multiple people speaking. “A unique aspect of our technique is in combining both the auditory and visual signals of an input video to separate the speech,” the researchers said.


This will also help in speech enhancement . VOA

“Intuitively, movements of a person’s mouth, for example, should correlate with the sounds produced as that person is speaking, which in turn can help identify which parts of the audio correspond to that person,” they explained.

The visual signal not only improves the speech separation quality significantly in cases of mixed speech, but, importantly, it also associates the separated, clean speech tracks with the visible speakers in the video, the researchers said. IANS


Popular

Majority of millennials have become more cautious about their finances as a result of the pandemic. | Unsplash

The 'Millennial Mood Index 2021' (MMI) was released by CASHe, India's AI-driven financial wellness platform with a mission to make financial inclusion possible for all. According to the survey, more than 84 per cent of millennials across the country have increased their wealth-management strategy to prepare for future contingencies while also looking for opportunities for stronger and more sustainable growth in the post-pandemic world. The pan-India survey, conducted among more than 30k customers on CASHe's platform, aimed to capture the impact of the Covid-19 pandemic and how it has altered millennials' everyday behaviour across a variety of topics such as health, travel, shopping, savings & credit appetite, and so on.

Also Read : Co-living preferred housing solution for millennials

Keep Reading Show less

Ranjay Gulati shows the catastrophic blunders leaders unintentionally make. | IANS

A renowned Harvard Business School professor delivers a persuasive reconsideration and defence of purpose as a management ethos, demonstrating the enormous performance advantages and societal benefits that can be realised when businesses get their purpose right.

Too many businesses use purpose, or a reason for existing, as a marketing tool to make themselves feel good and appear good to the public.

Keep Reading Show less
Unsplash

Student demonstrations erupted across Bihar, and a passenger train in Gaya was set ablaze. (Image used for representation only)

In India, on January 26, 2022, thousands of youngsters set fire to empty train carriages. They disrupted rail traffic in order to protest what they claim are irregularities in recruiting by the railway department, which is one of the world's major employers. (VOA/ MBI)


Keep reading... Show less