FILE - A woman walks past the logo for Google at the China International Import Expo in Shanghai, Nov. 5, 2018. VOA

Google has announced its first direct speech-to-speech translation system, called “Translatotron”, which can convert speech from one language to another while maintaining the speaker’s voice and tempo.

“Translatotron” is based on a sequence-to-sequence network that takes source spectrograms (visual representations of how a signal’s frequencies vary over time) as input and generates spectrograms of the translated content in the target language, Ye Jia and Ron Weiss, software engineers at Google Artificial Intelligence (AI), wrote in a blog post on Wednesday.
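To make the idea of a spectrogram concrete, here is a minimal sketch in plain Python. It is not Google's code, and the function name `spectrogram`, the frame size, the hop length and the test tone are all assumptions chosen for illustration. It slices a signal into overlapping frames and measures how strongly each frequency is present in each frame.

```python
import cmath
import math


def spectrogram(samples, frame_size=64, hop=32):
    """Return one list of frequency-bin magnitudes per overlapping frame.

    Illustrative only: frame_size and hop are assumed values, and the
    naive DFT below is slow compared with a real FFT library.
    """
    # A Hann window tapers each frame's edges to reduce spectral leakage.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / (frame_size - 1))
              for n in range(frame_size)]
    frames = []
    for start in range(0, len(samples) - frame_size + 1, hop):
        frame = [samples[start + n] * window[n] for n in range(frame_size)]
        magnitudes = []
        # Only bins 0..frame_size/2 are needed for a real-valued signal.
        for k in range(frame_size // 2 + 1):
            acc = sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_size)
                      for n in range(frame_size))
            magnitudes.append(abs(acc))
        frames.append(magnitudes)
    return frames


# Hypothetical input: a pure 1,000 Hz tone sampled at 8,000 Hz.
sample_rate = 8000
tone_hz = 1000
samples = [math.sin(2 * math.pi * tone_hz * t / sample_rate)
           for t in range(256)]

spec = spectrogram(samples)
# The strongest bin in the first frame should correspond to the tone.
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
peak_hz = peak_bin * sample_rate / 64
print(len(spec), "frames,", len(spec[0]), "bins, peak near", peak_hz, "Hz")
```

Each inner list is one column of the spectrogram image. A model of the kind described would read such columns for the source speech and emit new ones for the target language.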


The model makes use of two other separately trained components — a neural vocoder that converts output spectrograms to time-domain waveforms and a speaker encoder that can be used to maintain the character of the source speaker’s voice in the synthesised translated speech.

For now, the system’s results lag behind those of a conventional cascade system, which chains speech recognition, text translation and speech synthesis, but the search engine giant said it has “demonstrated the feasibility of the end-to-end direct speech-to-speech translation”.

“By incorporating a speaker encoder network, ‘Translatotron’ is also able to retain the original speaker’s vocal characteristics in the translated speech, which makes the translated speech sound more natural and less jarring,” Jia and Weiss said.


A man walks past a Google sign outside with a span of the Bay Bridge at rear in San Francisco, May 1, 2019. VOA

The company said that “Translatotron” is the first end-to-end model that can directly translate speech from one language into another.

“We hope that this work can serve as a starting point for future research on end-to-end speech-to-speech translation systems,” the blog post noted.

Lately, Google has been working aggressively on its translation systems.


Earlier in 2018, while adding more languages to its real-time translation feature, the company introduced region-based accents and pronunciations for a variety of languages in Google Translate.

In February, Google extended the “Interpreter” mode of its Assistant to Home speakers and other third-party devices, letting users hold real-time conversations through compatible devices and have them translated into 26 listed languages. (IANS)

