Google DeepMind gets closer to sounding human

Researchers at DeepMind use WaveNet AI to mimic human speech

Artificial intelligence researchers at DeepMind have used neural networks to create some of the most realistic-sounding computer-generated speech to date.

Dubbed WaveNet, the AI promises significant improvements to computer-generated speech, and could eventually be used in digital personal assistants such as Siri, Cortana and Amazon's Alexa.

The technology generates voices by sampling real human speech from both English and Mandarin speakers. In tests, WaveNet-generated speech was found to be more realistic than that of other text-to-speech programs, but it still fell short of being truly convincing.

In 500 blind tests, respondents were asked to judge sample sentences on a scale of one to five (five being most realistic). WaveNet was rated 4.21 in English and 4.08 in Mandarin, while actual human speech was rated 4.55 in English and 4.21 in Mandarin in the same tests. That said, WaveNet managed to outperform other speech synthesis methods.
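
To put those ratings in perspective, the gap between WaveNet and real human speech can be worked out directly from the figures quoted above. The short Python sketch below does only that arithmetic; the dictionary layout and the function name `gap_to_human` are illustrative choices, not anything published by DeepMind.

```python
# Mean ratings quoted in the article (scale of one to five).
scores = {
    "English": {"wavenet": 4.21, "human": 4.55},
    "Mandarin": {"wavenet": 4.08, "human": 4.21},
}


def gap_to_human(language):
    """Return how far WaveNet's rating sits below human speech, to two decimals."""
    rating = scores[language]
    return round(rating["human"] - rating["wavenet"], 2)


for language in scores:
    print(f"{language}: WaveNet trails human speech by {gap_to_human(language)} points")
```

On these numbers, WaveNet sits about a third of a point below human speech in English and a little over a tenth of a point below it in Mandarin.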

While other artificial speech generators focus on language, WaveNet targets the sound waves being produced, analysing raw audio signal waveforms and modelling speech on that. The researchers also used the same technique to produce music after listening to piano solos on YouTube.

"WaveNets open up a lot of possibilities for TTS, music generation and audio modelling in general. The fact that directly generating timestep per timestep with deep neural networks works at all for 16kHz audio is really surprising, let alone that it outperforms state-of-the-art TTS systems. We are excited to see what we can do with them next," said DeepMind in a blog post.
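
The "timestep per timestep" approach at 16kHz means producing 16,000 audio samples, one after another, for every second of sound. The Python sketch below illustrates only that loop structure. The function `toy_next_sample` is a hypothetical stand-in that continues a 440Hz sine wave from the two previous samples; it is an assumption made for illustration and is not WaveNet's neural network.

```python
import math

SAMPLE_RATE = 16_000  # samples per second, the 16kHz figure from the quote
TONE_HZ = 440         # assumed test tone, chosen only for this illustration
STEP = 2 * math.pi * TONE_HZ / SAMPLE_RATE


def toy_next_sample(history):
    """Hypothetical predictor: derive the next sample from the samples so far.

    A real system would run a trained neural network here. This stand-in
    uses the identity sin((n+1)w) = 2*cos(w)*sin(n*w) - sin((n-1)*w).
    """
    if not history:
        return 0.0
    if len(history) == 1:
        return math.sin(STEP)
    return 2 * math.cos(STEP) * history[-1] - history[-2]


def generate(num_samples):
    """Build a waveform one sample at a time, feeding each output back in."""
    samples = []
    for _ in range(num_samples):
        samples.append(toy_next_sample(samples))
    return samples


one_second = generate(SAMPLE_RATE)
print(len(one_second))  # one prediction per sample of audio
```

Even in this toy version, a single second of audio takes 16,000 sequential predictions, which gives a sense of why the researchers describe the result as surprising.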


DeepMind has also published a paper that goes into much more detail on the technology.

The research outfit was also responsible for creating an AI system to beat a champion Go player this year.
