Nvidia launches AI platform to boost video conferencing
Cloud-based Maxine platform can reduce bandwidth, increase call quality and keep everyone engaged, GPU giant claims
Nvidia has launched a GPU-based video conferencing platform for developers that it claims can fix common issues with video calling.
The service is called Nvidia Maxine, a cloud-based platform that will use AI to realign callers' faces, improve streaming quality, boost resolution, and more.
Video conferencing is the number one source of internet traffic, according to Nvidia, with some 30 million web meetings estimated to take place every day. With that in mind, the company has developed a cloud-based suite of 'GPU-accelerated' AI enhancements.
With Maxine, service providers running the platform in the cloud can offer users new AI effects, such as 'gaze correction', super-resolution, noise cancellation and face relighting. What's more, because the data is processed in the cloud rather than on local devices, Nvidia suggests that end users can enjoy the new features without any specialised hardware.
"Video conferencing is now a part of everyday life, helping millions of people work, learn and play, and even see the doctor," said Ian Buck, vice president and general manager of Accelerated Computing at Nvidia.
"Nvidia Maxine integrates our most advanced video, audio and conversational AI capabilities to bring breakthrough efficiency and new capabilities to the platforms that are keeping us all connected."
Maxine also includes reduced bandwidth whereby the AI software is used to analyse key facial points of each caller and re-animates the face in the video. This takes away the need to stream every single pixel which in turn requires less data to go back and forth across the internet.
The face alignment feature automatically adjusts users so they appear to be facing each other during a call and gaze correction helps simulate eye contact, even if the camera isn't aligned with the user's screen. Nvidia claims these features will help people stay engaged in the conversation rather than looking at their camera.
Democratising AI for all: Transforming your operating model to support AI adoption
How to make the most of AI capabilitiesDownload now
Developers can also add features that allow call participants to choose their own animated avatars with realistic animation automatically driven by their voice and emotional tone, while an auto frame option allows the video feed to follow the speaker even if they move away from the screen.
The new platform follows an announcement that Nvidia is planning to build the UK's most powerful supercomputer for use in medical research. Dubbed Cambridge-1, the £40 million machine would hypothetically rank as the 29th most powerful supercomputer in the global TOP500 list if it were built today.
Managing security risk and compliance in a challenging landscape
How key technology partners grow with your organisationDownload now
Evaluate your order-to-cash process
15 recommended metrics to benchmark your O2C operationsDownload now
AI 360: Hold, fold, or double down?
How AI can benefit your businessDownload now
Getting started with Azure Red Hat OpenShift
A developer’s guide to improving application building and deployment capabilitiesDownload now