Amazon’s New Nova Sonic AI Model Features a ‘More Human-like Voice’

Screenshot from Amazon's site of Amazon Nova Canvas, one of its foundation models for generating high-quality images.

Amazon is the latest tech giant to unveil a voice AI model. According to Amazon, its Nova Sonic is “a new foundation model that unifies speech understanding and speech generation into a single model, to enable more human-like voice conversations in AI applications.” Nova Sonic will compete with similar AI models from OpenAI, Google, and other tech companies.

Nova Sonic understands more than words

Nova Sonic doesn’t just understand the speaker’s words; it can also process tone, style, and pace. The model adapts to the conversation context, so dialogue flows more naturally than it did with the more stilted models from the first generations of Alexa. Nova Sonic can do this because it combines multiple speech processing and generation functions into a single AI model instead of using several different models.

Traditionally, AI voice tools involved running multiple models in sequence: a speech recognition model would convert speech to text, then a large language model (LLM) would process the input text and generate a response, and finally a text-to-speech model would convert the text back to audio. This complex pipeline often stripped away the tone, style, and pacing of the speaker’s original dialogue.
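The three-stage pipeline described above can be illustrated with a toy sketch. This is not Amazon's code or any real speech library: the `Utterance` type and the three stage functions are hypothetical stand-ins, used only to show why tone and pacing are lost once speech is reduced to plain text between stages.

```python
from dataclasses import dataclass


@dataclass
class Utterance:
    """Hypothetical stand-in for spoken audio: the words plus how they were said."""
    words: str
    tone: str  # e.g. "hesitant", "excited"
    pace: str  # e.g. "slow", "fast"


def speech_to_text(audio: Utterance) -> str:
    # Stage 1 (speech recognition): only the words survive as plain text.
    return audio.words


def language_model(prompt: str) -> str:
    # Stage 2 (LLM): a canned reply; it receives text only, never tone or pace.
    return f"You said: {prompt}"


def text_to_speech(text: str) -> Utterance:
    # Stage 3 (text-to-speech): knows nothing about the speaker, so it uses defaults.
    return Utterance(words=text, tone="neutral", pace="medium")


def cascaded_pipeline(audio: Utterance) -> Utterance:
    # The stages only exchange strings, so prosody is dropped after stage 1.
    return text_to_speech(language_model(speech_to_text(audio)))


question = Utterance(words="um, can you help me", tone="hesitant", pace="slow")
reply = cascaded_pipeline(question)
print(reply.tone, reply.pace)  # prints "neutral medium"
```

The speaker's hesitant, slow delivery never reaches the later stages, which is the limitation a single unified speech model is meant to avoid.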

Because Nova Sonic combines all of this in a single model, it can adapt to the acoustic context of the input speech. It also responds more naturally to the cadences of human speech; for instance, it won’t interrupt when the speaker hesitates or pauses to take a breath.

How to get Nova Sonic

Nova Sonic is currently available through a new API in Amazon Bedrock, the company’s enterprise application building platform, and is intended to simplify the development of voice applications.

What developers need to know about Amazon Nova

The tech giant recently released Amazon Nova Act, a new AI model trained to perform actions within a web browser. In addition, there’s an Amazon Nova SDK for developers to explore. One of the foundation models is Nova Canvas, which generates high-quality images; there are also models for generating text from different modalities as well as videos from text and image input.

roosho Senior Engineer (Technical Services)