This AI Tool Will Let You Customise Voices for AI Systems
Hume, a New York-based artificial intelligence (AI) firm, unveiled a new tool on Monday that will allow users to customise AI voices. Dubbed Voice Control, the new feature is aimed at helping developers integrate these voices into their chatbots and other AI-based applications. Instead of offering a large range of voices, the company offers granular control over 10 different dimensions of voice. By selecting the desired value for each dimension, users can generate unique voices for their apps.
The company detailed the new AI tool in a blog post. Hume stated that it is trying to solve the problem enterprises face in finding the right AI voice to match their brand identity. With this feature, users can customise different aspects of how a voice is perceived, allowing developers to create a more assertive, relaxed, or buoyant voice for AI-based applications.
Hume’s Voice Control is currently available in beta, but it can be accessed by anyone registered on the platform. Gadgets 360 staff members were able to access the tool and test the feature. There are 10 different dimensions developers can adjust: gender, assertiveness, buoyancy, confidence, enthusiasm, nasality, relaxedness, smoothness, tepidity, and tightness.
Instead of adding prompt-based customisation, the company has added a slider that goes from -100 to +100 for each of the dimensions. The company stated that this approach was taken to eliminate the vagueness associated with a textual description of a voice and to offer granular control over the languages.
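To make the slider model concrete, here is a minimal Python sketch of how an application might represent and validate such settings before passing them on. It is an illustration only and not Hume's actual API or schema: the function name `build_voice_settings`, the lowercase dimension keys, and the choice of 0 as the default for untouched sliders are all assumptions. Only the ten dimension names and the -100 to +100 range come from the description above.

```python
# Hypothetical sketch: this is NOT Hume's API, only a local data model.
# The ten dimension names and the slider range are taken from the article.
DIMENSIONS = (
    "gender", "assertiveness", "buoyancy", "confidence", "enthusiasm",
    "nasality", "relaxedness", "smoothness", "tepidity", "tightness",
)
SLIDER_MIN, SLIDER_MAX = -100, 100


def build_voice_settings(**overrides):
    """Return a full settings dict; untouched dimensions stay at 0 (assumed default)."""
    settings = {name: 0 for name in DIMENSIONS}
    for name, value in overrides.items():
        if name not in settings:
            raise ValueError(f"unknown dimension: {name}")
        if not SLIDER_MIN <= value <= SLIDER_MAX:
            raise ValueError(
                f"{name} must be between {SLIDER_MIN} and {SLIDER_MAX}"
            )
        settings[name] = value
    return settings


if __name__ == "__main__":
    # A more assertive, slightly less relaxed voice.
    print(build_voice_settings(assertiveness=60, relaxedness=-20))
```

Rejecting out-of-range or misspelled dimensions up front mirrors the point of the slider design: each value is an explicit number on a fixed scale rather than an open-ended text description.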
In our testing, we found that changing any of the 10 dimensions makes an audible difference to the AI voice, and the tool was able to disentangle the different dimensions correctly. The AI firm claimed that this was achieved by developing a new “unsupervised approach” which preserves most characteristics of each base voice when specific parameters are varied. Notably, Hume did not detail the source of the procured data.
After creating an AI voice, developers will have to deploy it to their application by configuring the company's Empathic Voice Interface (EVI) AI model. While the company did not specify, the EVI-2 model was likely used for this experimental feature.
In the future, Hume plans to expand the range of base voices, introduce additional interpretable dimensions, enhance the preservation of voice characteristics under extreme modifications, and develop advanced tools to analyse and visualise voice characteristics.