Home Tech OpenAI unveils easy voice assistant creation at 2024 developer event

OpenAI unveils easy voice assistant creation at 2024 developer event

39
0
OpenAI unveils easy voice assistant creation at 2024 developer event

Developers developers developers —

        Altman steps again from the keynote limelight and lets four major API additions keep the speaking.


– Oct 1, 2024 7:16 pm UTC

A lovely OpenAI logo on a blue background.
Benj Edwards

On Monday, OpenAI kicked off its annual DevDay event in San Francisco, unveiling four major API updates for developers that integrate the firm’s AI units into their products. Unlike last three hundred and sixty five days’s single-location event featuring a keynote by CEO Sam Altman, DevDay 2024 is greater than factual at some point, adopting a world reach with extra events planned for London on October 30 and Singapore on November 21.

The San Francisco event, which turned into invitation-handiest and closed to press, featured on-stage audio system going thru technical presentations. Per chance the most notable unusual API feature is the Realtime API, now in public beta, which helps speech-to-speech conversations the utilize of six preset voices and permits developers to originate features very fair like ChatGPT’s Superior Voice Mode (AVM) into their applications.

OpenAI says that the Realtime API streamlines the ability of creating voice assistants. Previously, developers needed to make utilize of extra than one units for speech recognition, text processing, and text-to-speech conversion. Now, they’ll deal with your complete job with a single API call.

The firm plans so as to add audio input and output capabilities to its Chat Completions API in the next few weeks, allowing developers to input text or audio and receive responses in both format.
Two unusual alternatives for more affordable inference
OpenAI additionally offered two features that would possibly per chance presumably well fair support developers steadiness efficiency and price when making AI applications. “Model distillation” offers a diagram for developers to pretty-tune (customise) smaller, more affordable units love GPT-4o mini the utilize of outputs from extra progressed units reminiscent of GPT-4o and o1-preview. This per chance permits developers to bring collectively extra relevant and accurate outputs whereas operating the more affordable mannequin.

Additionally, OpenAI offered “advised caching,” a feature fair like one offered by Anthropic for its Claude API in August. It hurries up inference (the AI mannequin generating outputs) by remembering ceaselessly stale prompts (input tokens). Along the model, the feature offers a 50 percent good deal on input tokens and sooner processing cases by reusing lately considered input tokens.

And last however no longer least, the firm expanded its pretty-tuning capabilities to incorporate images (what it calls “vision pretty-tuning”), allowing developers to customise GPT-4o by feeding it both personalized images and text. Normally, developers can educate the multimodal model of GPT-4o to visually acknowledge definite things. OpenAI says the unusual feature opens up probabilities for improved visual search efficiency, extra accurate object detection for self reliant vehicles, and presumably enhanced scientific relate analysis.

Where’s the Sam Altman keynote?
OpenAI CEO Sam Altman speaks for the duration of the OpenAI DevDay event on November 6, 2023, in San Francisco.

Amplify / OpenAI CEO Sam Altman speaks for the duration of the OpenAI DevDay event on November 6, 2023, in San Francisco.

Getty Shots

Unlike last three hundred and sixty five days, DevDay just isn’t any longer being streamed live, though OpenAI plans to put up utter later on its YouTube channel. The event’s programming involves breakout sessions, neighborhood spotlights, and demos. But the greatest trade since last three hundred and sixty five days is the inability of a keynote look from the firm’s CEO. This three hundred and sixty five days, the keynote turned into handled by the OpenAI product crew.

On last three hundred and sixty five days’s inaugural DevDay, November 6, 2023, OpenAI CEO Sam Altman delivered a Steve Jobs-model live keynote to assembled developers, OpenAI staff, and the click. At some point of his presentation, Microsoft CEO Satya Nadella made a shock look, speaking up the partnership between the corporations.

Eleven days later, the OpenAI board fired Altman, triggering per week of turmoil that resulted in Altman’s return as CEO and a unusual board of administrators. Honest after the firing, Kara Swisher relayed insider sources that stated Altman’s DevDay keynote and the introduction of the GPT store had been a precipitating squawk in the firing (though no longer the key squawk) attributable to some internal disagreements over the firm’s extra individual-love path since the launch of ChatGPT.

With that history in suggestions—and the major target on developers above all else for this event—presumably the firm made up our minds it turned into most effective to let Altman step some distance off from the keynote and let OpenAI’s know-how develop into the major focus of the event as an alternative of him. We are purely speculating on that point, however OpenAI has completely skilled its share of drama all around the last month, so it is going to fair absorb been a prudent resolution.

Despite the inability of a keynote, Altman is fresh at Dev Day San Francisco this day and is scheduled to keep a closing “fireplace chat” at the end (which has no longer but came about as of this writing).

 » …
Read More

LEAVE A REPLY

Please enter your comment!
Please enter your name here