{"id":1924122,"date":"2026-05-07T22:24:50","date_gmt":"2026-05-07T19:24:50","guid":{"rendered":"https:\/\/analyse.optim.biz\/?p=1924122"},"modified":"2026-05-07T22:24:50","modified_gmt":"2026-05-07T19:24:50","slug":"openai-launches-new-voice-intelligence-features-in-its-api","status":"publish","type":"post","link":"https:\/\/analyse.optim.biz\/?p=1924122","title":{"rendered":"OpenAI launches new voice intelligence features in its API"},"content":{"rendered":"<p>[analyse_image type=&#8221;featured&#8221; src=&#8221;https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/01\/GettyImages-2170386424.jpg?w=1024&#8243;]<\/p>\n<div class=\"entry-content wp-block-post-content is-layout-constrained wp-block-post-content-is-layout-constrained\">\n<p id=\"speakable-summary\" class=\"wp-block-paragraph\">OpenAI said Thursday that its API will now include a number of new voice intelligence features designed to help developers create apps that can talk, transcribe, and translate conversations with users.<\/p>\n<p class=\"wp-block-paragraph\">The company\u2019s new GPT\u2011Realtime\u20112 is another voice model, built to create a realistic vocal simulation that can converse with users. However, unlike its predecessor (GPT-Realtime-1.5) this one is built with GPT\u20115\u2011class reasoning that OpenAI says was created to deal with more complicated requests from users.<\/p>\n<p class=\"wp-block-paragraph\">The company is also launching GPT\u2011Realtime\u2011Translate, which, just as it sounds, is designed to provide real-time translation services that \u201ckeep pace\u201d with the user, conversationally. The feature includes more than 70 input languages (that is, the languages that it can comprehend) and 13 output languages (the languages it relays to the speaker).<\/p>\n<p class=\"wp-block-paragraph\">Finally, the company has also launched a new transcription capability, GPT-Realtime-Whisper, which gives users live speech-to-text capabilities that are captured as interactions occur.<\/p>\n<p class=\"wp-block-paragraph\">\u201cTogether, the models we are launching move real-time audio from simple call-and-response toward voice interfaces that can actually do work: listen, reason, translate, transcribe, and take action as a conversation unfolds,\u201d the company said.<\/p>\n<p class=\"wp-block-paragraph\">Who will these updates be good for? Companies that want to expand customer service capabilities are an obvious target. However, OpenAI also notes that its new features will assist with a wide array of areas, including education, media, events, and creator platforms, among others.<\/p>\n<p class=\"wp-block-paragraph\">As useful as these tools seem from an enterprise perspective, it also seems plausible that they could be misused. The company said it has built guardrails to stop its new features from being abused to create spam, fraud, or other forms of online abuse. Certain triggers have been embedded in the system so that \u201cconversations can be halted if they are detected as violating our harmful content guidelines,\u201d OpenAI said.<\/p>\n<div class=\"wp-block-techcrunch-inline-cta\">\n<div class=\"inline-cta__wrapper\">\n<div class=\"inline-cta__flag\">Techcrunch event<\/div>\n<div class=\"inline-cta__content\">\n<div class=\"inline-cta__header-container\">\n<div class=\"inline-cta__header-container-desktop\">\n<h3 class=\"inline-cta__header has-h-5-font-size\">This Week Only: Buy one pass, get the second at 50% off<\/h3>\n<h4 class=\"inline-cta__subheader\">Your next round. Your next hire. Your next breakout opportunity. Find it at TechCrunch Disrupt 2026, where 10,000+ founders, investors, and tech leaders gather for three days of 250+ tactical sessions, powerful introductions, and market-defining innovation. Register before May 8 to bring a +1 at half the cost.<\/h4>\n<\/div>\n<div class=\"inline-cta__header-container-mobile\">\n<h3 class=\"inline-cta__header has-h-5-font-size\">This Week Only: Buy one pass, get the second at 50% off<\/h3>\n<h4 class=\"inline-cta__subheader\">Your next round. Your next hire. Your next breakout opportunity. Find it at TechCrunch Disrupt 2026, where 10,000+ founders, investors, and tech leaders gather for three days of 250+ tactical sessions, powerful introductions, and market-defining innovation. Register before May 8 to bring a +1 at half the cost.<\/h4>\n<\/div>\n<\/div>\n<div class=\"inline-cta__event-info\"><span class=\"inline-cta__location\">San Francisco, CA<\/span><span class=\"inline-cta__separator\">|<\/span><span class=\"inline-cta__date\">October 13-15, 2026<\/span><\/div>\n<div class=\"inline-cta__register-button\">\n<div class=\"wp-block-buttons is-layout-flex wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button\">REGISTER NOW<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<p class=\"wp-block-paragraph\">All of the new voice models are included in OpenAI\u2019s Realtime API. Translate and Whisper are billed by the minute, while GPT-Realtime-2 is billed by token consumption.<\/p>\n<\/div>\n<p>[analyse_source url=&#8221;https:\/\/techcrunch.com\/2026\/05\/07\/openai-launches-new-voice-intelligence-features-in-its-api\/&#8221;]<\/p>\n","protected":false},"excerpt":{"rendered":"<p>[analyse_image type=&#8221;featured&#8221; src=&#8221;https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/01\/GettyImages-2170386424.jpg?w=1024&#8243;] OpenAI said Thursday that its API will now include a number of new voice intelligence features designed to help developers create apps that can talk, transcribe, and translate conversations with users. The company\u2019s new GPT\u2011Realtime\u20112 is another voice model, built to create a realistic vocal simulation that can converse with users. However, [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2],"tags":[226,62],"class_list":["post-1924122","post","type-post","status-publish","format-standard","hentry","category-politics","tag-crawlmanager","tag-techcrunch-com"],"_links":{"self":[{"href":"https:\/\/analyse.optim.biz\/index.php?rest_route=\/wp\/v2\/posts\/1924122","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/analyse.optim.biz\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/analyse.optim.biz\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/analyse.optim.biz\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/analyse.optim.biz\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=1924122"}],"version-history":[{"count":0,"href":"https:\/\/analyse.optim.biz\/index.php?rest_route=\/wp\/v2\/posts\/1924122\/revisions"}],"wp:attachment":[{"href":"https:\/\/analyse.optim.biz\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=1924122"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/analyse.optim.biz\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=1924122"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/analyse.optim.biz\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=1924122"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}