Realtime regression in non-English production voice agents

Realtime regression in non-English production voice agents: gpt-realtime-mini vs gpt-realtime-mini-2025-10-06 - Deprecations - OpenAI Developer Community

Ted_Papan

May 10, 2026, 4:59pm

We operate a production AI voice platform built on the OpenAI Realtime API via WebSocket/SIP/Twilio.

We are currently deploying our solution across a few dozen locations nationwide for an enterprise client, and we have encountered a material regression when comparing the dated Realtime snapshot we validated in production against the listed replacement model.

Model validated in production:

gpt-realtime-mini-2025-10-06

Listed replacement:

gpt-realtime-mini

Issue:

In non-English voice-agent flows, with Romanian as our current production case, the replacement model shows noticeably worse language quality and worse faithfulness to supplied business data.

The most serious issue is not just spelling or phrasing. We have observed the newer model hallucinating nonexistent departments, services, and operational details that were not present in the database/context. The older snapshot, gpt-realtime-mini-2025-10-06, has been significantly more faithful to the provided information and less prone to confabulating unavailable services or internal departments.

This is important because the older snapshot was not selected casually. It was selected after thousands of hours of testing, R&D, and practical validation in Romanian-language voice-agent scenarios. Its reliability in staying faithful to provided business information is one of the reasons we currently depend on it for production deployments.

Impact:

This affects an active enterprise rollout across a few dozen locations nationwide. The regression impacts:

live AI phone conversations;

appointment and call summaries;

CRM/customer records;

operational reporting;

client trust during rollout.

We are concerned that this may not be limited to Romanian, but may reflect broader non-English quality/faithfulness differences between the dated snapshot and the current gpt-realtime-mini alias.

Evidence:

We have transcription evidence and can provide side-by-side examples comparing the same or similar flows between gpt-realtime-mini-2025-10-06 and gpt-realtime-mini.
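For anyone who wants to run a similar side-by-side comparison, here is a minimal sketch of one way to flag unsupported mentions in transcripts. It is not the tooling used for the evidence above. The function names, the `vocabulary`/`available` split, and the sample data are all hypothetical, and plain substring matching will miss inflected Romanian forms.

```python
import unicodedata


def normalize(text: str) -> str:
    """Case-fold and strip diacritics so that e.g. 'Secție' matches 'sectie'."""
    decomposed = unicodedata.normalize("NFKD", text.casefold())
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def unsupported_mentions(transcript: str, vocabulary: list[str], available: list[str]) -> list[str]:
    """Return vocabulary terms that appear in the transcript but are not
    in the set of items actually supplied to the model as context.

    vocabulary: every department/service name worth watching for (hypothetical).
    available:  the subset that really exists in the supplied business data.
    """
    text = normalize(transcript)
    allowed = {normalize(term) for term in available}
    return sorted(
        term for term in vocabulary
        if normalize(term) in text and normalize(term) not in allowed
    )


# Hypothetical sample data, for illustration only.
VOCABULARY = ["cardiologie", "pediatrie", "radiologie"]
AVAILABLE = ["cardiologie", "pediatrie"]

transcript_a = "Avem programări la cardiologie și la pediatrie."
transcript_b = "Avem programări la cardiologie și la Radiologie."

flags_a = unsupported_mentions(transcript_a, VOCABULARY, AVAILABLE)
flags_b = unsupported_mentions(transcript_b, VOCABULARY, AVAILABLE)
```

Running the same check over transcripts from both models on the same flows gives a rough per-model count of unsupported mentions that can sit alongside manual review.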

Questions:

Has anyone else observed worse non-English performance or worse faithfulness to supplied data on gpt-realtime-mini compared to dated Realtime snapshots?

Is OpenAI tracking language-specific regressions for Realtime models before snapshot deprecations?

Is there a path for production customers to request temporary extended access or a migration path when a listed replacement model is not behaviorally equivalent?

We are committed to building on OpenAI’s Realtime infrastructure, but we need a reliable migration path before moving production enterprise traffic away from the currently working snapshot.

vb

May 10, 2026, 5:51pm

Hi!

Thank you for raising this.

You are probably already aware that you can still use gpt-realtime-mini-2025-10-06. You do not need to use the undated model slug, which currently points to the December snapshot.

For production systems, dated model versions are usually the better choice because they help keep behavior more consistent.
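As a rough illustration of pinning, here is a minimal sketch that builds the Realtime WebSocket URL with the dated snapshot as the `model` query parameter. The helper names are hypothetical, and the date-suffix check is only a heuristic based on the slugs mentioned in this thread.

```python
import re
from urllib.parse import urlencode

# Dated snapshot named in this thread; the undated alias can be repointed.
PINNED_MODEL = "gpt-realtime-mini-2025-10-06"
REALTIME_WS_BASE = "wss://api.openai.com/v1/realtime"


def realtime_ws_url(model: str = PINNED_MODEL) -> str:
    """Build the WebSocket URL with the model passed as a query parameter."""
    return f"{REALTIME_WS_BASE}?{urlencode({'model': model})}"


def is_dated_snapshot(model: str) -> bool:
    """Heuristic (assumption): dated slugs end in -YYYY-MM-DD."""
    return re.search(r"-\d{4}-\d{2}-\d{2}$", model) is not None


def require_pinned(model: str) -> str:
    """Refuse to start with an undated alias, so an alias change cannot
    silently alter production behavior."""
    if not is_dated_snapshot(model):
        raise ValueError(f"{model!r} is an undated alias; pin a dated snapshot")
    return model


url = realtime_ws_url(require_pinned(PINNED_MODEL))
```

A startup check like `require_pinned` is one way to keep an undated slug from slipping into a production config.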

Since I do not see a deprecation notice for the older snapshot, using it seems reasonable here.

In this case, I would suggest staying on the older snapshot for now and waiting for a possible gpt-realtime-mini-2 release, or reviewing the Realtime model best practices here:

Using realtime models | OpenAI API

Learn how to use realtime voice models, including Realtime 2 reasoning, preambles, tool use, and migration from earlier realtime models.

Ted_Papan

May 10, 2026, 6:49pm

Thank you for the reply.

Just to clarify one important point: gpt-realtime-mini-2025-10-06 does appear to be listed on the official OpenAI...
