An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
GPT-4.1 EU Data Zone Standard: intermittent token corruption in Dutch output — English insertion, gibberish, truncation
We are experiencing intermittent but severe output quality degradation with GPT-4.1 on Data Zone Standard (EU). We generate Dutch-language structured reports and load-balance across multiple EU Azure OpenAI instances.
Symptom A subset of production requests produces corrupted output. We have classified the corruption into 4 distinct categories:
- Random English insertion — valid English words injected into Dutch text Examples: "luxury", "pipeline", "timeline", "prestige", "cup", "deep"
- Gibberish / corrupted tokens — non-words or garbled text Examples: "musteraan", "plumeert", "jeae", "Êr", "resolootje"
- Truncated / abbreviated words — words cut off mid-way Examples: "Onbek vac.", "NBermogen", "sup.", "gem", "pn"
- Word substitution — wrong Dutch words or grammar Examples: "de has", "based" substituted for correct Dutch
Scale Multiple affected production sessions identified. The majority of traffic is handled correctly — this is intermittent but consistent over time.
Hypothesis The intermittent nature and variety of corruption types suggests one or more degraded inference nodes in the EU Data Zone pool. The corruption is not prompt-related or input-related; it appears infrastructure-side.
Configuration
- Model: gpt-4.1
- Deployment type: Data Zone Standard (EU)
Priority This is our highest priority issue. The corruption is severe enough to fundamentally undermine trust in AI-generated output. Because we are on Data Zone Standard, we have no control over request routing and cannot mitigate this ourselves — the fix must come from Microsoft's side.
We want to ensure this is visible to the Azure OpenAI infrastructure team and is being actively investigated.