Add `encrypted_content_affinity` to config.yaml for future Responses API load balancing

Currently, the LiteLLM proxy configuration does not include the `encrypted_content_affinity` pre-call check.

For Responses API models (like `gpt-5.1-codex` and `gpt-5.3-codex`), if clients chain calls by passing `previous_response_id` or encrypted reasoning items, these follow-up requests *must* route to the same Azure deployment that generated the encryption key.

If we ever add multiple deployments per `model_name` (e.g., load balancing `gpt-5.1-codex` across both `germanywestcentral` and `swedencentral` under the exact same alias), requests will fail with `invalid_encrypted_content` errors unless affinity routing is enabled.

To fix this when we expand to load balancing:
```yaml
router_settings:
  optional_pre_call_checks:
    - encrypted_content_affinity
```

**Note:** This is not strictly necessary right now because each `model_name` in `openai.tf` maps to a single regional deployment. We are tracking this purely for future scale-out scenarios.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `encrypted_content_affinity` to config.yaml for future Responses API load balancing #2

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Add encrypted_content_affinity to config.yaml for future Responses API load balancing #2

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions

Add `encrypted_content_affinity` to config.yaml for future Responses API load balancing #2