Browse docs
--- title: "Failover" description: "Configure Aurora failover with manual rules, understand experimental auto mode, and know when fallback attempts run." icon: "shuffle" keywords: ["failover", "fallback"] ---
Overview
Aurora exposes failover through the fallback config block.
When a request fails, Aurora can retry it against alternate models. For predictable behavior, use manual mode.
Failover is an OSS-capable gateway feature. Enterprise advanced routing can add additional adaptive routing controls, but manual fallback chains do not require Enterprise.
Manual Mode
Manual mode is the recommended mode today.
fallback:
default_mode: "manual"
manual_rules_path: "config/fallback.json"config/fallback.json is a JSON object where each model entry contains an ordered candidate list (array); top-level keys are not ordered:
{
"<primary model id>": [
"<provider-name>/<first fallback model id>",
"<provider-name>/<second fallback model id>"
]
}The order-sensitive part is the array under each model entry.
Aurora tries the listed candidates in order and stops on the first success. Use bare model names or provider-qualified selectors from GET /admin/api/v1/models.
If needed, you can override the mode per model with fallback.overrides.
Auto Mode
fallback:
default_mode: "auto"
manual_rules_path: "config/fallback.json"Auto mode keeps any manual candidates first, then appends up to five extra candidates from the current model registry. It prefers models with the same request category, similar rankings, overlapping capabilities, and the same family when possible.
When It Runs
Failover is attempted only after the primary request returns:
5xx429- model unavailable, unsupported, or not found style errors
It currently applies to translated /v1/chat/completions and /v1/responses requests, not /v1/embeddings.
Manual and API usage
Manual fallback rules are boot configuration. Edit fallback.manual_rules_path and restart or refresh runtime state if your deployment supports runtime refresh.
Server automation can update the JSON rules file in your deployment system and then call:
curl -X POST http://your-aurora-host/admin/api/v1/runtime/refresh \
-H "Authorization: Bearer $AURORA_MASTER_KEY"Use workflows to turn fallback behavior on or off for a provider, model, or user_path scope. See Workflows.