LLM routing, without the glue code
What is LLM routing on Backboard?
Backboard lets you call 17,000+ models from a single endpoint and change which model you use at any time, without rewriting your app. Instead of hard‑coding every provider's SDK and payload quirks, you:
LLM ROUTING
Why engineers route through Backboard
From benchmark-leading memory to BYOK with no token markup — everything built into one stateful API.
how it works
How LLM routing works
You call a single msg‑style endpoint and pass the model (or routing rule), the state or conversation ID, and optional tools: memory, RAG, web search, custom tools.
routing patterns
Routing patterns you can implement
Same state, same memory, same tools—different models for different jobs.
how it works
Why not just build your own router?
Wiring a couple of models is easy. The hard parts are what Backboard solves.
Wiring a couple of models is easy. The hard parts:
Keeping state and memory consistent across models and providers
Handling different context windows without losing important info
Tracking cost, latency, and usage when logic is scattered
Making RAG, web search, and tools behave the same for every model
Managing multiple keys and pricing models without accidentally overpaying
Backboard gives you:
A unified API for 17,000+ models
BYOK support for major providers with no token markup
Access to many free models for experimentation and background work
Free state management and Adaptive Context Management baked in
Best‑in‑class configurable memory (lite and pro), plus RAG and web search
You integrate once and get world‑leading routing everywhere.