01 · The Story
Meet Casey
- PM dropped a Jira ticket: “Add AI chat to the support flow. Sprint ends Friday.”
- Heard about OpenRouter: one API key, every model, automatic failover.
- Landed on openrouter.ai/models. 503 models stared back.
- Picked GPT-4o based on a Reddit thread. Cost $47 in week one.
Casey is a full-stack developer at a 40-person startup. Last Tuesday her PM dropped a Jira ticket: “Add AI chat to the support flow. Sprint ends Friday.” Not ML, not infra. Just Casey, a deadline, and a blank file.
She'd heard about OpenRouter. One key, every model, automatic failover. She went to openrouter.ai/models.
503 models.
She didn't know if she needed Claude or GPT-4o or Llama. She didn't know what a context window meant for a support bot running roughly 200 conversations a day. She didn't know if $3.50 per million tokens was cheap or ruinous at that volume.
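A rough way to answer the "cheap or ruinous" question is to multiply it out. The sketch below uses the 200 conversations a day and the $3.50 per million tokens from the story; the 2,000 tokens per conversation figure is purely an assumption for illustration, and the function name is hypothetical. Real pricing usually charges input and output tokens at different rates, so treat a single blended price as a simplification.

```python
def monthly_cost_usd(conversations_per_day, tokens_per_conversation,
                     price_per_million_tokens, days=30):
    """Estimate monthly spend from volume and a single blended token price."""
    total_tokens = conversations_per_day * tokens_per_conversation * days
    return total_tokens / 1_000_000 * price_per_million_tokens


# 200 conversations/day and $3.50 per million tokens come from the story.
# 2,000 tokens per conversation is an assumed figure, not a measured one.
estimate = monthly_cost_usd(200, 2_000, 3.50)
print(f"${estimate:.2f} per month")  # → $42.00 per month
```

Under those assumptions the bill is about $42 a month. Longer conversations or a pricier model scale the figure linearly, so the tokens-per-conversation guess is the number worth measuring first.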
She opened Reddit. Found a thread from 8 months ago. Someone said “just use GPT-4o.” She used GPT-4o. Cost her $47 in the first week. Her PM was not pleased.
Three weeks later, the model ID she'd hardcoded changed without warning. Feature broke in production. She found out at 2am from an error alert, not from OpenRouter.
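One way to soften this failure mode is to avoid depending on a single hardcoded ID. The sketch below assumes the application has already obtained the set of currently available model IDs (for instance from the provider's model listing at startup, not shown here) and walks an ordered preference list. The function name and every model ID are hypothetical placeholders, not real OpenRouter identifiers.

```python
def pick_model(preferred, available):
    """Return the first preferred model ID that is still available, else None."""
    for model_id in preferred:
        if model_id in available:
            return model_id
    return None


# Hypothetical IDs for illustration only.
PREFERRED = ["vendor/primary-model", "vendor/backup-model"]

# Assumed to have been fetched from the provider's model listing.
available_now = {"vendor/backup-model", "other/unrelated-model"}

chosen = pick_model(PREFERRED, available_now)
if chosen is None:
    raise RuntimeError("No configured model is currently available")
print(chosen)  # → vendor/backup-model
```

Running a check like this at deploy time or on a schedule would turn a renamed ID into a visible warning rather than a 2am production error.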
OpenRouter had the right answer the whole time. It just wasn't findable.
“The bottleneck isn't the model. It's knowing which model.”
“Just grab the top ~30 models on OpenRouter and test them all.” (news.ycombinator.com)
“Costs 100 times higher than expected. Nearly $50 for less than 100 lines of output.” (trustpilot.com)
“OpenRouter keeps changing the model ID names, causing errors.” (trustpilot.com)
Real quotes from public sources. 39 OpenRouter models were silently deprecated in a single LiteLLM update, breaking live integrations with no warning.