Decision Signal
HealthyCurrent gross margin is 95.1%. Use this to choose whether pricing and costs are ready before scaling usage.
Step 1 Provider and Model
Switch model assumptions using prices from the selected snapshot.Step 2 Quick Mode
Use plain-language assumptions first. Open Advanced assumptions only if needed.Repo-aware coding workflows where active developer usage, context load, and model routing move unit cost fast.
- Need help estimating inputs? Read token sizing and reranking basics.
Step 3 Advanced Assumptions
Tune retrieval, reranking, embeddings, vector, caching, and infra.Show advanced inputs
Only adjust these once your Quick Mode assumptions feel realistic.
Scenario actions
Copy scenario URL
Paste into ChatGPT or Claude, or share with a teammate.
Save and track this scenario
Track pricing drift on this scenario and get an email if the latest result changes.
How tracking works
After you click Save and track, we carry this exact calculator state into the tracked-scenarios page so you can sign in and confirm the save.
We save your assumptions and the pricing snapshot used for this result.
When a newer pricing snapshot lands, we recompute the same scenario, show what changed, and email you if the latest result moved.
1 tracked scenario free, then $12/mo or $120/yr for up to 25 tracked scenarios.
Cost / user / month
$2.3954Gross margin
95.1%Estimated monthly AI cost
$526.98Estimated monthly gross profit
$10,253.02Top Cost Drivers
Most sensitive variables when each is moved up by 10%.Totals
Summary metrics for monthly unit economics and margin.| Cost per request | $0.02178 |
| Cost per user/month | $2.3954 |
| Gross margin % | 95.1% |
| Break-even price | $2.3954 |
Component Breakdown (USD/user/month)
Each cost component is computed independently and summed.Largest cost block: reranking, not generation.
| GenerationModel input/output token spend for requests. | $0.7854 |
| RetrievalExtra model input spend from retrieved context chunks. | $0.2541 |
| RerankingReranker cost based on docs scored per request. | $1.98 |
| Embeddings IngestionAmortized per-user share of the fixed monthly corpus embedding refresh cost. | $0 |
| Vector DbVector database query cost across all requests. | $0.002 |
| CacheSavings from cache hits. Negative means lower total cost. | $-0.6756 |
| InfraNon-model infra overhead per request. | $0.0495 |
Sensitivity RankingChange in total cost when one variable is increased by 10%.
| Variable | Delta cost % |
|---|---|
| Requests Per User MonthUser activity level per month. | 10.0% |
| Rerank DocsDocs reranked per request. | 6.4% |
| Cache Hit RateFraction of requests served by cache. | -2.8% |
| Output TokensGenerated tokens per request. | 1.8% |
| Retrieved ChunksRetrieved chunk count per request. | 0.8% |
| Tokens Per ChunkAverage chunk size in tokens. | 0.8% |
| Input TokensPrompt-side tokens per request. | 0.8% |
| Vector Queries Per RequestVector query count per request. | 0.0% |
| Monthly Active UsersActive-user estimate used to amortize fixed monthly embedding refresh. | -0.0% |
Assumptions and Units
Explicit assumptions to keep outputs reproducible and auditable.- CurrencyUSD
- Token unittoken
- Pricing snapshot2026-04-12
- Selected model rowOpenAI/GPT-5.3 Codex
- Volume basisBusiness totals and fixed monthly terms use monthly active users as the denominator
- Embedding refreshAmortized per user from the fixed monthly corpus refresh term
- Cache componentNegative value means cost savings
Recommended Next Step
Use these links to lower top cost drivers without guessing.Optimize the biggest modeled cost driver first. Compare infra only after model, retrieval, reranking, or context changes stop being the better lever.
Compare infra providers
View Infra RecommendationsSources and Snapshot
Pricing comes from the current dated snapshot.Active Pricing Row
Selected model
OpenAI / GPT-5.3 Codex
- Input tokens$1.75 / 1M
- Output tokens$14 / 1M
Shared retrieval defaults
- Embedding input$0.02 / 1M
- Rerank docs$1 / 1K
- Snapshot date: 2026-04-12
- Source links and update notes: Pricing Snapshot Reference
Continue Analysis
Move to the next tool or guide without losing your current scenario.Switch tools
- AI Workflow Cost
- Break-even Price
- Compare Model Costs
- Retrieval Cost
- Rerank Cost
- Cache Savings
- Prompt Overhead
- RAG or Long Prompt
- Indexing Cost
- Browse all tools
Read guides