Step 1 Provider and Model
Select pricing rows from the active snapshot.Step 2 Quick Mode
Set the key pricing and behavior assumptions first.Steady usage, moderate context, and balanced quality-cost tradeoffs.
Step 3 Advanced Assumptions
Tune retrieval, reranking, embeddings, vector, cache, and infra.Show advanced inputs
Scenario actions
Copy scenario URL
Paste into ChatGPT or Claude, or share with a teammate.
Save and track this scenario
Track pricing drift on this scenario and get an email if the latest result changes.
How tracking works
After you click Save and track, we carry this exact calculator state into the tracked-scenarios page so you can sign in and confirm the save.
We save your assumptions and the pricing snapshot used for this result.
When a newer pricing snapshot lands, we recompute the same scenario, show what changed, and email you if the latest result moved.
1 tracked scenario free, then $12/mo or $120/yr for up to 25 tracked scenarios.
Headline metric
Price likely supports target marginRequired price at 80.0% margin: $9.76 per user/month.
Current price: $49.00. Price gap to target: $-39.24.
Cost / user / month
$1.95Break-even price (0% margin)
$1.95Required monthly revenue
$6,346Implied monthly gross profit
$5,077Top Cost Drivers
Largest sensitivity shifts when each variable is increased by 10%.Totals
Pricing and margin summary under current assumptions.| Cost per request | $0.01627 |
| Cost per user/month | $1.95 |
| Current gross margin % | 96.0% |
| Required price at target margin | $9.76 |
Component Breakdown (USD/user/month)
Each component is computed independently then summed.| Generation | $0.11 |
| Retrieval | $0.04 |
| Reranking | $2.40 |
| Embeddings Ingestion | $0.00 |
| Vector Db | $0.00 |
| Cache | $-0.65 |
| Infra | $0.05 |
Sensitivity RankingDelta in total cost if one variable increases by 10%.
| Variable | Delta cost % |
|---|---|
| Requests Per User MonthUser activity level per month. | 10.0% |
| Rerank DocsDocs reranked per request. | 9.2% |
| Cache Hit RateFraction of requests served by cache. | -3.3% |
| Output TokensGenerated tokens per request. | 0.3% |
| Retrieved ChunksRetrieved chunk count per request. | 0.2% |
| Tokens Per ChunkAverage chunk size in tokens. | 0.2% |
| Input TokensPrompt-side tokens per request. | 0.1% |
| Vector Queries Per RequestVector query count per request. | 0.0% |
| Monthly Active UsersActive-user estimate used to amortize fixed monthly embedding refresh. | -0.0% |
Assumptions and Units
Explicit assumptions keep this decision reproducible.- CurrencyUSD
- Token unittoken
- Pricing snapshot2026-04-09
- Selected model rowOpenAI/GPT-5 Mini
- Volume basisBusiness totals and fixed monthly terms use monthly active users as the denominator
- Target margin formularequired_price = cost / (1 - margin)
Recommended Next Step
Use this section to turn break-even math into a pricing plan.If retrieval or hosting is driving cost, check infra options first, then come back to re-check price and margin.
Pricing and Packaging Plan
RAG Break-even Price per SeatHow to Calculate the Break-even Point for AI WorkflowsCompare infra providers
View Infra RecommendationsSources and Snapshot
Pricing comes from the current dated snapshot.Active Pricing Row
Selected model
OpenAI / GPT-5 Mini
- Input tokens$0.25 / 1M
- Output tokens$2 / 1M
Shared retrieval defaults
- Embedding input$0.02 / 1M
- Rerank docs$1 / 1K
- Snapshot date: 2026-04-09
- Source links and update notes: Pricing Snapshot Reference
Continue Analysis
Move to the next tool or guide without losing your current scenario.Switch tools
- AI Workflow Cost
- Compare Model Costs
- Retrieval Cost
- Rerank Cost
- Cache Savings
- Prompt Overhead
- RAG or Long Prompt
- Indexing Cost
- Browse all tools
Read guides