- Baseline
- 600,000,000
- Candidate
- 200,000,000
- Delta
- -400,000,000
Step 1 Provider and Model
Choose the pricing row used for monthly embedding refresh assumptions.Step 2 Quick Mode
Set baseline and candidate monthly refresh scope before tuning the wider workflow.Compare full refresh scope against narrower docs updates.
Optional Advanced assumptions
Tune the wider workflow only after the monthly refresh numbers look realistic.Show advanced inputs
Scenario actions
Copy scenario URL
Paste into ChatGPT or Claude, or share with a teammate.
Save and track this scenario
Track pricing drift on this scenario and get an email if the latest result changes.
How tracking works
After you click Save and track, we carry this exact calculator state into the tracked-scenarios page so you can sign in and confirm the save.
We save your assumptions and the pricing snapshot used for this result.
When a newer pricing snapshot lands, we recompute the same scenario, show what changed, and email you if the latest result moved.
1 tracked scenario free, then $12/mo or $120/yr for up to 25 tracked scenarios.
Headline metric
Candidate refresh plan lowers fixed monthly costThe candidate refresh plan changes fixed monthly embedding cost by -$8.00 and adds -$0.01 per active user at the current scale.
Baseline embedding cost / month
$12.00Candidate embedding cost / month
$4.00Amortized delta / user / month
-$0.01Break-even delta / user / month
-$0.01Totals
Compare the fixed monthly refresh term and its amortized per-user impact.- Baseline
- $12.00
- Candidate
- $4.00
- Delta
- -$8.00
- Baseline
- 2.3%
- Candidate
- 0.8%
- Delta
- -1.5%
- Baseline
- $0.015
- Candidate
- $0.005
- Delta
- -$0.01
| Metric | Baseline | Candidate | Delta |
|---|---|---|---|
| Embedding tokens / month | 600,000,000 | 200,000,000 | -400,000,000 |
| Embedding cost / month | $12.00 | $4.00 | -$8.00 |
| Embedding share of modeled cost | 2.3% | 0.8% | -1.5% |
| Amortized embedding cost / user / month | $0.015 | $0.005 | -$0.01 |
Component Breakdown
Full workflow context helps show whether refresh is material or just noise.- Baseline
- $0.0435
- Candidate
- $0.0435
- Delta
- $0
- Baseline
- $0.0198
- Candidate
- $0.0198
- Delta
- $0
- Baseline
- $0.96
- Candidate
- $0.96
- Delta
- $0
- Baseline
- $0.015
- Candidate
- $0.005
- Delta
- -$0.01
- Baseline
- $0.0009
- Candidate
- $0.0009
- Delta
- $0
- Baseline
- $-0.3972
- Candidate
- $-0.3972
- Delta
- $0
- Baseline
- $0.021
- Candidate
- $0.021
- Delta
- $0
| Component | Baseline | Candidate | Delta |
|---|---|---|---|
| GenerationModel input/output token spend for requests. | $0.0435 | $0.0435 | $0 |
| RetrievalExtra model input spend from retrieved context chunks. | $0.0198 | $0.0198 | $0 |
| RerankingReranker cost based on docs scored per request. | $0.96 | $0.96 | $0 |
| Embeddings IngestionAmortized per-user share of the fixed monthly corpus embedding refresh cost. | $0.015 | $0.005 | -$0.01 |
| Vector DbVector database query cost across all requests. | $0.0009 | $0.0009 | $0 |
| CacheSavings from cache hits. Negative means lower total cost. | $-0.3972 | $-0.3972 | $0 |
| InfraNon-model infra overhead per request. | $0.021 | $0.021 | $0 |
Sensitivity RankingBaseline sensitivity: cost change when one variable is increased by 10%.
| Variable | Delta cost % |
|---|---|
| Requests Per User MonthUser activity level per month. | 9.8% |
| Rerank DocsDocs reranked per request. | 9.0% |
| Cache Hit RateFraction of requests served by cache. | -6.0% |
| Output TokensGenerated tokens per request. | 0.3% |
| Monthly Active UsersActive-user estimate used to amortize fixed monthly embedding refresh. | -0.2% |
| Retrieved ChunksRetrieved chunk count per request. | 0.2% |
| Tokens Per ChunkAverage chunk size in tokens. | 0.2% |
| Input TokensPrompt-side tokens per request. | 0.1% |
| Vector Queries Per RequestVector query count per request. | 0.0% |
Assumptions and Units
Explicit assumptions to keep refresh-cost planning reproducible and auditable.- CurrencyUSD
- Token unittoken
- Pricing snapshot2026-04-29
- Selected model rowOpenAI / GPT-5 Mini
- Fixed monthly termEmbedding refresh is modeled as a fixed monthly cost
- AmortizationPer-user impact divides the fixed monthly refresh term by monthly active users
Recommended Next Step
Use this section to turn the refresh-cost estimate into an indexing plan.If refresh cost is material, review infra and indexing constraints before widening your update cadence.
Compare infra providers
View Infra RecommendationsSources and Snapshot
Pricing comes from the current dated snapshot.Active Pricing Row
Active pricing row
OpenAI / GPT-5 Mini
- Input tokens$0.25 / 1M
- Output tokens$2 / 1M
Shared retrieval defaults
- Embedding input$0.02 / 1M
- Rerank docs$1 / 1K
- Snapshot date: 2026-04-29
- Source links and update notes: Pricing Snapshot Reference
Continue Analysis
Move to the next tool or guide without losing your current scenario.Switch tools
Read guides