What does this calculator estimate?

It compares baseline and candidate monthly embedding refresh plans and reports fixed monthly cost delta, amortized per-user impact, and how much of modeled cost comes from refreshes.

Should I re-embed the whole corpus after every change?

Usually no. Use this output to quantify the cost of full re-indexes versus narrower refresh scopes before you set your indexing cadence.

Embedding Ingestion Cost Calculator

Pricing snapshot: 2026-06-14Provider: OpenAIModel: GPT-5 Mini

Step 1 Provider and Model

ProviderModel

Step 2 Quick Mode

Use-case preset

Compare full refresh scope against narrower docs updates.

Monthly active usersUsed to convert fixed monthly refresh cost into per-user context.Baseline embedding tokens / monthCurrent monthly refresh scope before any optimization.Requests per user / monthUsed to keep refresh cost in context against the broader workflow.Candidate embedding tokens / monthMonthly token volume in the candidate refresh plan.Price per user / month (USD)Current list price used for per-user pricing context.

Optional Advanced assumptions

Show advanced inputs

Base prompt tokens / requestNon-retrieval prompt tokens used to keep total workflow cost in context.Output tokens / requestAverage response length for the same workflow.Retrieved chunks / requestAverage chunk count used by the same workflow.Tokens per chunkAverage chunk size in tokens.Rerank docs / requestDocuments reranked in the same workflow.Vector queries / requestVector DB lookups tied to the same workflow.Vector cost / query (USD)Average per-query vector DB cost.Infra cost / request (USD)Non-model compute and network overhead for the same workflow.Cache hit rate (0 to 0.99)Share of requests served from cache.

Scenario actions

Copy scenario URL

Paste into ChatGPT or Claude, or share with a teammate.

Save and track this scenario

Track pricing drift on this scenario and get an email if the latest result changes.

How tracking works

After you click Save and track, we carry this exact calculator state into the tracked-scenarios page so you can sign in and confirm the save.

We save your assumptions and the pricing snapshot used for this result.

When a newer pricing snapshot lands, we recompute the same scenario, show what changed, and email you if the latest result moved.

1 tracked scenario free, then $12/mo or $120/yr for up to 25 tracked scenarios.

Headline metric

Candidate refresh plan lowers fixed monthly cost

The candidate refresh plan changes fixed monthly embedding cost by -$8.00 and adds -$0.01 per active user at the current scale.

Baseline embedding cost / month

$12.00

Candidate embedding cost / month

$4.00

Amortized delta / user / month

-$0.01

Break-even delta / user / month

-$0.01

Totals

Embedding tokens / month

Baseline: 600,000,000
Candidate: 200,000,000
Delta: -400,000,000

Embedding cost / month

Baseline: $12.00
Candidate: $4.00
Delta: -$8.00

Embedding share of modeled cost

Baseline: 2.3%
Candidate: 0.8%
Delta: -1.5%

Amortized embedding cost / user / month

Baseline: $0.015
Candidate: $0.005
Delta: -$0.01

Metric	Baseline	Candidate	Delta
Embedding tokens / month	600,000,000	200,000,000	-400,000,000
Embedding cost / month	$12.00	$4.00	-$8.00
Embedding share of modeled cost	2.3%	0.8%	-1.5%
Amortized embedding cost / user / month	$0.015	$0.005	-$0.01

Component Breakdown

Generation

Baseline: $0.0435
Candidate: $0.0435
Delta: $0

Retrieval

Baseline: $0.0198
Candidate: $0.0198
Delta: $0

Reranking

Baseline: $0.96
Candidate: $0.96
Delta: $0

Embeddings Ingestion

Baseline: $0.015
Candidate: $0.005
Delta: -$0.01

Vector Db

Baseline: $0.0009
Candidate: $0.0009
Delta: $0

Cache

Baseline: $-0.3972
Candidate: $-0.3972
Delta: $0

Infra

Baseline: $0.021
Candidate: $0.021
Delta: $0

Component	Baseline	Candidate	Delta
Generation	$0.0435	$0.0435	$0
Retrieval	$0.0198	$0.0198	$0
Reranking	$0.96	$0.96	$0
Embeddings Ingestion	$0.015	$0.005	-$0.01
Vector Db	$0.0009	$0.0009	$0
Cache	$-0.3972	$-0.3972	$0
Infra	$0.021	$0.021	$0

Sensitivity Ranking

Variable	Delta cost %
Requests Per User Month	9.8%
Rerank Docs	9.0%
Cache Hit Rate	-6.0%
Output Tokens	0.3%
Monthly Active Users	-0.2%
Retrieved Chunks	0.2%
Tokens Per Chunk	0.2%
Input Tokens	0.1%
Vector Queries Per Request	0.0%

Assumptions and Units

CurrencyUSD
Token unittoken
Pricing snapshot2026-06-14
Selected model rowOpenAI / GPT-5 Mini
Fixed monthly termEmbedding refresh is modeled as a fixed monthly cost
AmortizationPer-user impact divides the fixed monthly refresh term by monthly active users

Recommended Next Step

If refresh cost is material, review infra and indexing constraints before widening your update cadence.

Refresh planning references

RAG Cost Components Explained How To Choose Chunk Size and Chunk Count

Compare infra providers

View Infra Recommendations

Sources and Snapshot

Active Pricing Row

Active pricing row

OpenAI / GPT-5 Mini

Input tokens$0.25 / 1M
Output tokens$2 / 1M

Shared retrieval defaults

Embedding input$0.02 / 1M
Rerank docs$1 / 1K

Snapshot date: 2026-06-14
Source links and update notes: Pricing Snapshot Reference

Continue Analysis

Switch tools

Read guides

Indexing Cost

How It Works

Formula

Assumptions and Units

Example Scenario

Step 1 Provider and Model

Step 2 Quick Mode

Optional Advanced assumptions

Scenario actions

Copy scenario URL

Save and track this scenario

Headline metric

Totals

Component Breakdown

Assumptions and Units

Recommended Next Step

Sources and Snapshot

Continue Analysis