AI Workflow Cost Calculator

Model the cost of chat, agent, copilot, support, or retrieval workflows. Retrieval, reranking, cache, and infra are optional cost layers.

How this tool works

This tool models the main cost blocks in AI workflows: generation, retrieval, reranking, embeddings ingestion, vector lookups, cache savings, and infra overhead with a daily pricing snapshot. If your workflow is chat-first rather than retrieval-heavy, set retrieval and reranking layers to zero.

Calculation Steps

  1. Choose provider/model and set request plus token behavior assumptions.
  2. Compute each cost component independently, then sum to total cost per user/month.
  3. Derive break-even price and gross margin at your current price point.

Formulas

embedding_ingestion_share = embedding_ingestion_monthly / monthly_active_users

cost_per_user_month = generation + retrieval + reranking + embedding_ingestion_share + vector_db + cache + infra

gross_margin_pct = (price_per_user_month - cost_per_user_month) / price_per_user_month * 100
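These formulas can be sketched directly in Python (function and variable names are illustrative, not the tool's internals):

```python
def cost_per_user_month(generation, retrieval, reranking,
                        embedding_ingestion_monthly, monthly_active_users,
                        vector_db, cache, infra):
    # Fixed monthly embedding refresh is amortized across active users.
    embedding_ingestion_share = embedding_ingestion_monthly / monthly_active_users
    # Cache is a negative term when hits reduce total cost.
    return (generation + retrieval + reranking + embedding_ingestion_share
            + vector_db + cache + infra)

def gross_margin_pct(price_per_user_month, cost_per_user_month):
    return (price_per_user_month - cost_per_user_month) / price_per_user_month * 100
```

For example, the sample component values later on this page (`0.052 + 0.0144 + 2.4 + 0.0006 + 0.02`) sum to about $2.487/user/month, which at a $29 seat price yields roughly a 91.4% gross margin.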

Assumptions and Units

  • Currency: USD
  • Token unit: token
  • Cache component is negative when savings reduce total cost
  • Embedding refresh is amortized across active users before it appears in per-user totals
  • Pricing source: daily pricing snapshot in repo, no runtime scraping

Example Scenario

For a chat agent, internal copilot, docs assistant, or support workflow, compare your current seat price against calculated break-even and sensitivity to the top cost drivers.

FAQs

Can I share scenarios? Yes. Use the share URL to preserve assumptions, preset choice, and snapshot context.

What should I optimize first? Start with the highest sensitivity drivers before cutting model quality or user experience, especially if retrieval and reranking are optional for your workflow.

Can I use this for LLM pricing decisions? Yes. It is designed for deterministic LLM workflow cost and margin decisions with explicit assumptions.

Related resources: AI Break-even Price Calculator, LLM Model Cost Comparison, RAG Retrieval Cost Calculator, Reranking Cost Calculator, Cache Savings Simulator, Context Window Cost Calculator, RAG vs Long-Context Calculator, Embedding Ingestion Cost Calculator, How Much Does an AI Agent Cost?, What Is an AI Agent?, What Is RAG?, RAG Cost Components Explained, How Many Tokens Per Request?, What Cache Hit Rate Means for RAG.

Pricing snapshot: 2026-03-15. Provider: OpenAI. Model: GPT-5 Mini.

Decision Signal

Baseline

Current gross margin under this sample scenario is 91.4%.

This sample assumes 500 active users, 80 requests/user/month, 600 prompt tokens, 250 output tokens, GPT-5 Mini, $29/user/month pricing, and optional retrieval layers still turned on.

High margin here is driven partly by the sample pricing assumption. Try your own price to validate realism.
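Under those sample assumptions, the headline numbers can be reproduced in a few lines (a sketch using the rounded $2.487 per-user cost, so totals may differ from the page's unrounded figures by a cent or two):

```python
users = 500                 # monthly active users in the sample
price = 29.00               # USD per user per month
cost_per_user = 2.487       # headline modeled cost per user per month

monthly_cost = users * cost_per_user
monthly_revenue = users * price
gross_profit = monthly_revenue - monthly_cost
margin_pct = (price - cost_per_user) / price * 100   # ~91.4%
```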

Step 1 Provider and Model

Switch model assumptions using prices from the selected snapshot.

Step 2 Quick Mode

Use plain-language assumptions first. Open Advanced assumptions only if needed.
Starting point: apply the generic baseline or a conservative stress test. If neither is selected, you are working from custom inputs.

Baseline starts from the generic sample scenario before workflow-specific presets.

Quick workflow: start with the closest workflow shape, then fine-tune the assumptions below.

Step 3 Advanced Assumptions

Tune retrieval, reranking, embeddings, vector, caching, and infra.

Only adjust these once your Quick Mode assumptions feel realistic.

Scenario Share URL

Share this link to load these exact assumptions. Pricing uses the published snapshot shown on the page.

Paste into ChatGPT or Claude to discuss this scenario.

Cost / user / month

$2.487

Gross margin

91.4%

Estimated monthly AI cost

$1,243.52

Estimated monthly gross profit

$13,256.48

Top Cost Drivers

Most sensitive variables when each is moved up by 10%.
  • Requests Per User Month: +10.0%
  • Rerank Docs: +9.7%
  • Output Tokens: +0.2%

Totals

Summary metrics for monthly unit economics and margin.

Cost per request

$0.03109

Cost per user/month

$2.487

Gross margin

91.4%

Break-even price

$2.487

Component Breakdown (USD/user/month)

Each cost component is computed independently and summed.

Largest cost block: reranking, not generation.

Generation

$0.052

Retrieval

$0.0144

Reranking

$2.4

Embeddings Ingestion

$0

Vector Db

$0.0006

Cache

$-0

Infra

$0.02
GenerationModel input/output token spend for requests.$0.052
RetrievalExtra model input spend from retrieved context chunks.$0.0144
RerankingReranker cost based on docs scored per request.$2.4
Embeddings IngestionAmortized per-user share of the fixed monthly corpus embedding refresh cost.$0
Vector DbVector database query cost across all requests.$0.0006
CacheSavings from cache hits. Negative means lower total cost.$-0
InfraNon-model infra overhead per request.$0.02
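As a sanity check, the rounded component values above sum back to the headline per-user cost (a sketch; the page's own totals use unrounded components):

```python
components = {
    "generation": 0.052,
    "retrieval": 0.0144,
    "reranking": 2.40,
    "embeddings_ingestion": 0.0,
    "vector_db": 0.0006,
    "cache": -0.0,          # negative when cache hits produce savings
    "infra": 0.02,
}

cost_per_user = sum(components.values())   # ~ $2.487 / user / month
cost_per_request = cost_per_user / 80      # sample: 80 requests / user / month
```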
Sensitivity Ranking

Change in total cost when each variable is increased by 10%.

  • Requests Per User Month (user activity level per month): +10.0%
  • Rerank Docs (docs reranked per request): +9.7%
  • Output Tokens (generated tokens per request): +0.2%
  • Retrieved Chunks (retrieved chunk count per request): +0.1%
  • Tokens Per Chunk (average chunk size in tokens): +0.1%
  • Input Tokens (prompt-side tokens per request): +0.0%
  • Vector Queries Per Request (vector query count per request): +0.0%
  • Monthly Active Users (active-user estimate used to amortize the fixed monthly embedding refresh): -0.0%
  • Cache Hit Rate (fraction of requests served by cache): 0.0%
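A minimal sketch of this sensitivity pass, assuming illustrative per-request inputs (30 rerank docs, 4 chunks of 180 tokens, and a flat per-request infra overhead) that approximately reproduce the ranking above:

```python
def total_cost(v):
    """Simplified per-user monthly cost from a dict of assumptions.
    Unit prices match the snapshot; infra is an assumed flat overhead."""
    in_price, out_price = 0.25 / 1e6, 2.0 / 1e6    # USD per token
    rerank_price = 1.0 / 1e3                       # USD per reranked doc
    req = v["requests_per_user_month"]
    generation = req * (v["input_tokens"] * in_price + v["output_tokens"] * out_price)
    retrieval = req * v["retrieved_chunks"] * v["tokens_per_chunk"] * in_price
    reranking = req * v["rerank_docs"] * rerank_price
    infra = req * 0.00025                          # assumed $0.00025 / request
    return generation + retrieval + reranking + infra

base = {"requests_per_user_month": 80, "input_tokens": 600, "output_tokens": 250,
        "retrieved_chunks": 4, "tokens_per_chunk": 180, "rerank_docs": 30}

baseline = total_cost(base)
for name in base:
    bumped = dict(base, **{name: base[name] * 1.10})  # move one variable +10%
    delta_pct = (total_cost(bumped) - baseline) / baseline * 100
    print(f"{name}: {delta_pct:+.1f}%")
```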

Assumptions and Units

Explicit assumptions to keep outputs reproducible and auditable.
  • Currency: USD
  • Token unit: token
  • Pricing snapshot: 2026-03-15
  • Selected model row: OpenAI/GPT-5 Mini
  • Monthly users: 500 active users for business totals and fixed-term amortization
  • Embedding refresh: amortized per user from the fixed monthly corpus refresh term
  • Cache component: negative value means cost savings

Recommended Next Step

Use these links to lower top cost drivers without guessing.

Optimize the biggest modeled cost driver first. Compare infra only after model, retrieval, reranking, or context changes stop being the better lever.

Sources and Snapshot

Pricing comes from a daily snapshot generated by batch workflows.

Active Pricing Row

Selected model

OpenAI / GPT-5 Mini

  • Input tokens: $0.25 / 1M
  • Output tokens: $2.00 / 1M

Shared retrieval defaults

  • Embedding input: $0.02 / 1M
  • Rerank docs: $1.00 / 1K
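Under the sample scenario's 80 requests/user/month, these unit prices turn into the component costs shown earlier; note that the ~30 docs/request figure below is an inferred assumption that reproduces the $2.40 reranking line, not a value the page states:

```python
# Token prices from the active pricing row (USD per 1M tokens)
input_price, output_price = 0.25, 2.00
requests = 80                       # sample: requests per user per month

# Generation: 600 prompt + 250 output tokens per request
generation = requests * (600 * input_price + 250 * output_price) / 1_000_000

# Reranking: shared default of $1 per 1K docs; ~30 docs/request is assumed
rerank = requests * 30 * 1.00 / 1_000
```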