What does required deflection mean here?

It is the share of tickets the AI must fully resolve without human handoff for blended cost to beat the current human baseline.

Can the threshold be impossible?

Yes. If fully deflected tickets still cost more than the human baseline, or every escalated ticket is already cheaper than human handling, the threshold state changes accordingly.

Support Deflection Threshold Simulator

Pricing snapshot: 2026-05-03Provider: OpenAIModel: GPT-5.2

Step 1 Provider and Model

ProviderModel

Step 2 Support Scenario

Use-case preset

Mid-volume SaaS support queue with moderate retrieval depth and tighter per-ticket savings headroom.

Monthly ticketsHuman cost / ticket (USD)Human cost / escalated ticket (USD)AI requests before handoffDeflection rate (0 to 1)

Need help sizing assumptions? Read the support cost guide and the deflection-rate guide.

Step 3 AI Workload Assumptions

AI requests / resolved ticketBase prompt tokens / requestResponse tokens / requestCache hit rate (0 to 0.99)

Show advanced inputs

Open these only when retrieval, reranking, vector, or infra detail changes the support decision.

Retrieved chunks / requestTokens per chunkRerank docs / requestEmbedding ingestion tokens / monthVector queries / requestVector cost / query (USD)Infra cost / request (USD)

Scenario actions

Copy scenario URL

Paste into ChatGPT or Claude, or share with a teammate.

Save and track this scenario

Track pricing drift on this scenario and get an email if the latest result changes.

How tracking works

After you click Save and track, we carry this exact calculator state into the tracked-scenarios page so you can sign in and confirm the save.

We save your assumptions and the pricing snapshot used for this result.

When a newer pricing snapshot lands, we recompute the same scenario, show what changed, and email you if the latest result moved.

1 tracked scenario free, then $12/mo or $120/yr for up to 25 tracked scenarios.

Decision Signal

Above threshold

Modeled support deflection needs to reach 42.8% to beat the $2.4 human baseline.

Your current deflection assumption is +11.2% above the modeled threshold.

Required deflection

42.8%

Assumed deflection

54.0%

Gap vs threshold

+11.2%

Blended cost / ticket

$1.9843

Most Sensitive Threshold Inputs

Human cost / ticket-6.5%

Human cost / escalated ticket+5.5%

Rerank Docs+0.3%

Totals

Required deflection rate

42.8%

Assumed deflection rate

54.0%

Gap vs threshold

+11.2%

AI cost / deflected ticket

$0.2806

All-escalated cost / ticket

$3.9842

Blended cost / ticket

$1.9843

Monthly savings vs human

+$914.59

Required deflection rate	42.8%
Assumed deflection rate	54.0%
Gap vs threshold	+11.2%
AI cost / deflected ticket	$0.2806
All-escalated cost / ticket	$3.9842
Blended cost / ticket	$1.9843
Monthly savings vs human	+$914.59

Blended Cost Breakdown (USD/ticket)

Largest cost block: Human escalation.

Cache

$-0.031

Embeddings Ingestion

Generation

$0.0365

Infra

$0.0057

Reranking

$0.1661

Retrieval

$0.0128

Vector Db

$0.0001

Human escalation

$1.794

Cache	$-0.031
Embeddings Ingestion	$0
Generation	$0.0365
Infra	$0.0057
Reranking	$0.1661
Retrieval	$0.0128
Vector Db	$0.0001
Human escalation	$1.794

Sensitivity Ranking

Variable	Delta required deflection (pts)
Human cost / ticket	-6.5%
Human cost / escalated ticket	+5.5%
Rerank Docs	+0.3%
AI requests / ticket	+0.3%
AI requests before handoff	+0.1%
Cache Hit Rate	-0.1%
Output Tokens	+0.0%
Input Tokens	+0.0%
Retrieved Chunks	+0.0%
Tokens Per Chunk	+0.0%
Infra Cost Per Request	+0.0%
Vector Queries Per Request	+0.0%
Vector Cost Per Query	+0.0%
Embedding Ingestion Tokens Monthly	+0.0%
Monthly tickets	-0.0%

Assumptions and Units

CurrencyUSD
Unit of analysisResolved support ticket
Pricing snapshot2026-05-03
Selected model rowOpenAI/GPT-5.2
Deflection definitionShare of tickets fully resolved without human handoff
Escalation definitionEscalated tickets add human handling cost only after the AI first pass
Threshold ruleRequired deflection is bounded from 0 to 100%

Recommended Next Step

If the threshold looks realistic, open break-even pricing to see whether the workflow still clears target margin, then translate the threshold into a blended support cost.

Commercial next step

Break-even Price AI + Human Ticket Cost

Sources and Snapshot

Active Pricing Row

Selected model

OpenAI / GPT-5.2

Input tokens$1.75 / 1M
Output tokens$14 / 1M

Shared retrieval defaults

Embedding input$0.02 / 1M
Rerank docs$1 / 1K

Snapshot date: 2026-05-03
Source links and update notes: Pricing Snapshot Reference

Continue Analysis

Switch tools

Read guides

AI Payoff Point

How It Works

Formula

Assumptions and Units

Step 1 Provider and Model

Step 2 Support Scenario

Step 3 AI Workload Assumptions

Scenario actions

Copy scenario URL

Save and track this scenario

Decision Signal

Most Sensitive Threshold Inputs

Totals

Blended Cost Breakdown (USD/ticket)

Assumptions and Units

Recommended Next Step

Sources and Snapshot

Continue Analysis