Support Escalation Blend Cost Model

Estimate blended AI plus human support cost per ticket from deflection and escalated-ticket cost assumptions.

How this tool works

This cost model weights fully deflected tickets against tickets that still escalate after an AI first pass, then reports the blended cost per resolved ticket and monthly impact.

How It Works

  1. Set ticket volume, human baseline cost, and escalated-ticket handling cost.
  2. Set AI cost for deflected tickets and AI requests before handoff on escalations.
  3. Review blended ticket cost, monthly savings, and the cost share still driven by human escalation.

Formula

blended_cost = deflection_rate * ai_deflected_cost + (1 - deflection_rate) * (ai_before_handoff + human_escalation_cost)
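
The formula can be sketched in a few lines of Python. This is a minimal illustration, not the calculator's actual code. The deflection rate, AI first-pass cost, and human escalation cost are the scenario values reported further down this page. The AI cost per deflected ticket is not displayed on the page, so the value below is an assumption back-solved from the reported $1.9843 blended cost.

```python
def blended_cost(deflection_rate, ai_deflected_cost,
                 ai_before_handoff, human_escalation_cost):
    """Blended cost per resolved ticket, in USD."""
    if not 0.0 <= deflection_rate <= 1.0:
        raise ValueError("deflection_rate must be between 0 and 1")
    # Escalated tickets pay the AI first pass plus human handling.
    escalated_cost = ai_before_handoff + human_escalation_cost
    return (deflection_rate * ai_deflected_cost
            + (1.0 - deflection_rate) * escalated_cost)


# Scenario values reported on this page.
DEFLECTION_RATE = 0.54
AI_BEFORE_HANDOFF = 0.0842
HUMAN_ESCALATION_COST = 3.90
# Assumption: back-solved from the reported blended cost, not shown on the page.
AI_DEFLECTED_COST = 0.2807

cost = blended_cost(DEFLECTION_RATE, AI_DEFLECTED_COST,
                    AI_BEFORE_HANDOFF, HUMAN_ESCALATION_COST)
print(f"Blended cost / ticket: ${cost:.4f}")  # prints $1.9843
```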

Assumptions and Units

  • Currency: USD
  • Unit of analysis: resolved support ticket
  • Deflection rate is the share of tickets fully resolved without human handoff
  • Escalated tickets still include AI first-pass cost before the human handling term is added

Can I save or share scenarios?

Yes. Save/share URLs preserve the blended escalation assumptions so you can reopen the same scenario later.

Related resources: AI Support Bot Cost per Ticket Calculator, Support Deflection Threshold Simulator, AI Support Agent Cost per Ticket, How To Price an AI Agent, What Cache Hit Rate Means for RAG, RAG Cost Components Explained.

Pricing snapshot: 2026-03-19. Provider: OpenAI. Model: GPT-5.2.

Step 1 Provider and Model

Switch model assumptions using prices from the selected snapshot.

Step 2 Support Scenario

Set ticket volume, human baseline, and escalation assumptions before tuning the AI workload.

Mid-volume SaaS support queue with moderate retrieval depth and tighter per-ticket savings headroom.

Step 3 AI Workload Assumptions

Set the support-turn assumptions first, then open advanced retrieval and infra fields if needed.

Open these only when retrieval, reranking, vector, or infra detail changes the support decision.

Save or Share Scenario URL

Copy this link to reopen or share these exact assumptions later. Pricing uses the published snapshot shown on the page.

Save it for later or paste it into ChatGPT or Claude to discuss this scenario.

Decision Signal

Near human

At 54.0% deflection, blended support cost lands at $1.9843 per ticket versus $2.4 for the current human baseline.

That leaves $0.4157 of cost headroom per ticket, or $914.59 per month at the current ticket volume.
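
As a rough check on the figures above, the per-ticket gap and monthly totals follow from three inputs: monthly ticket volume, human cost per ticket, and blended cost per ticket. The sketch below uses the rounded values displayed on this page, and the function name is illustrative only. Because the displayed blended cost is rounded to four decimals, the monthly results can differ from the page's figures by a few cents.

```python
MONTHLY_TICKETS = 2200
HUMAN_COST_PER_TICKET = 2.40
BLENDED_COST_PER_TICKET = 1.9843  # rounded value as displayed on the page


def monthly_summary(tickets, human_cost, blended_cost):
    """Return the per-ticket gap and monthly totals in USD."""
    gap = human_cost - blended_cost
    return {
        "cost_gap_per_ticket": gap,
        "monthly_blended_cost": tickets * blended_cost,
        "monthly_human_cost": tickets * human_cost,
        "monthly_savings": tickets * gap,
    }


summary = monthly_summary(MONTHLY_TICKETS, HUMAN_COST_PER_TICKET,
                          BLENDED_COST_PER_TICKET)
for name, value in summary.items():
    print(f"{name}: {value:.4f}")
```

With these rounded inputs the monthly savings come out near $914.54, against the $914.59 the page reports from its unrounded blended cost.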

  • Blended cost / ticket: $1.9843
  • Cost gap / ticket: +$0.4157
  • Monthly blended cost: $4,365.41
  • Monthly savings vs human: +$914.59

Most Sensitive Inputs

Most sensitive blended-cost inputs when each is moved up by 10%.
  • Deflection rate: -$0.2
  • Human cost / escalated ticket: +$0.1794
  • AI requests / ticket: +$0.0152

Totals

Summary metrics for the current support scenario.
  • Blended cost / ticket: $1.9843
  • Human cost / ticket: $2.4
  • Cost gap / ticket: +$0.4157
  • Monthly blended cost: $4,365.41
  • Monthly human cost: $5,280
  • Monthly savings vs human: +$914.59
  • Human share of blended cost: 90.4%
  • Escalation rate: 46.0%

Blended Cost Breakdown (USD/ticket)

Each cost component is computed independently, then summed.

Largest cost block: Human escalation.

  • Cache: -$0.031. Savings from repeated support requests served by cache. Negative means lower total cost.
  • Embeddings Ingestion: $0. Amortized per-ticket share of the fixed monthly support corpus embedding refresh.
  • Generation: $0.0365. Model input/output token spend for support turns.
  • Infra: $0.0057. Non-model infra overhead per support request.
  • Reranking: $0.1661. Reranker cost based on docs rescored per support request.
  • Retrieval: $0.0128. Extra model input spend from retrieved support context.
  • Vector DB: $0.0001. Vector database query cost across support requests.
  • Human escalation: $1.794. Human handling cost applied only to the escalated share of tickets.
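
Since the components are computed independently and then summed, the breakdown can be checked with a short sketch. The dictionary keys below are illustrative names, and the values are the rounded per-ticket figures listed above. Their sum lands within a rounding step of the reported $1.9843 blended cost.

```python
# Per-ticket cost components in USD, as displayed in the breakdown.
components = {
    "cache": -0.031,
    "embeddings_ingestion": 0.0,
    "generation": 0.0365,
    "infra": 0.0057,
    "reranking": 0.1661,
    "retrieval": 0.0128,
    "vector_db": 0.0001,
    "human_escalation": 1.794,
}

total = sum(components.values())
human_share = components["human_escalation"] / total
largest = max(components, key=components.get)

print(f"Sum of components: ${total:.4f}")
print(f"Human share of blended cost: {human_share:.1%}")
print(f"Largest cost block: {largest}")
```

The human escalation term dominates, which is consistent with the 90.4% human share reported in the totals.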

Sensitivity Ranking

Change in blended cost per ticket when one variable is increased by 10%.
  • Deflection rate (share of tickets fully resolved without human handoff): -$0.2
  • Human cost / escalated ticket (human handling cost on escalated tickets only): +$0.1794
  • AI requests / ticket (average AI turns needed to fully resolve a ticket): +$0.0152
  • Rerank Docs (docs reranked per support request): +$0.0143
  • AI requests before handoff (AI turns spent before a ticket escalates to a human): +$0.0039
  • Cache Hit Rate (fraction of support requests served from cache): -$0.0031
  • Output Tokens (generated tokens per support response): +$0.002
  • Input Tokens (base prompt tokens per support request): +$0.0011
  • Retrieved Chunks (retrieved chunk count per support request): +$0.0011
  • Tokens Per Chunk (average chunk size in retrieved support context): +$0.0011
  • Infra Cost Per Request: +$0.0005
  • Vector Queries Per Request (vector lookup count per support request): +$0
  • Vector Cost Per Query: +$0
  • Embedding Ingestion Tokens Monthly: +$0
  • Monthly tickets (ticket volume used for monthly totals and fixed-term amortization): -$0
  • Human cost / ticket (current fully loaded human handling cost per resolved ticket): $0
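
A one-at-a-time sensitivity pass like the ranking above can be sketched as follows: raise each input by 10%, recompute the blended cost, and sort by the size of the change. This is an illustration under stated assumptions, not the tool's implementation. It covers only the four inputs of the top-level formula, the parameter names are hypothetical, and the AI cost per deflected ticket (0.2807) is back-solved from the reported blended cost rather than shown on the page. It also assumes AI cost scales linearly with the number of AI requests, so a 10% bump in an AI cost term stands in for a 10% bump in the matching request count.

```python
def blended_cost(p):
    """Blended cost per resolved ticket (USD) from a parameter dict."""
    escalated = p["ai_before_handoff"] + p["human_escalation_cost"]
    return (p["deflection_rate"] * p["ai_deflected_cost"]
            + (1.0 - p["deflection_rate"]) * escalated)


def sensitivity(params, bump=0.10):
    """Change in blended cost when each input alone is raised by `bump`."""
    base = blended_cost(params)
    deltas = {}
    for name, value in params.items():
        bumped = dict(params)          # copy so other inputs stay fixed
        bumped[name] = value * (1.0 + bump)
        deltas[name] = blended_cost(bumped) - base
    # Rank by absolute impact, largest first.
    return sorted(deltas.items(), key=lambda kv: abs(kv[1]), reverse=True)


scenario = {
    "deflection_rate": 0.54,
    "ai_deflected_cost": 0.2807,   # assumption: back-solved, not displayed
    "ai_before_handoff": 0.0842,
    "human_escalation_cost": 3.90,
}

ranking = sensitivity(scenario)
for name, delta in ranking:
    print(f"{name}: {delta:+.4f}")
```

Under these assumptions the deflection rate and human escalation cost rank first and second, in line with the first two rows of the ranking.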

Assumptions and Units

Explicit assumptions keep the support model reproducible and auditable.
  • Currency: USD
  • Unit of analysis: Resolved support ticket
  • Pricing snapshot: 2026-03-19
  • Selected model row: OpenAI/GPT-5.2
  • Monthly tickets: 2,200 tickets for monthly totals and fixed-term amortization
  • Embedding refresh: Amortized across monthly tickets as a fixed support corpus refresh term
  • Deflection mix: 54.0% deflected / 46.0% escalated
  • Escalated ticket cost: $3.9 human handling plus $0.0842 of AI first-pass cost
  • Human escalation share: 90.4% of current blended ticket cost

Recommended Next Step

Use these links to move from modeled support economics into the next pricing or implementation check.

Once the blended escalation path looks viable, verify pricing headroom and operating constraints before you scale the support bot further.

Sources and Snapshot

Pricing comes from a daily snapshot generated by batch workflows.

Active Pricing Row

Selected model

OpenAI / GPT-5.2

  • Input tokens: $1.75 / 1M
  • Output tokens: $14 / 1M

Shared retrieval defaults

  • Embedding input: $0.02 / 1M
  • Rerank docs: $1 / 1K
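
To show how these unit prices turn into per-request costs, here is a small sketch. The prices are the snapshot rows listed above. The token and document counts in the usage lines are hypothetical placeholders, not the scenario's actual workload inputs, and the function names are illustrative.

```python
# Unit prices from the pricing snapshot on this page (USD).
INPUT_PRICE_PER_M_TOKENS = 1.75
OUTPUT_PRICE_PER_M_TOKENS = 14.0
RERANK_PRICE_PER_K_DOCS = 1.0


def generation_cost(input_tokens, output_tokens):
    """Model token spend for one support request, in USD."""
    return (input_tokens * INPUT_PRICE_PER_M_TOKENS / 1_000_000
            + output_tokens * OUTPUT_PRICE_PER_M_TOKENS / 1_000_000)


def rerank_cost(docs_reranked):
    """Reranker spend for one support request, in USD."""
    return docs_reranked * RERANK_PRICE_PER_K_DOCS / 1_000


# Hypothetical workload: 1,000 input tokens, 300 output tokens, 20 docs reranked.
gen = generation_cost(1_000, 300)
rerank = rerank_cost(20)
print(f"Generation: ${gen:.5f}  Reranking: ${rerank:.4f}")
```

At these prices a reranked document costs far more than a prompt token, which is consistent with reranking being the largest AI component in the breakdown.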