Vendor quotes for SMB AI projects routinely miss 20 to 50% of the actual cost. The missing pieces aren't hidden in the sense of dishonest. They're missing because the quote was scoped to what the vendor is comfortable committing to, and the remaining cost falls on you.
This post is the line-item inventory of what's typically missing. So when you get a quote, you can ask the right questions and either get the missing items added or budget for them yourself. Either way, the project doesn't surprise you in month four.
The pricing-side companion to this is what it actually costs to build a custom AI agent. The ROI-side companion is the honest ROI of AI for small business. Read them together for the full budget picture.
The cost vendor quotes almost never include and the one owners forget to count.
The build phase: your team spends time on the project. Stakeholder interviews with whoever knows the workflow. Data exports for the agent to learn from. Integration reviews and security signoffs. Sandbox testing. Pilot review sessions. For a typical SMB AI build, this is 50 to 120 hours of internal time spread across 8 to 12 weeks.
Operating phase: the operating owner spends 5 to 10 hours a week in the first quarter watching outputs and tuning. After the first quarter, 1 to 2 hours a week ongoing. That's $4,000 to $10,000 per quarter at loaded rates.
The team adoption phase: the people whose workflows the agent touches spend time learning to use it, reporting issues, and changing how they work. Often 1 to 3 hours per person per week for the first month.
Total for a 6 to 12 month first AI build: $15,000 to $35,000 of internal time. Not on any invoice. Real money.
Vendor quotes typically use the cheapest model tier for the cost estimate. Production usage almost never stays on the cheapest tier.
Why: triage, classification, and basic extraction work fine on smaller models. Drafting, complex reasoning, and edge case handling need larger models. The actual production mix ends up being maybe 60% cheap model and 40% expensive model, weighted by call volume.
The cost impact: if the quoted operating cost was $80 per month assuming all calls hit GPT-4o-mini, real cost is usually $150 to $400 per month when the mix is honest. Higher for agents that do significant generation work.
Plus growth: usage grows as the agent absorbs more of the workflow. The agent that processes 200 tickets a week in month 1 is processing 500 by month 6. Costs scale linearly with volume.
Pattern: budget 2 to 4x the quoted operating cost for the first year. After the first year you'll know your actual mix and can plan more precisely.
The build that "works on 80% of cases" sounds great in a demo. The remaining 20% is where the project budget goes if it wasn't budgeted upfront.
Edge cases at typical SMB scale: the weird vendor invoice that nobody else's format matches. The customer ticket with the unusual product configuration. The document with handwritten annotations. The lead form filled in by someone in a different language. These all happen. None of them is a rare event in aggregate.
Properly handling edge cases takes one of three paths: build smart routing so the edge cases go to humans, build specialized handling for the most common edge case patterns, or accept lower accuracy on those cases.
Cost impact: edge case handling is usually 15 to 25% of the total build cost. Quotes that skip it produce agents that look good in pilot and fail in production.
I cover the architecture pattern for handling this in the custom AI agents for small business pillar. The short version: small model for the common case, larger model or human review for the uncertain cases.
Skipped by most quotes because the vendor doesn't think you need it. You do.
What it includes: a fixed eval dataset, a scoring rubric (human review, LLM-as-judge with calibration, or a hybrid), a tracked metric over time, alerts when the metric moves significantly, and the ability to re-run the eval on a candidate prompt or model.
Build cost: $5,000 to $12,000 for a reasonable eval harness.
Operating cost: minimal, maybe $20 to $50 per month for the storage.
Why it matters: without this, you can't tell when the agent drifts or breaks. The cost of finding out via production failures is usually 5 to 10x the cost of building the eval harness in the first place.
Push back hard if a vendor quote doesn't have an eval line item. Either ask them to add it or budget to build it yourself.
Every 12 to 18 months you'll want to swap the underlying model. Either because the vendor deprecated yours, or because a better cheaper model came out, or because your usage shifted in a way that makes a different model more appropriate.
The swap is not free. Different models have different prompt sensitivities. The prompts that worked on Claude 3.5 don't work the same way on Claude 4.7 [verify]. Some edge cases that were handled fine by the old model fail on the new one. Behavior shifts in subtle ways.
A typical model swap is 1 to 2 weeks of work to re-tune prompts, re-run eval, validate behavior on edge cases, and deploy the new version. At vendor rates, $5,000 to $15,000 per swap.
Budget for at least one swap in year two and one in year three. After that, the cadence depends on how active the model market is.
The clean APIs (HubSpot, Slack, Zendesk, Notion) integrate smoothly. The systems with weird APIs, undocumented behaviors, or no API at all are where integration cost balloons.
The categories of system that drive this: industry-specific vertical software with limited APIs. Internal databases without modern interfaces. Legacy on-premise tools. Custom-built systems your team wrote five years ago and nobody quite remembers how they work.
If your agent needs to integrate with one of these, expect the integration cost to be 1.5 to 3x the integration with a clean API would have been.
This is the category where I've watched vendors get genuinely surprised. They quote thinking the integration is standard, discover the weirdness during build, and either eat the overage (rare) or come back for more budget (common). Asking specifically "have you integrated with our specific systems before" in the kickoff is a high-value question.
Prompts in production agents change over time. New edge cases get added. Tone and behavior get tuned. Bugs get fixed.
You need a way to know which prompt is in production, when it changed, what changed, and how the change affected eval metrics. This is prompt management.
The vendor-quoted version is often "the prompts live in the codebase". That works in theory and breaks in practice when three people are changing prompts in parallel and nobody can roll back a regression.
The fix is a small prompt management system. Doesn't need to be fancy. A versioned git repo of prompts with a small UI for viewing and rolling back is enough. Build cost: $3,000 to $8,000. Operating cost: minimal.
Most vendor quotes don't include this. If you're going to have more than one agent or more than one prompt-tuning person, it's worth budgeting.
The agent ships. Your team has to learn to use it. This takes time and the time has a cost.
For an SMB at typical scale, expect 2 to 4 hours of training per team member, plus 1 to 2 hours per week of reduced productivity during the adoption period (the first 4 to 8 weeks).
For a team of 8 affected by the agent, that's roughly $4,000 to $10,000 of distributed productivity cost. Real, but distributed across the team so it's rarely felt acutely. Still, budget for it.
For SMBs in regulated industries (healthcare, finance, legal, education) or with security-sensitive customers, the agent will need a real security review. SOC 2 implications. HIPAA review if applicable. Customer-facing security questionnaires that now include AI questions.
Cost: $5,000 to $30,000 depending on scope. Often skipped from initial quotes because the vendor doesn't know the regulatory context.
If you're in a regulated category, ask the vendor to include security review in the quote or be prepared to handle it yourself. Either way it's real cost.
A vendor-quoted "$25,000 first AI agent" for an SMB typically ends up costing $40,000 to $55,000 across categories above when measured honestly. The breakdown looks like:
Original quote: $25,000.
Internal staff time across build and operating: $12,000 to $20,000.
API costs above quoted: $1,500 to $3,500 in year one.
Edge case handling work (if not in original scope): $3,000 to $7,000.
Eval harness (if not in scope): $5,000 to $10,000.
Total year-one honest cost: $46,500 to $65,500.
This isn't bad. It's just bigger than the quote. SMBs that budget at the honest level don't get surprised. SMBs that budget at the quoted level have a bad month 4 conversation.
The way to push back during quoting: ask the vendor to itemize. Get a quote that includes evaluation, integration with explicit edge case handling, and at least the first month of operating support. Then budget your internal time on top.
The vendors who can't or won't itemize this clearly are the same ones whose builds blow through budget. The ones who will and do are the ones who've done this before honestly.
RELATED READING
The cost to build a custom AI agent is the question SMB owners ask earliest and get the most varied answers on. I've seen quotes for "the same" agent range from $8,000 to $120,000 depending on who's…
The ROI of AI for small business is one of the most lied-about numbers in the current vendor pitch landscape. Every vendor deck has a chart showing 5x or 10x return in the first year. Most of them…
I've been on the receiving end of more failed AI projects than I'd like to admit. Some I ran. Some I inherited and tried to save. Some I was asked to do a post-mortem on after the original team had…
Most SMB AI pilots fail. The published failure rates I've seen put it at 60 to 80% [verify], which is roughly consistent with what I've watched in the field. But the failure rate isn't the…
The cheapest way to fix an AI project is before it starts. The most expensive way is after it ships. The honest version of "de-risk your first AI project" is a list of seven things to lock down…
Most small business owners I talk to are stuck in the same place with AI. They've watched ChatGPT do something impressive. They've signed up for two or three SaaS products with "AI" in the name.…
FREQUENTLY ASKED