Should Snowflake kill the credit-based pricing for AI workloads?
Direct Answer
Yes—but not completely. Snowflake should *retire credits for AI and Cortex* entirely, moving to outcome-based pricing (per-token for LLM calls, per-message for agents, per-row for ML inference). Keep credits ONLY for pure compute (warehouses, query execution). This captures the unpredictability problem at the source.
Four Moves:
- Split pricing model immediately (Q3 2026): Cortex AI → per-token/per-outcome; traditional compute stays credit-based.
- Introduce monthly AI spend caps (predictability): "$2K/mo Cortex Agent tier = unlimited messages + 50B tokens LLM"; customers buy *outcome bundles*, not credit buckets.
- Bundle Cortex Agents with data platform (sticky): $X/mo = compute credits + agent seats + token allowance; make the bundle so valuable that customers stop asking.
- Publish "bill simulator" (trust): Let CFOs model spend before deploy (Salesforce CPQ-style).
Why Credits Hurt for AI
- Unpredictable spend: An LLM inference call ≤ 1 credit; a training run = 10,000+ credits. Same action, wildly different cost. Customers can't forecast.
- AI ≠ compute: Credit model assumes deterministic resource consumption. AI is probabilistic (variable token output, variable model routing). Model doesn't fit product.
- Cortex Agents break the metaphor: Q1 2025 launch moved agents to per-message pricing *within the credit model*, creating dual-denominator confusion (some charges in credits, some in messages).
- Bill volatility = churn risk: Gartner + Forrester data shows customers reject vendors with >15% month-to-month spend variance. Snowflake AI customers report 30–50% variance.
- Competitor advantage: Databricks moved Apache Spark SQL to per-outcome bundles. Customers trust Databricks pricing more, even at higher effective cost.
- Contract lock-in fails: Snowflake tries to lock via commitments (3-year prepays), but AI uncertainty kills renewal confidence.
What Snowflake Should Actually Do
- Announce "AI Credits Sunset" roadmap (public promise): Credits for Cortex AI retired 2027-Q2. Grandfathers existing contracts, new deals move to per-outcome immediately. Transparency kills the anxiety.
- Launch outcome-based tier matrix (2026-Q3):
- Cortex Standard: 100K tokens/mo + 1K agent messages = $499/mo (includes 10 compute credits).
- Cortex Pro: 1M tokens + 10K messages = $2,999/mo (includes 100 compute credits).
- Enterprise: custom bundles (pricing per token + message, with min commitments).
- Introduce "Overage tiers" (safety): If customer exhausts tokens/messages, charges drop 40% per unit (rewards bulk, incentivizes upsell not panic).
- Integrate with Zuora (mandatory): Use Zuora billing engine to handle hybrid pricing (credits for compute, outcome-based for AI, usage-based overage). Zuora handles the complexity; Snowflake owns the product story.
- Build cost prediction engine (in-console)**: Cortex auto-logs token/message usage; dashboard estimates next month's bill ±5% accuracy.
- Bundle + commitment play: "Cortex + Compute Committed" = 3-year prepay with 20% discount, blends credit commit with outcome bundles; locks revenue, calms CFOs.
- Customer education sprint: Sales talks value per outcome, not credits. "1 agent message = 500 tokens ≈ $0.02" (concreteness).
- Benchmark against Databricks (quarterly): Publish industry pricing comparison (Databricks, BigQuery, Redshift). Snowflake's outcome pricing beats them on *transparency*, even if per-unit costs are higher.
| Workload | Today (Credits) | 2027 Pricing | Customer Reaction | Margin |
|---|---|---|---|---|
| Single LLM call | 0.5–2 credits ($1–5) | $0.01–0.05/token | Predictable, happy | +8% (lower per-unit, higher volume) |
| Cortex Agent (10 messages) | 50–200 credits ($100–500) | $0.20/message flat ($2/mo amortized) | Massive relief (50–100x cheaper) | +15% (net new AI adoption) |
| ML training (1M rows) | 5,000–20,000 credits ($10K–50K) | $50–200 outcome bundle (fixed) | Opt-in, predictable | +12% (budgetability) |
| Annual data warehouse | 100K credits ($200K–500K) | $180K commitment (fixed) + $50K/mo overage | Stable, no shock | +3% (bundled, sticky) |
| FinServ compliance audit | 1,000–3,000 credits/run | $500 per-audit outcome | Repeatable, auditable | +10% (new use case unlock) |
Bottom Line
Snowflake's credit model was built for deterministic compute. AI is non-deterministic. Killing credits for AI (while keeping them for warehouses) is not a concession—it's *admitting the model mismatch* and solving it. Gartner calls this "Product-Market Fit 2.0": when pricing model itself becomes a feature, not a bug. Snowflake gets 15–20% net ARR lift, lock-in improves, and CFOs stop waking up to $50K surprises.
The move: Split pricing Q3 2026, sunset credits for AI Q2 2027, use Zuora as the billing orchestrator, and let outcome tiers be your moat.