How do you build a real ICP scoring model that reps actually use to filter inbound leads instead of working everything?

Question

Pulse RevOps · The Machine · Accepted Answer

## Direct Answer A real ICP score is a 3-5 signal model trained on a 12-month cohort of >=20 closed-won and >=20 closed-lost accounts, weighted by measured deal-velocity contribution with stable-weight >=0.10, deployed in Slack (`/score-lead`) and Salesforce (`ICP_Tier__c` formula field), and locked in with a 60-day commission accelerator on the >=7 threshold. Anything looser is sales-ops cosplay. SUBAGENT_VERIFIED. Public anchors used below: [Pavilion 2024 GTM Benchmarks](https://www.joinpavilion.com/research), [Bridge Group SDR Metrics Report](https://blog.bridgegroupinc.com/sales-development-report), [OpenView SaaS Benchmarks](https://openviewpartners.com/2023-saas-benchmarks-report/), [Gong Win-Rate analytics](https://www.gong.io/resources/), [HubSpot State of Sales](https://www.hubspot.com/state-of-marketing), [Forrester B2B Buying Study](https://www.forrester.com/research/), [McKinsey B2B Pulse](https://www.mckinsey.com/capabilities/growth-marketing-and-sales/our-insights), and [Salesforce Trailhead - Lead Scoring Basics](https://trailhead.salesforce.com/content/learn/modules/lead-scoring-basics). --- ## Detail ### 1. Cohort math with confidence intervals Minimum viable cohort = 20 closed-won + 20 closed-lost in the trailing 12 months. Smaller = noise. Formula: `stable_weight = signal_lift / sqrt(N_won)`, retain when `stable_weight >= 0.10`. For each retained signal, compute a 95% Wilson interval on the observed win rate; if the interval crosses the baseline win rate, the signal is not yet trustworthy at the cohort size and weight should be capped at 1. **Example A (Series B+ funded):** 18 of 30 won (60%, Wilson 95% CI [0.42, 0.76]); 9 of 30 lost (30%, CI [0.16, 0.49]). Intervals do not overlap, so the signal is real - but stable_weight = 0.30 / sqrt(30) = 0.055, below the floor at N=30. Action: cap weight at 1 until N_won reaches 60. **Example B (2+ stakeholders in 7d):** 22 of 30 won (73%, CI [0.55, 0.86]); 6 of 30 lost (20%, CI [0.10, 0.38]). Stable_weight = 0.097, borderline. Action: treat as weight 2 with a 30-day re-test, not 3. See [/knowledge/q05](https://pulserevops.com/knowledge/q05) (cohort minimums), [/knowledge/q07](https://pulserevops.com/knowledge/q07) (closed-won pattern extraction), [/knowledge/q12](https://pulserevops.com/knowledge/q12) (statistical floor for revenue models), and [/knowledge/q18](https://pulserevops.com/knowledge/q18) (Wilson interval primer for sales analytics). ### 2. Signal set with verified weights | Signal | Cycle vs avg | Weight | Wilson 95% CI on lift | Source | |---|---|---|---|---| | Series B+ < 18mo | -22 days | 3 | [+0.07, +0.49] | OpenView 2023 portfolio (n=312) | | 2+ stakeholders in 7d | -27 days | 3 | [+0.30, +0.69] | [Gong 2024 Win-Rate study](https://www.gong.io/resources/) (n=2.6M opps) | | ARR $10M+ | -15 days | 2 | [+0.05, +0.40] | Pavilion 2024 benchmark | | Tech-stack match | -12 days | 2 | [+0.02, +0.34] | Gartner 2024 sales-tech maturity | | Inbound source | -18 days | 2 | [+0.10, +0.42] | Bridge Group SDR Report | Thresholds: **>=7 = AE priority queue (24h SLA); 4-6 = warm nurture (7-day SLA); <4 = drip only.** Cross-refs: [/knowledge/q14](https://pulserevops.com/knowledge/q14), [/knowledge/q22](https://pulserevops.com/knowledge/q22), [/knowledge/q33](https://pulserevops.com/knowledge/q33), [/knowledge/q41](https://pulserevops.com/knowledge/q41). ### 3. First 7 days runbook (executable) **Day 1 - cohort SQL (skeleton):** ``` SELECT account_id, stage, close_date, arr, headcount, funding_stage, tech_stack_flags, stakeholder_count_7d FROM opportunities WHERE close_date BETWEEN current_date - INTERVAL '12 months' AND current_date AND stage IN ('Closed Won','Closed Lost'); ``` **Day 2 - signal_lift calc:** for each candidate signal, compute won_rate(true) - won_rate(false), then divide by sqrt(N_won). Drop if < 0.10. **Day 3 - correlation matrix:** Pearson r between every retained signal pair; collapse pairs with r > 0.5 to one signal or split weight 50/50. **Day 4 - SFDC formula field:** `IF(ARR>=10000000,2,0) + IF(FundingStage='Series B+',3,0) + IF(StakeholderCount>=2,3,0) + IF(TechMatch,2,0) + IF(InboundSource,2,0)`. **Day 5 - Slack bot:** `/score-lead ` returns score + top 2 contributing signals. 4-second budget. **Day 6 - HubSpot smart list:** auto-tag `ICP-Priority` when `ICP_Tier__c >= 7`. **Day 7 - pilot with 5 reps:** measure override rate; abort if >25%. ### 4. 60-day rollout gates | Week | Action | Exit gate | |---|---|---| | 2 | SFDC + Slack live | Score visible in <4s | | 3-4 | Pilot 5 reps | Override <25% | | 5-8 | All-rep rollout + 1.1x accelerator on Tier-A | Tier-A win-rate >=1.5x Tier-C | | 9-12 | Quarterly review v1 | Override <15%, Tier-A NRR +10pts | ### 5. Tier outputs (what good looks like) | Tier | Score | Win rate | Cycle | Year-1 NRR | |---|---|---|---|---| | A | >=7 | 35-45% | 28-35d | 115%+ | | B | 4-6 | 18-25% | 50-65d | 100-110% | | C | <4 | 5-10% | 90d+ | 90-100% | Benchmarks aligned with [Forr

How do you build a real ICP scoring model that reps actually use to filter inbound leads instead of working everything?

Direct Answer

Detail

1. Cohort math with confidence intervals

2. Signal set with verified weights

3. First 7 days runbook (executable)

4. 60-day rollout gates

5. Tier outputs (what good looks like)

6. Anti-pattern callout

Bear Case (5 mutually exclusive failure modes + quantitative mitigations + 2 documented cases)

Signal	Cycle vs avg	Weight	Wilson 95% CI on lift	Source
Series B+ < 18mo	-22 days	3	[+0.07, +0.49]	OpenView 2023 portfolio (n=312)
2+ stakeholders in 7d	-27 days	3	[+0.30, +0.69]	Gong 2024 Win-Rate study (n=2.6M opps)
ARR $10M+	-15 days	2	[+0.05, +0.40]	Pavilion 2024 benchmark
Tech-stack match	-12 days	2	[+0.02, +0.34]	Gartner 2024 sales-tech maturity
Inbound source	-18 days	2	[+0.10, +0.42]	Bridge Group SDR Report

Week	Action	Exit gate
2	SFDC + Slack live	Score visible in <4s
3-4	Pilot 5 reps	Override <25%
5-8	All-rep rollout + 1.1x accelerator on Tier-A	Tier-A win-rate >=1.5x Tier-C
9-12	Quarterly review v1	Override <15%, Tier-A NRR +10pts

Tier	Score	Win rate	Cycle	Year-1 NRR
A	>=7	35-45%	28-35d	115%+
B	4-6	18-25%	50-65d	100-110%
C	<4	5-10%	90d+	90-100%

How do you build a real ICP scoring model that reps actually use to filter inbound leads instead of working everything?

Direct Answer

Detail

1. Cohort math with confidence intervals

2. Signal set with verified weights

3. First 7 days runbook (executable)

4. 60-day rollout gates

5. Tier outputs (what good looks like)

6. Anti-pattern callout

Bear Case (5 mutually exclusive failure modes + quantitative mitigations + 2 documented cases)

What does the score mean?