Skip to main content
Version: 3.0

Clear Rates Methodology

The Clear Rates pipeline turns raw payer and hospital machine-readable files into a single, scored, canonical rate per rate object. This section documents each stage of that pipeline — how rates are collected, scored, selected, and delivered.

Pipeline Stages

StageWhat it doesSQL
SpinesBuild canonical reference tables for payers, networks, providers, and codessource
Rate Object SpaceCompute the Cartesian product of valid (payer, provider, code, network) combinationssource
Raw DataIngest payer MRF, hospital MRF, claims, and gross charges; map onto the rate object spacesource
TransformationsStandardize non-dollar rates (percentages, per diems, drug dosages) into comparable dollar amountssource
ImputationsEstimate missing rates using a hierarchical fallback chain (RC global → RC HCPCS → carveouts → DRG → CSTM)source
AccuracyScore every rate 0–7 using outlier bounds, benchmark comparisons, and payer/hospital counterparty validationsource
Rate SelectionPick the highest-scoring rate per ROID; translate to canonical score 1–5source
OutputMerge sub-version outputs into production tables; enrich with whispers and rollup viewssource

Key Concepts

Spines — curated reference tables that define the universe of payers, providers, networks, and codes. Spines standardize identifiers across data sources and determine which entities appear in the Rate Object Space. See Spines.

ROID — Rate Object ID. A unique combination of (payer_id, network_id, provider_id, billing_code, billing_code_type, bill_type). Every table in the pipeline is keyed on ROIDs.

Canonical — the final, selected values that represent Clear Rates' best answer for a ROID. canonical_rate is the single dollar rate chosen by the Rate Selection algorithm, and canonical_rate_score (1–5) is its confidence score. See Score Hierarchy for the full mapping.