Version: 2.2

Airflow Automated Validation Checks

Dataset Validations

Checks on Output of the Build

| test name | test description | tables checked |
| --- | --- | --- |
| validation_check_roid | checks that ROIDs are unique | prod_combined_all, prod_combined_abridged |
| validation_check_roid_vs_ros | checks that the count of ROIDs matches the expected count | prod_combined_all, prod_combined_abridged |
| validation_all_payers_have_rates | checks that non-null rates exist for all payers | prod_combined_all, prod_combined_abridged |
| validation_all_networks_have_rates | checks that non-null rates exist for all networks | prod_combined_all, prod_combined_abridged |
| validation_most_hospitals_have_rates | checks that non-null rates exist for >95% of hospitals | prod_combined_all, prod_combined_abridged |
| validation_billing_code_types_have_rates | checks that non-null rates exist for expected billing code types | prod_combined_all, prod_combined_abridged |
| validation_bill_types_have_rates | checks that non-null rates exist for expected bill types | prod_combined_all, prod_combined_abridged |
| non_outlier_median_canonical_rate | checks that the non-outlier median canonical rate is between 1000 and 4000 | prod_combined_abridged |
| non_outlier_median_canonical_percentage_of_state_avg_medicare | checks that the non-outlier median canonical rate as a percentage of state-average Medicare is between 1 and 1.7 | prod_combined_abridged |
| no_negative_rates | checks that there are no negative non-outlier canonical rates | prod_combined_abridged |
| no_rates_gtr_20m | checks that there are no non-outlier canonical rates > 20m | prod_combined_abridged |
| non_outlier_coverage_gtr_30_pct | checks that non-outlier coverage is > 30% | prod_combined_abridged |
| raw_rate_types | for each raw rate type, uses trace_raw_id to find the matching rate in core rates / hospital rates | prod_combined_all |
| transform_rate_types | reproduces each transformation, using trace_raw_id to find the matching rate in core rates / hospital rates | prod_combined_all |
| impute_rate_types | verifies each imputation calculation using fields in prod_combined_all | prod_combined_all |
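
To make the intent of a few of these checks concrete, here is a minimal Python sketch of three of them: ROID uniqueness, the >95% hospital coverage threshold, and the median range check. It is not the pipeline's actual implementation. The field names `roid`, `hospital_id`, `rate`, and `is_outlier` are hypothetical stand-ins for whatever columns the real tables expose, and treating the 1000 to 4000 bounds as inclusive is an assumption.

```python
from collections import Counter
from statistics import median


def duplicate_roids(rows):
    """Return ROIDs that appear more than once (an empty list means the check passes)."""
    counts = Counter(row["roid"] for row in rows)
    return sorted(roid for roid, n in counts.items() if n > 1)


def hospital_rate_coverage(rows):
    """Fraction of distinct hospitals that have at least one non-null rate."""
    hospitals = {row["hospital_id"] for row in rows}
    with_rates = {row["hospital_id"] for row in rows if row["rate"] is not None}
    return len(with_rates) / len(hospitals) if hospitals else 0.0


def median_in_range(rows, low=1000, high=4000):
    """True when the median of non-null, non-outlier rates lies in [low, high]."""
    rates = [
        row["rate"]
        for row in rows
        if row["rate"] is not None and not row["is_outlier"]
    ]
    return bool(rates) and low <= median(rates) <= high


# Hypothetical sample rows; column names are assumptions, not the real schema.
rows = [
    {"roid": 1, "hospital_id": "h1", "rate": 1200.0, "is_outlier": False},
    {"roid": 2, "hospital_id": "h2", "rate": None, "is_outlier": False},
    {"roid": 3, "hospital_id": "h3", "rate": 2500.0, "is_outlier": False},
    {"roid": 3, "hospital_id": "h3", "rate": 90000.0, "is_outlier": True},
]

print(duplicate_roids(rows))                 # ROID 3 is duplicated
print(hospital_rate_coverage(rows) > 0.95)   # 2 of 3 hospitals have rates
print(median_in_range(rows))                 # median of 1200 and 2500
```

In this sample the uniqueness and coverage checks would fail while the median check would pass, which shows why the checks are run independently rather than as a single pass/fail gate.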

Sample Analyze Report Creation

For each provider organization below, re-create an analyze report using data-only and check that the results fall within a reasonable range of the expected values.

  • commonspirit_health
  • mass_general_brigham
  • providence
  • intermountain
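
The comparison against expected values could be sketched as a relative-tolerance check. The source does not say what counts as a "reasonable range", so the 10% default tolerance, the metric names, and the numbers below are all hypothetical placeholders.

```python
def within_tolerance(actual, expected, rel_tol=0.10):
    """True when actual is within rel_tol (as a fraction) of expected."""
    if expected == 0:
        return actual == 0
    return abs(actual - expected) / abs(expected) <= rel_tol


def compare_report(actual, expected, rel_tol=0.10):
    """Return {metric: (actual, expected)} for metrics that are missing or out of tolerance."""
    failures = {}
    for metric, exp in expected.items():
        act = actual.get(metric)
        if act is None or not within_tolerance(act, exp, rel_tol):
            failures[metric] = (act, exp)
    return failures


# Hypothetical metrics for one provider org; the values are illustrative only.
expected = {"median_rate": 2000.0, "pct_of_medicare": 1.40}
actual = {"median_rate": 2100.0, "pct_of_medicare": 1.75}

print(compare_report(actual, expected))
```

Returning the failing metrics along with both values, rather than a bare boolean, makes it easier to see which part of a re-created report drifted.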

Pipeline Validations

Checks at Various Stages in the Build

Rate Object Space:

| test name | test description | tables checked |
| --- | --- | --- |
| validation_check_roid | checks that ROIDs are unique | tmp_rate_object_space |
| ros_billing_code_types | checks that expected billing code types exist | tmp_rate_object_space |
| ros_bill_types | checks that expected bill types exist | tmp_rate_object_space |
| ros_taxonomy_groupings | checks that expected taxonomy groupings exist | tmp_rate_object_space |
| ros_payers | checks that expected payers exist | tmp_rate_object_space |
| ros_networks | checks that expected networks exist | tmp_rate_object_space |
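
Each "expected values exist" check above amounts to a set difference between a reference list and the distinct values found in a column. The sketch below assumes rows are available as dictionaries. The column names and the payer and bill type values are hypothetical, and where the real expected lists come from is not stated in this document.

```python
def missing_expected(rows, column, expected):
    """Return the expected values that never appear in the given column."""
    present = {row[column] for row in rows}
    return sorted(set(expected) - present)


# Hypothetical slice of tmp_rate_object_space; names and values are placeholders.
ros_rows = [
    {"payer": "payer_a", "bill_type": "inpatient"},
    {"payer": "payer_b", "bill_type": "outpatient"},
]

print(missing_expected(ros_rows, "payer", ["payer_a", "payer_b", "payer_c"]))
print(missing_expected(ros_rows, "bill_type", ["inpatient", "outpatient"]))
```

An empty result means every expected value is present. The same helper would cover billing code types, taxonomy groupings, and networks by changing the column name and the expected list.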

Raw Data After Mapping to ROS:

| test name | test description | tables checked |
| --- | --- | --- |
| validation_check_roid | checks that ROIDs are unique | tmp_raw_mrf_payer_rates, tmp_raw_mrf_hospital_rates |
| validation_all_payers_have_rates | checks that non-null rates exist for all payers | tmp_raw_mrf_payer_rates, tmp_raw_mrf_hospital_rates |
| validation_all_networks_have_rates | checks that non-null rates exist for all networks | tmp_raw_mrf_payer_rates, tmp_raw_mrf_hospital_rates |
| validation_most_hospitals_have_rates | checks that non-null rates exist for >95% of hospitals | tmp_raw_mrf_payer_rates, tmp_raw_mrf_hospital_rates |
| validation_billing_code_types_have_rates | checks that non-null rates exist for expected billing code types | tmp_raw_mrf_payer_rates, tmp_raw_mrf_hospital_rates |
|  |  | tmp_raw_mrf_payer_rates, tmp_raw_mrf_hospital_rates |

Raw Data Combined:

| test name | test description | tables checked |
| --- | --- | --- |
| validation_check_roid | checks that ROIDs are unique | tmp_int_combined_raw |
| validation_check_roid_vs_ros | checks that the count of ROIDs matches the expected count | tmp_int_combined_raw |
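
One way to read the `validation_check_roid_vs_ros` check is as a comparison between the distinct ROIDs in a table and those in the rate object space. The test name suggests this reading, but it is an assumption: the document only says "expected count". The sketch below uses a hypothetical `roid` field and also reports which ROIDs differ, since two tables can have equal counts and still disagree.

```python
def compare_roids(table_rows, ros_rows):
    """Summarize how a table's distinct ROIDs line up with the rate object space."""
    table_roids = {row["roid"] for row in table_rows}
    ros_roids = {row["roid"] for row in ros_rows}
    return {
        "table_count": len(table_roids),
        "ros_count": len(ros_roids),
        "counts_match": len(table_roids) == len(ros_roids),
        "missing_from_table": sorted(ros_roids - table_roids),
        "unexpected_in_table": sorted(table_roids - ros_roids),
    }


# Hypothetical rows standing in for tmp_rate_object_space and tmp_int_combined_raw.
ros_rows = [{"roid": 1}, {"roid": 2}, {"roid": 3}]
combined_rows = [{"roid": 1}, {"roid": 2}, {"roid": 4}]

report = compare_roids(combined_rows, ros_rows)
print(report)
```

In this example the counts match even though ROID 3 is missing and ROID 4 is unexpected, so a count-only check would pass while the set comparison would flag the difference.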

After Combining Benchmarks and Accuracy:

| test name | test description | tables checked |
| --- | --- | --- |
| validation_check_roid | checks that ROIDs are unique | tmp_int_accuracy_brit |
| validation_check_roid_vs_ros | checks that the count of ROIDs matches the expected count | tmp_int_accuracy_brit |

SubDag Output:

| test name | test description | tables checked |
| --- | --- | --- |
| validation_check_roid | checks that ROIDs are unique | tmp_int_combined, tmp_int_combined_abridged |
| validation_check_roid_vs_ros | checks that the count of ROIDs matches the expected count | tmp_int_combined, tmp_int_combined_abridged |
| validation_all_payers_have_rates | checks that non-null rates exist for all payers | tmp_int_combined, tmp_int_combined_abridged |
| validation_all_networks_have_rates | checks that non-null rates exist for all networks | tmp_int_combined, tmp_int_combined_abridged |
| validation_most_hospitals_have_rates | checks that non-null rates exist for >95% of hospitals | tmp_int_combined, tmp_int_combined_abridged |
| validation_billing_code_types_have_rates | checks that non-null rates exist for expected billing code types | tmp_int_combined, tmp_int_combined_abridged |
| validation_bill_types_have_rates | checks that non-null rates exist for expected bill types | tmp_int_combined, tmp_int_combined_abridged |