Provenance

Dataset ID
geo-var-county
Entity Type
county
Role
base
Source
CMS
Vintage
2014–2023
Entity Count
3,198
Last ETL Run
2026-04-13

Overview

The Medicare Geographic Variation by County dataset is published by the Centers for Medicare & Medicaid Services (CMS) through the Medicare Geographic Variation Public Use File program. It contains county-level spending, utilization, and demographic measures for Medicare fee-for-service (FFS) beneficiaries, with approximately 100 fields across 3,198 county-level records. The source file spans calendar years 2014 through 2023; CareGraph displays the most recent year available (currently 2023). The dataset is released annually, typically with a 12- to 18-month reporting lag from the end of the measurement year.

This dataset answers questions such as: How does per-capita Medicare spending in one county compare to the national average? Which counties have the highest hospitalization or emergency department visit rates? How does beneficiary demographics (age, sex, race, dual-eligible status) and illness burden (HCC risk score) vary across counties? CareGraph uses the standardized per-capita spending figures (fields ending in _STDZD_PYMT_PC) to enable fair cross-county comparisons by removing geographic payment adjustments such as the wage index, cost-of-living adjustments, and teaching hospital add-on payments.

Join Strategy

Each row in the source CSV carries a BENE_GEO_CD field containing the county FIPS code. During ETL, only rows where BENE_GEO_LVL equals County and BENE_AGE_LVL equals All are retained, yielding one row per county for the selected year. The BENE_GEO_CD value is normalized to a 5-digit zero-padded string (2-digit state FIPS + 3-digit county FIPS) by stripping non-digit characters and left-padding with zeros via normalize_fips(). This normalized FIPS code serves as the join key to the county entity page at /county/{FIPS}.

The join is a left join from the county entity manifest to the dataset: county pages without a matching geographic variation record display missing-data indicators rather than being omitted. Rows whose BENE_GEO_CD does not yield a valid 5-digit FIPS after normalization are logged and skipped. The county display name and state abbreviation are parsed from the BENE_GEO_DESC field, which uses the format ST-County Name.

Known Limitations

Data Quality Notes

← Back to Methodology Hub · Report an error