Provenance

Dataset ID: inpatient-by-drg
Entity Type: drg
Role: base
Source: CMS
Vintage: CY2023
Entity Count: 534
Last ETL Run: 2026-04-13

Overview

The Medicare Inpatient Hospitals by Provider and Service dataset is published by CMS as part of the Medicare Provider Utilization and Payment Data series, available on data.cms.gov. It reports hospital-level discharge counts, average covered charges, average total payments, and average Medicare payments for each MS-DRG (Medicare Severity Diagnosis Related Group) at each participating hospital. The current file covers calendar year 2023 discharges at hospitals paid under the Inpatient Prospective Payment System (IPPS). Each row represents one provider-DRG combination, with key fields including Rndrng_Prvdr_CCN, DRG_Cd, DRG_Desc, Tot_Dschrgs, Avg_Submtd_Cvrd_Chrg, Avg_Tot_Pymt_Amt, and Avg_Mdcr_Pymt_Amt.

This dataset answers questions such as: how many Medicare fee-for-service discharges occurred nationally for a given DRG, which hospitals treat the highest volume of a specific DRG, how average Medicare payments for a DRG vary across hospitals, and how billed charges compare to actual payments. It is the primary source for DRG entity pages on CareGraph, where discharge volumes and payment statistics are aggregated across all reporting hospitals.
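The questions above can be illustrated with a minimal sketch over a few rows for a single DRG. The field names are the ones listed in the source file; the sample values and the helper names (top_by_volume, payment_spread, charge_to_payment_ratio) are hypothetical and are not part of the CareGraph ETL.

```python
# Hypothetical rows for ONE DRG; all values are invented for illustration.
rows = [
    {"Rndrng_Prvdr_CCN": "010001", "Tot_Dschrgs": 120,
     "Avg_Submtd_Cvrd_Chrg": 60000.0, "Avg_Tot_Pymt_Amt": 15000.0,
     "Avg_Mdcr_Pymt_Amt": 12000.0},
    {"Rndrng_Prvdr_CCN": "020002", "Tot_Dschrgs": 45,
     "Avg_Submtd_Cvrd_Chrg": 80000.0, "Avg_Tot_Pymt_Amt": 18000.0,
     "Avg_Mdcr_Pymt_Amt": 16000.0},
    {"Rndrng_Prvdr_CCN": "030003", "Tot_Dschrgs": 300,
     "Avg_Submtd_Cvrd_Chrg": 50000.0, "Avg_Tot_Pymt_Amt": 14000.0,
     "Avg_Mdcr_Pymt_Amt": 10000.0},
]


def top_by_volume(rows, n=20):
    """Return provider CCNs ordered by discharge count, highest first."""
    ranked = sorted(rows, key=lambda r: r["Tot_Dschrgs"], reverse=True)
    return [r["Rndrng_Prvdr_CCN"] for r in ranked[:n]]


def payment_spread(rows):
    """Return the (min, max) of average Medicare payment across hospitals."""
    payments = [r["Avg_Mdcr_Pymt_Amt"] for r in rows]
    return min(payments), max(payments)


def charge_to_payment_ratio(row):
    """Ratio of average billed (covered) charges to average total payment."""
    return row["Avg_Submtd_Cvrd_Chrg"] / row["Avg_Tot_Pymt_Amt"]


total_discharges = sum(r["Tot_Dschrgs"] for r in rows)
```

A ratio well above 1 indicates that billed charges exceed what was actually paid, which is the comparison the last question refers to.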

Join Strategy

This dataset connects to DRG entity pages on CareGraph using the DRG code as the join key. The source DRG_Cd field contains values in the format NNN - Description (e.g., 470 - MAJOR HIP AND KNEE JOINT REPLACEMENT OR REATTACHMENT OF LOWER EXTREMITY W/O MCC). During ETL, the build_drgs() function extracts the numeric portion by splitting on the hyphen delimiter and stripping non-digit characters to produce a clean 3-digit code (e.g., 470). All source rows sharing the same extracted DRG code are grouped together and aggregated into a single DRG manifest at site_data/drg/{drg_code}.json. The manifest includes discharge-weighted average payments, total discharge counts, hospital counts, percentile-based payment range statistics, and a ranked list of the top 20 hospitals by discharge volume for that DRG. Each hospital entry in the manifest also carries the provider CCN, enabling cross-links to hospital entity pages.
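The extraction and grouping steps described above can be sketched as follows. This is not the actual build_drgs() implementation. It assumes the split happens on the first hyphen only, and the function names (extract_drg_code, build_manifests) and manifest keys are hypothetical. Percentile-based range statistics and writing the JSON files are omitted.

```python
import re
from collections import defaultdict


def extract_drg_code(drg_cd):
    """Take the text before the first hyphen and strip non-digit characters."""
    head = drg_cd.split("-", 1)[0]
    return re.sub(r"\D", "", head)


def build_manifests(rows, top_n=20):
    """Group rows by extracted DRG code and aggregate each group."""
    groups = defaultdict(list)
    for row in rows:
        groups[extract_drg_code(row["DRG_Cd"])].append(row)

    manifests = {}
    for code, members in groups.items():
        total = sum(r["Tot_Dschrgs"] for r in members)
        # Discharge-weighted mean: each hospital's average counts in
        # proportion to its discharge volume.
        weighted_sum = sum(
            r["Avg_Mdcr_Pymt_Amt"] * r["Tot_Dschrgs"] for r in members
        )
        ranked = sorted(members, key=lambda r: r["Tot_Dschrgs"], reverse=True)
        manifests[code] = {  # key names are assumptions, not the real schema
            "drg_code": code,
            "total_discharges": total,
            "hospital_count": len({r["Rndrng_Prvdr_CCN"] for r in members}),
            "avg_medicare_payment": weighted_sum / total if total else None,
            "top_hospitals": [
                {"ccn": r["Rndrng_Prvdr_CCN"], "discharges": r["Tot_Dschrgs"]}
                for r in ranked[:top_n]
            ],
        }
    return manifests


# Invented sample rows in the "NNN - Description" format.
HIP = ("470 - MAJOR HIP AND KNEE JOINT REPLACEMENT OR REATTACHMENT "
       "OF LOWER EXTREMITY W/O MCC")
sample = [
    {"Rndrng_Prvdr_CCN": "010001", "DRG_Cd": HIP,
     "Tot_Dschrgs": 100, "Avg_Mdcr_Pymt_Amt": 12000.0},
    {"Rndrng_Prvdr_CCN": "020002", "DRG_Cd": HIP,
     "Tot_Dschrgs": 300, "Avg_Mdcr_Pymt_Amt": 10000.0},
    {"Rndrng_Prvdr_CCN": "010001", "DRG_Cd": "999 - HYPOTHETICAL EXAMPLE DRG",
     "Tot_Dschrgs": 50, "Avg_Mdcr_Pymt_Amt": 14000.0},
]
manifests = build_manifests(sample)
```

Weighting by discharges keeps a low-volume hospital with an unusual average from pulling the DRG-level figure as far as a simple mean of hospital averages would.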

Known Limitations

Data Quality Notes
