Provenance

Dataset ID: xubh-q36u
Entity Type: hospital
Role: base
Source: CMS
Vintage: FY2026
Entity Count: 5,426
Last ETL Run: 2026-04-13

Overview

The Hospital General Information dataset (dataset ID xubh-q36u) is published by the Centers for Medicare & Medicaid Services (CMS) as part of Care Compare (formerly Hospital Compare) and distributed via data.cms.gov. It contains one row per Medicare-certified hospital; the FY2026 vintage covers 5,426 facilities across all 50 states, the District of Columbia, and US territories. Each record includes facility identifiers, address, phone number, hospital type, ownership, emergency services availability, and the Hospital Overall Rating (1 to 5 stars). CMS refreshes the file roughly quarterly, in step with Care Compare update releases.

This dataset answers questions such as: What type of hospital is this (acute care, critical access, psychiatric, etc.)? Who owns it? Does it provide emergency services? What is its CMS star rating? On CareGraph, it serves as the foundational spine for all hospital entity pages — every hospital page is seeded from a row in this file.

Join Strategy

Each row is matched to a hospital entity page using the Facility ID column, which contains the CMS Certification Number (CCN), a 6-character string that is zero-padded on the left. During ETL, the normalize_ccn function strips whitespace and other non-alphanumeric characters, then left-pads the result with zeros to 6 characters. One JSON manifest is emitted per valid CCN into site_data/hospital/{CCN}.json, and the hospital entity page at /hospital/{CCN} renders data from that manifest. Rows with blank or unparseable CCNs are logged and skipped. Because this dataset defines the hospital entity roster, every hospital page on CareGraph originates from a record in this file; other hospital datasets (HRRP, HVBP) are enrichment joins onto the same CCN key.
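
The normalization and manifest steps described above can be sketched roughly as follows. This is an illustrative sketch, not the actual CareGraph ETL code: only the name normalize_ccn, the Facility ID column, and the site_data/hospital/{CCN}.json path come from the description. The emit_manifests helper, the rule that values longer than 6 characters are unparseable, and the choice to write the raw row as the manifest body are all assumptions made for the example.

```python
import json
import logging
import re
from pathlib import Path
from typing import Iterable, List, Optional, Tuple

log = logging.getLogger("hospital_etl")  # hypothetical logger name


def normalize_ccn(raw: Optional[str]) -> Optional[str]:
    """Return a 6-character CCN, or None if the value is unusable."""
    if raw is None:
        return None
    # Strip whitespace and every other non-alphanumeric character.
    cleaned = re.sub(r"[^0-9A-Za-z]", "", str(raw))
    # Assumption: blank or over-long values are treated as unparseable.
    if not cleaned or len(cleaned) > 6:
        return None
    # Left-pad with zeros to the fixed CCN width.
    return cleaned.zfill(6)


def emit_manifests(
    rows: Iterable[dict], out_dir: str = "site_data/hospital"
) -> Tuple[List[str], int]:
    """Write one JSON manifest per valid CCN; log and skip bad rows."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written: List[str] = []
    skipped = 0
    for row in rows:
        ccn = normalize_ccn(row.get("Facility ID"))
        if ccn is None:
            log.warning("Skipping row with unusable CCN: %r", row.get("Facility ID"))
            skipped += 1
            continue
        # Assumption: the manifest body is simply the source row.
        (out / f"{ccn}.json").write_text(json.dumps(row, indent=2))
        written.append(ccn)
    return written, skipped
```

Under these assumptions, a Facility ID of " 10001 " normalizes to 010001 and is written to site_data/hospital/010001.json, while a blank Facility ID is logged and counted as skipped.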

Known Limitations

Data Quality Notes
