Provenance

Dataset ID
partd-drug-spending
Entity Type
drug
Role
base
Source
CMS
Vintage
CY2023
Entity Count
1,938
Last ETL Run
2026-04-13

Overview

The Medicare Part D Spending by Drug dataset is published by CMS on data.cms.gov as part of the Medicare Drug Spending dashboard. It reports total spending, claim counts, beneficiary counts, cost-per-dosage-unit, and year-over-year spending changes for drugs covered under Medicare Part D — the outpatient and retail prescription drug benefit. Each row represents one drug, identified by brand name and generic name. The current file contains 1,938 drug records. The dataset does not include drugs administered by physicians and billed under Medicare Part B (e.g., infused chemotherapy, injectable biologics).

This dataset answers questions such as: which drugs account for the largest share of Medicare Part D spending, how average cost per dosage unit compares across drugs, how many beneficiaries use a given drug, and how spending has changed year over year. Total spending reflects gross amounts before manufacturer rebates, which are confidential — net spending after rebates is substantially lower for many brand-name drugs.

Join Strategy

This dataset joins to drug entity pages on CareGraph using the generic name field (Generic Name), uppercased and trimmed during ETL to serve as the canonical drug identifier. All brand-name formulations, manufacturers, and dosage forms that share the same generic name are aggregated under a single drug entity page. For example, all records for atorvastatin — regardless of brand (Lipitor, generics) or dosage form (10 mg, 20 mg, 40 mg tablets) — are merged into the /drug/atorvastatin page. The join key is a plain string match after uppercasing and whitespace normalization; no code-based identifier (NDC, RxNorm) is used. Drug entity pages display the aggregated spending, claim count, beneficiary count, and cost-per-unit fields from this dataset.

Known Limitations

Data Quality Notes

← Back to Methodology Hub · Report an error