About This Project
Data Source
This dashboard visualizes the HHS Medicaid Provider Spending Dataset, published by the U.S. Department of Health and Human Services on the Medicaid Open Data Portal.
The dataset contains 227 million rows of claim-level spending data from 2018 through 2024, covering $1.09 trillion in Medicaid payments to over 617,000 unique billing providers across all 50 states, DC, and U.S. territories.
Methodology
Raw claims data was enriched with provider names and locations from the NPPES (National Plan and Provider Enumeration System), procedure descriptions from HCPCS (Healthcare Common Procedure Coding System), and provider classifications from NUCC (National Uniform Claim Committee) taxonomy codes.
Supplementary data enrichment includes:
- Enrollment data from CMS Medicaid Enrollment (MBES) — state-level monthly unduplicated enrollee counts (2018-2024)
- Managed care penetration from CMS Managed Care Enrollment Reports — percentage of enrollees in managed care by state and year
- ACA expansion status from KFF Medicaid Expansion Tracker — 41 states + DC expanded as of 2025
- Medicare crosswalk from CMS Medicare Physician & Other Practitioners PUF (2023) — NPI-level Medicare payment data for cross-program comparison
All aggregations were computed using DuckDB against the raw Parquet files. The web application serves pre-computed JSON files — no real-time queries are executed in the browser.
- Spending figures are as reported by HHS and have not been adjusted for inflation.
- Provider locations come from NPPES registration data, not claim service locations.
- State-level aggregations use the billing provider's registered state, which may differ from where services were rendered.
- Year-over-year growth is calculated using calendar year totals.
Privacy
The source dataset already applies privacy protections: rows with fewer than 12 claims are excluded by HHS before publication. All data shown here is derived from publicly available government records.
Limitations
- Fee-for-service only: This dataset covers Medicaid FFS claims only. Approximately 85% of Medicaid enrollees nationally are in managed care arrangements, meaning the vast majority of Medicaid spending flows through MCOs and is not captured in this data. States with high managed care penetration (e.g., TN 94%, TX 96%) appear to have very low spending — this reflects data coverage, not actual spending levels. See managed care percentages on each state page.
- Provider names may contain typos or variations (e.g., LLC vs L.L.C.) since they come from NPPES self-reported data.
- Some NPI numbers may map to different entities over time due to practice changes, mergers, or data entry issues.
- The dataset is updated annually. The most recent data may be incomplete due to claims processing lag.
Technology
Built with Next.js (static export), Tailwind CSS, Recharts, and react-simple-maps. Data processing uses Python with DuckDB. Hosted on Vercel with zero server costs.