Skip to content
FonteumThe Graph

The capability layer

APIREST + bulk accessMCP serverCallable by AI agentsFHIR R4 APIBulk exportAttestation & audit packReconciliationSource-vs-source diffsEntity graphSnapshotsPoint-in-time, bitemporal

By use case

Exclusion & sanctions screeningCredentialing & provider-data enrichmentAudit evidence & defensible programsProvider data for AI / RAGM&A & network diligence

By buyer

Compliance & riskDevelopers & AI teams

The differentiator

Coverage & sourcesThe catalogFreshnessMethodologyCare CompareFacility qualityBrowse all datasets →
Research

The dev on-ramp

DocsAPI referenceMCPQuickstartStatusChangelogSDKs & integrations
Pricing
Sign inTry the FHIR sandbox →Request access →

Platform

APIMCP serverFHIR R4 APIBulk exportAttestation & audit packReconciliationEntity graphSnapshots

Solutions

Exclusion & sanctions screeningCredentialing & provider-data enrichmentAudit evidence & defensible programsProvider data for AI / RAGM&A & network diligenceCompliance & riskDevelopers & AI teams

Data

Coverage & sourcesFreshnessMethodologyCare CompareBrowse all datasets →
Research

Developers

DocsAPI referenceMCPQuickstartStatusChangelogSDKs & integrations
Pricing
Sign inTry the FHIR sandbox →Request access →
Fonteum · Compare

Provenance-tracked provider data vs raw public files

Raw federal files — NPPES, PECOS, CMS Care Compare, OIG LEIE — are free and authoritative, but they ship as undated bulk CSVs with no cross-file identity spine and no per-field lineage. Fonteum publishes the same federal records NPI-resolved, snapshot-dated, and attestation-wrapped across

44 federal source familiesSource: https://fonteum.com/methodology · Dataset: fonteum-methodology/v1 · Snapshot: 2026-05-27
as of June 2026.

Published June 17, 2026 · Last reviewed June 2026 · Capability comparison — public facts only

Request access →or explore the Fonteum data platform →
9-capability comparison

From a bulk download to a queryable, provenance-tracked record

From a bulk download to a queryable, provenance-tracked record
CapabilityRaw public files (DIY)Fonteum
Cost & licensingFree under 17 U.S.C. § 105 — the federal records are public-domain works, no license required.Free to read — the same public-domain federal records, published structured, with no paywall.
Identity resolutionEach bulk file keys on its own identifier; there is no shared NPI spine joining a provider across files.NPI-resolved across sources on one identity backbone — active providers.
Cross-source joinsYou join NPPES, PECOS, OIG LEIE, and Care Compare yourself, reconciling mismatched keys.Pre-joined on NPI and CMS Certification Number so one entity view is reproducible against the files.
Field-level provenanceA bulk CSV carries no per-field source lineage — a number, but not where it came from.Source name, snapshot date, and known limitation attached to every rendered field.
Freshness trackingThe file carries one release date; there is no per-field last-checked stamp.Each field carries the source cadence and the date it was last reconciled.
Point-in-time historyEach download overwrites the last; reconstructing an as-of view is left to you.Snapshot-dated, bitemporal history — query the record as it stood on a past date.
Tamper-evidenceA local copy carries no cryptographic proof of what was fetched or when.SHA-256 digest plus an Ed25519 attestation chain on every snapshot.
Delivery & APIBulk files only — you build and host the query layer yourself.FHIR R4 API, MCP server, bulk export, and free research CSV/JSON.
Effort to first resultDownload, parse, dedupe, entity-match, then host before the first query.A first query in minutes with a free sandbox key — no sales cycle.

Raw federal files are authoritative and free; the difference is the work between a bulk download and a queryable, provenance-tracked record. Capability descriptions reflect the public bulk-file format, not any single vendor.

Why the gap exists

What a bulk download leaves to you

The files are free — the integration is the cost

NPPES, PECOS, and OIG LEIE are authoritative public-domain downloads, and that is a real strength. The cost lands afterward: parsing multi-gigabyte bulk files, deduplicating, reconciling identifiers, and standing up a query layer before a single answer comes out.

Identity resolution is the hard part

A provider appears in NPPES, in

2.98MSource: https://data.cms.gov/provider-characteristics/medicare-provider-supplier-enrollment/medicare-fee-for-service-public-provider-enrollment · Dataset: cms-pecos/v1 · Snapshot: 2026-05-01
PECOS enrollments, and in the OIG LEIE exclusions under keys that do not line up. Resolving one entity across files is the work a raw download leaves undone.

Provenance is what a download cannot carry

A CSV cell is a value with no memory of its origin. When an auditor asks which federal file backs a field and on what date, a field-level provenance record answers; a re-keyed local copy of a bulk file cannot.

Related comparisons

Compare other data capabilities

Exclusion screening vs single-list checks →

Multi-source, NPI-resolved exclusion screening vs checking one list.

Live provider data vs annual snapshots →

Continuously refreshed federal data vs paywalled annual snapshots.

Healthcare provider data platforms compared →

How sourcing model and provenance separate the category.

FAQ

Common questions

Why not just download the raw NPPES and PECOS files myself?
You can — they are free public-domain works. The work is everything after the download: parsing multi-gigabyte bulk files, deduplicating, reconciling identifiers across files, and hosting a query layer. Fonteum publishes those same federal records already parsed, NPI-resolved, and queryable, so the integration cost moves off your team.
What does NPI-resolved add over the raw files?
Raw files key on their own identifiers, so the same provider in NPPES, PECOS, and OIG LEIE is not linked. NPI-resolution joins those records onto one identity backbone of active providers, so a single provider view is reproducible across sources rather than something you reconstruct yourself.
Is Fonteum's data different from the federal source files?
No — it is the same federal records, restructured. Fonteum ingests the public files directly from CMS, OIG, and HRSA, parses them, and attaches provenance. Every published figure is reproducible against the original file, so you are reading the federal record, not a proprietary substitute for it.
What is field-level provenance and why does it matter?
Field-level provenance attaches a source name, snapshot date, and any known limitation to each individual field. It matters in compliance, credentialing, and diligence, where the basis of a data point can carry legal weight. A bulk CSV stores the value but not its lineage; the provenance record stores both.
How does Fonteum stay current versus a one-time download?
A download is a single point in time. Fonteum re-ingests each source on its native cadence — NPPES weekly, OIG LEIE monthly, Care Compare and PBJ quarterly — and stamps every field with a last-checked date, across , so freshness is observable rather than assumed.
Is Fonteum free if the federal files are already free?
The data is free to read — the underlying records are public-domain federal works, published openly with no account for static CSV and JSON. The paid pilot tier covers only what costs money to provide: scoped exports, FHIR API throughput, and integration support. You pay for scoping and throughput, not for access to federal data.
Request access

Skip the parsing. Query the federal record.

Browse free research at /research, see the field-level pipeline at /data-provenance, or request access.

Request access →or browse the source library →
See also
  • /sources → The full source library with tier, refresh cadence, and limitations.
  • /data-provenance → How every field ties back to a federal record.
  • /data → Dataset catalog, export concepts, and pilot pricing.
  • /docs/fhir → FHIR R4 US Core endpoint reference.

Built on the authoritative federal record

The primary sources, named on every page.

These are the federal agencies whose public datasets Fonteum ingests and attributes — the issuing authorities, not customers or partners. Every figure on the site links back to one of them.

  • CMS
  • HHS-OIG
  • HRSA
  • FDA
  • NLM
  • NUCC
  • Census
  • BLS
  • BEA

See the full source registry, with license and refresh cadence for each →

Reproducible by design

Every figure traces to its federal source.

14-tuple provenance

Every rendered fact ties to a source URL, dataset ID, snapshot date, row key, and SHA-256 — the full chain-of-custody record.

Reproducible SQL

Each study ships the exact query behind its figures, run against the cited federal snapshot. Re-run it yourself.

Daily reconciliation

Published counts are reconciled against the upstream federal datasets on a daily cadence, with drift logged.

Named medical review

Reviewed by Jennifer Montecillo, MD, medical reviewer. Non-practicing medical reviewer.

Read the full provenance and attestation methodology →

Two doors

Use the free API and open data

Query providers, facilities, sanctions, and quality scores — each field carrying its federal source. Self-serve, no call to start.

Explore the API →Browse the data catalog →

Talk to us

Managed pilots, enterprise terms, and audit-ready, signed attestation packages for compliance, risk, and research teams.

Talk to us →
Fonteum
Platform
Platform overviewAPIMCP serverFHIR R4 APIBulk exportAttestation & audit packReconciliationEntity graphSnapshots
Solutions
All solutionsExclusion & sanctions screeningCredentialing & enrichmentAudit evidenceProvider data for AI / RAGM&A & network diligenceCompliance & riskDevelopers & AI teams
Data & sources
Coverage & sourcesBrowse all datasetsState Medicaid exclusionsFreshnessMethodologyCare CompareSanctionsOwnershipStaffingDeficienciesSpecial Focus Facilities
Developers
Developer hubDocsAPI referenceQuickstartStatusChangelogSDKs & integrationsWebhooks
Research
Research hubHospital margin gapProvider access gapsGlossaryComparisonsCitationsWhy Fonteum
Company
AboutPressCustomersPricingContactEditorial policyCorrections
Trust & legal
TrustQualitySecurityPrivacy policyTerms of serviceMedical disclaimer

Reviewed by Jennifer Montecillo, MD, medical reviewer. Non-practicing medical reviewer.

© 2026 Fonteum LLC. All rights reserved.

·hello@fonteum.com

The U.S. healthcare graph AI can cite — every fact carries its source.

Request access→

The substrate, by the numbers

44federal source familiesDistinct CMS, OIG, HRSA, FDA and peer datasets
35dataset pagesCitable, downloadable /data catalog pages
65reproducible studiesEach shipping the SQL behind its figures