Anyone using synthetic patient data for coding audits and QA? Here's what I found

Daniel@qmtry.ai · Apr 23, 2026

Fellow coders,

I've been doing pre-bill coding audits for a mid-size physician group
and we recently started using synthetic patient data to QA our coding
logic before it goes to the encoder. Thought I'd share what worked
since I spent a while finding it.

The problem we were trying to solve: we needed realistic patient
records to run through our coding workflow to catch logic errors —
things like when our encoder is selecting the wrong principal diagnosis
sequencing or not flagging unbundling issues. Using real patient records
for this kind of systematic testing creates compliance risk. Using
made-up records means the clinical scenarios aren't realistic enough
to catch real-world edge cases.

What I tried first:
- CMS DE-SynPUF — ICD-9 coded, not useful
- Random Kaggle datasets — no documentation of coding methodology
- Synthea — generates FHIR records but coding is generic, not
specialty-specific

What worked: patientdatasets.com — specialty-specific datasets
(cardiology, orthopedic, mental health, oncology) coded to ICD-10-CM.
The mental health set already has F32.A (depression, unspecified —
new FY2026 code effective October 1) which most other synthetic
datasets don't have.

Commercial license means no IRB or DUA complications for billing
workflow use.

Not affiliated — just a coder sharing what worked. Anyone else
found good sources for this kind of testing data?

TThivierge · Apr 30, 2026

Daniel@qmtry.ai said:
Fellow coders,

I've been doing pre-bill coding audits for a mid-size physician group
and we recently started using synthetic patient data to QA our coding
logic before it goes to the encoder. Thought I'd share what worked
since I spent a while finding it.

The problem we were trying to solve: we needed realistic patient
records to run through our coding workflow to catch logic errors —
things like when our encoder is selecting the wrong principal diagnosis
sequencing or not flagging unbundling issues. Using real patient records
for this kind of systematic testing creates compliance risk. Using
made-up records means the clinical scenarios aren't realistic enough
to catch real-world edge cases.

What I tried first:
- CMS DE-SynPUF — ICD-9 coded, not useful
- Random Kaggle datasets — no documentation of coding methodology
- Synthea — generates FHIR records but coding is generic, not
specialty-specific

What worked: patientdatasets.com — specialty-specific datasets
(cardiology, orthopedic, mental health, oncology) coded to ICD-10-CM.
The mental health set already has F32.A (depression, unspecified —
new FY2026 code effective October 1) which most other synthetic
datasets don't have.

Commercial license means no IRB or DUA complications for billing
workflow use.

Not affiliated — just a coder sharing what worked. Anyone else
found good sources for this kind of testing data?

Hi Daniel
I know our Encoder picks up the Excludes Rules 1 if two dx codes similar for same pt. at same time. Also if dx is billed as first but it has NEC(not else classified) behind this is an error. Encoder tell us need more info some dx not to use as primary dx. Also some dx codes like dx N39 as example needs more data dx. added or another dx code as underlying ds. External injury dx codes work like that also need more data such as place it happened and how it happened & date of injury from .Our Encoder let s us know when to bill 1 lab PANEL code instead of variety of labs. Also some CPT and Dx code linked per gender and age of pt. Our Encoder alerts us if unspecfic dx code for limb or lateral body part needs to know if RT or LT. Modifiers are listed XS or 51, LT, RT,GP, GO, or KX if warranted for some CPT codes or linked to type of providers AI, AH, AJ.AF
I hope helped you somewhat.

Lady T

Anyone using synthetic patient data for coding audits and QA? Here's what I found

Daniel@qmtry.ai

New

TThivierge

True Blue