Background image


The SyntheticMass data set is available for download in bulk as gzip archives. Each archive contains one million synthetic patient medical records, encoded in HL7 FHIR, C-CDA and CSV. Please consider reaching out and letting us know how you are using this data set so we can improve it in the future.

Sample files containing 1,000 patient records in multiple formats are available below:

SyntheticMass Data is also available through a rich public data query API using the HL7 FHIR v.1.8.0 standard. More information is available on the FHIR API page.