2020
DOI: 10.1101/2020.03.16.20037143
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Comparison of Population Characteristics in Real-World Clinical Oncology Databases in the US: Flatiron Health, SEER, and NPCR

Abstract: Background and Objective The Surveillance, Epidemiology, and End Results Program (SEER) program and the National Program of Cancer Registries (NPCR), are authoritative sources for population cancer surveillance and research in the US. An increasing number of recent oncology studies are based on the electronic health record (EHR)-derived de-identified databases created and maintained by Flatiron Health. This report describes the differences in the originating sources and data development processes, and compares… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
251
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
9

Relationship

3
6

Authors

Journals

citations
Cited by 233 publications
(255 citation statements)
references
References 21 publications
0
251
0
Order By: Relevance
“…We conducted this retrospective study using the Flatiron Health database: a demographically and geographically diverse longitudinal deidentified database derived from data obtained from electronic health records (EHRs) in the U.S. [20, 21]. The database includes information from more than 265 cancer clinics (approximately 800 sites of care), representing more than 2 million U.S. patients with cancer available for analysis.…”
Section: Subjects Materials and Methodsmentioning
confidence: 99%
“…We conducted this retrospective study using the Flatiron Health database: a demographically and geographically diverse longitudinal deidentified database derived from data obtained from electronic health records (EHRs) in the U.S. [20, 21]. The database includes information from more than 265 cancer clinics (approximately 800 sites of care), representing more than 2 million U.S. patients with cancer available for analysis.…”
Section: Subjects Materials and Methodsmentioning
confidence: 99%
“…The Flatiron Health database is a nationwide longitudinal, de-identified EHR-derived database comprised of de-identified patient-level structured and unstructured data, curated via technology-enabled abstraction [10]. During the study period, this database included deidentified data from approximately 280 US cancer clinics (ca.…”
Section: Data Sourcementioning
confidence: 99%
“…We considered only the assessments of response that followed radiographic imaging tests performed to evaluate disease burden, and the dates of those assessments were marked as ''assessment time points.'' This information was abstracted from the EHR by trained professionals [10].…”
Section: Development Of the Real-world Response (Rwr) Variablementioning
confidence: 99%
“…We defined patient cohorts from 3 subsets of the nationwide Flatiron Health electronic health record–derived de-identified database 21 : (1) metastatic CRC, (2) advanced non-small cell lung cancer, and (3) metastatic breast cancer. For each patient in our cohort, we extracted and assembled outcome data (time surviving after the date of advanced diagnosis) and selected clinical features that were commonly available and that we thought might help to predict prognosis (“predictors”).…”
Section: Methodsmentioning
confidence: 99%