Are Aggregated Electronic Health Record Datasets Good for Research?

Neal Goldstein; Brianne L Olivieri-Mui; Igor Burstyn

doi:10.1007/s11606-025-09808-9

Back

Are Aggregated Electronic Health Record Datasets Good for Research?

Journal article

Open access

Peer reviewed

Are Aggregated Electronic Health Record Datasets Good for Research?

Neal Goldstein, Brianne L Olivieri-Mui and Igor Burstyn

Journal of general internal medicine : JGIM, v 40, pp 3743-3749

12 Aug 2025

DOI: https://doi.org/10.1007/s11606-025-09808-9

PMID: 40794368

Featured in Collection : Research Supported by Drexel Libraries' OA Programs

Files and links (1)

url

https://doi.org/10.1007/s11606-025-09808-9View

Published, Version of Record (VoR) Open Access via Drexel Libraries Read and Publish Program 2025 Open CC BY V4.0

Abstract

data aggregation

validity

quantitative bias analysis

Data Management or Analysis (Medical)

Health Information Technology

Electronic Health Records

There has been a proliferation of large-scale electronic health record (EHR) data platforms that pool across multiple healthcare organizations, such as the National Institutes of Health’s All of Us in the federal space and TriNetX and Epic Cosmos in the commercial space. There are unique issues that occur when EHR data are aggregated across disparate healthcare systems beyond the general—and more well known—concerns about secondary analysis of EHR data from a single entity. In this article, we define aggregated EHR data, contrasting it to other real-world data sources, highlight benefits and challenges when working with aggregated EHR data, offer several “good practices” to address these challenges, and conclude by discussing whether it is appropriate to pool these data together or not.

Metrics

14 Record Views

2 citations in Web of Science

See more details

Details

Title: Are Aggregated Electronic Health Record Datasets Good for Research?
Creators: Neal Goldstein (Corresponding Author) - Drexel University, Epidemiology and Biostatistics
Brianne L Olivieri-Mui - Northeastern University
Igor Burstyn - Drexel University, Environmental and Occupational Health
Publication Details: Journal of general internal medicine : JGIM, v 40, pp 3743-3749
Publisher: Springer Nature
Number of pages: 7
Grant note: This work was supported in part by award #K01AG077972 from the National Institute of Aging (to BOM)
Resource Type: Journal article
Language: English
Academic Unit: Microbiology and Immunology; Epidemiology and Biostatistics; Environmental and Occupational Health
Web of Science ID: WOS:001547782500001
Scopus ID: 2-s2.0-105013393306
Other Identifier: 991022073428404721

UN Sustainable Development Goals (SDGs)

This publication has contributed to the advancement of the following goals:

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Collaboration types: Domestic collaboration
Web of Science research areas: Health Care Sciences & Services

Are Aggregated Electronic Health Record Datasets Good for Research?

Files and links (1)

Abstract

Metrics

Details

UN Sustainable Development Goals (SDGs)

InCites Highlights

Drexel University Social media