Linked Electronic Health Records for Research on a Nationwide Cohort of More Than 54 Million People in England: Data Resource

RESEARCH: SPECIAL PAPER

Angela Wood,1,2,3,4,5 Rachel Denholm,6,7,8 Sam Hollings,9 Jennifer Cooper,6,7,8 Samantha Ip,1 Venexia Walker,6,10,11 Spiros Denaxas,5,12,13,14 Ashley Akbari,15 Amitava Banerjee,16,13,17 William Whiteley,18,19 Alvina Lai,13 Jonathan Sterne,6,7,8 Cathie Sudlow,18,20,21 on behalf of the CVD-COVID-UK consortium

For numbered affiliations see end of the article.
Correspondence to: C Sudlow [email protected] (or @BHFDataScience on Twitter: ORCID 0000-0002-7725-7520)
Additional material is published online only. To view please visit the journal online.
Cite this as: BMJ 2021;372:n826 http://dx.doi.org/10.1136/bmj.n826
Accepted: 29 March 2021
BMJ: first published as 10.1136/bmj.n826 on 7 April 2021. Downloaded from http://www.bmj.com/ on 24 September 2021 by guest. Protected by copyright.

ABSTRACT

OBJECTIVE
To describe a novel England-wide electronic health record (EHR) resource enabling whole population research on covid-19 and cardiovascular disease while ensuring data security and privacy and maintaining public trust.

DESIGN
Data resource comprising linked person level records from national healthcare settings for the English population, accessible within NHS Digital’s new trusted research environment.

SETTING
EHRs from primary care, hospital episodes, death registry, covid-19 laboratory test results, and community dispensing data, with further enrichment planned from specialist intensive care, cardiovascular, and covid-19 vaccination data.

PARTICIPANTS
54.4 million people alive on 1 January 2020 and registered with an NHS general practitioner in England.

MAIN OUTCOME MEASURES
Confirmed and suspected covid-19 diagnoses, exemplar cardiovascular conditions (incident stroke or transient ischaemic attack and incident myocardial infarction) and all cause mortality between 1 January and 31 October 2020.

RESULTS
The linked cohort includes more than 96% of the English population. By combining person level data across national healthcare settings, data on age, sex, and ethnicity are complete for around 95% of the population. Among 53.3 million people with no previous diagnosis of stroke or transient ischaemic attack, 98 721 had a first ever incident stroke or transient ischaemic attack between 1 January and 31 October 2020, of which 30% were recorded only in primary care and 4% only in death registry records. Among 53.2 million people with no previous diagnosis of myocardial infarction, 62 966 had an incident myocardial infarction during follow-up, of which 8% were recorded only in primary care and 12% only in death registry records. A total of 959 470 people had a confirmed or suspected covid-19 diagnosis (714 162 in primary care data, 126 349 in hospital admission records, 776 503 in covid-19 laboratory test data, and 50 504 in death registry records). Although 58% of these were recorded in both primary care and covid-19 laboratory test data, 15% and 18%, respectively, were recorded in only one.

CONCLUSIONS
This population-wide resource shows the importance of linking person level data across health settings to maximise completeness of key characteristics and to ascertain cardiovascular events and covid-19 diagnoses. Although this resource was initially established to support research on covid-19 and cardiovascular disease to benefit clinical care and public health and to inform healthcare policy, it can broaden further to enable a wide range of research.

WHAT IS ALREADY KNOWN ON THIS TOPIC
At the start of the covid-19 pandemic, approved researchers were unable to access national, linked health data across the whole UK population to conduct analyses that would support healthcare and public health policy

WHAT THIS STUDY ADDS
In partnership with NHS Digital, the British Heart Foundation Data Science Centre has developed a new trusted research environment for England, providing researchers with secure access to linked health data from primary and secondary care, registered deaths, covid-19 laboratory and vaccination data, and cardiovascular specialist audits
These datasets cover almost the entire population of England (>54 million people); similar linked data have been made available in trusted research environments for Scotland and Wales (>8 million people)
Large numbers of approved researchers are now accessing health data on almost all people in the UK to address important covid-19 related research questions

Introduction

The covid-19 pandemic has increased awareness of the importance of population-wide person level electronic health record (EHR) data from a range of sources for examining, modelling, and reporting disease trends to inform healthcare and public health policy.1 Key benefits of research using such data on nationwide cohorts include generalisability of findings across all age groups, ethnicities, geographical locations, and socioeconomic, health, and personal characteristics, and inclusion of large numbers of people and events, enhancing the precision of findings and enabling a wide spectrum of novel research studies (eg, characterising shapes of relations between risk factors and disease, or studying minority groups and rare disease subtypes).

While EHRs for whole country cohorts for Wales, Scotland, Denmark, and Sweden (populations of 3 to 10 million) have been used in research for several years,2-6 at the start of the covid-19 pandemic, researchers had no access to national linked healthcare data across the population of England to enable critical research to support healthcare decisions and public health policy. There were two main reasons for this: the collection of comprehensive, linkable primary care data did not exist nationally and there was no secure, privacy protecting mechanism for researchers to access and conduct population-wide research using national datasets linked across different parts of the health data system (eg, from primary care, hospitals, death registries, laboratories). EHR research in England to date has therefore not been able to take advantage of the statistical power of studying a population of almost 60 million people, and clinical, public health, and policy insights have directly represented only a subset of the population. Hence there remains a need for accessible, nationwide health data in England for research, while ensuring participant safety and maintaining public trust.

Motivated by the public health importance of fully understanding the relation between covid-19 and cardiovascular disease (CVD), the British Heart Foundation (BHF) Data Science Centre7 established the CVD-COVID-UK initiative.8 This partnership with national health data custodians in the four nations of the UK aims to provide linked, nationally collated EHRs for the whole population of the UK for approved research within secure, privacy protecting environments. Although established initially to support research into the impact of CVDs, and related treatment and risk factors, on covid-19 and the impact of covid-19 on CVDs, these linked EHR resources will, with appropriate ethical and regulatory approvals, be able to support a wide range of research studies. These could include investigations of the links between the full spectrum of risk factors and health states documented in EHRs and covid-19, the impact of the pandemic on health service activity and provision for non-covid-19 conditions, the nature and determinants of long covid, and the benefits and [...]

[...] emergency department, and critical care episodes), registered deaths (including causes of death), covid-19 laboratory tests, and community dispensed medicines (table 1, fig 1, CVD-COVID-UK Dataset dashboard,10 CVD-COVID-UK Dataset trusted research environment asset in Health Data Research Innovation Gateway).11 Further incorporation of specialist intensive care, cardiovascular audit, hospital electronic prescribing, and covid-19 vaccination data are planned soon. Supplementary figure 1 shows how data generated in hospitals, general practices, community pharmacies, covid-19 NHS and commercial testing services, covid-19 vaccination services, and registry offices flow to NHS Digital. Regular flows to NHS Digital of specialist intensive care and cardiovascular audit data from several different providers were established for the first time during 2020. Furthermore, the new primary care dataset established during spring 2020 by the GP Extraction Service, is the most comprehensive yet to flow to NHS Digital.

Data processing and linkage

The processing and quality checks applied by data providers and processors before arriving into and within NHS Digital vary by dataset (see notes accompanying supplementary figure 1 for full details). Linkage between datasets is enabled by NHS Digital’s Master Person Service,12 which aims to match multiple records with 99% accuracy for each person from different clinical computer systems (eg, hospitals and general practices) to a single unique identifier, the National Health Service number representing a single person. NHS numbers that are included in records are cross checked with associated personal details, including age, sex, and postcode, within the Personal Demographics Service. If the NHS number is verified, no further processing is required. If the NHS number [...]
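
The cross check of NHS numbers against personal details, described under data processing and linkage, might be sketched as below. This is only an illustration and not NHS Digital's implementation: the class names, the in-memory register, the exact-match rule, and the made-up identifiers are all assumptions introduced here. The text does not state how agreement on age, sex, and postcode is judged, and the handling of unverified records is not covered by this excerpt, so the sketch stops at reporting whether a record is verified.

```python
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class Demographics:
    """Personal details held against one NHS number (hypothetical structure)."""
    age: int
    sex: str
    postcode: str


@dataclass(frozen=True)
class ClinicalRecord:
    """A record arriving from one clinical system (hypothetical structure)."""
    nhs_number: Optional[str]
    age: int
    sex: str
    postcode: str


def normalise_postcode(postcode: str) -> str:
    # Compare postcodes without regard to spacing or letter case.
    return postcode.replace(" ", "").upper()


def is_verified(record: ClinicalRecord,
                register: Mapping[str, Demographics]) -> bool:
    """Return True when the record's NHS number is in the register and its
    age, sex, and postcode all agree with the registered details.

    Exact agreement on all three fields is an assumption of this sketch.
    """
    if record.nhs_number is None:
        return False
    held = register.get(record.nhs_number)
    if held is None:
        return False
    return (
        record.age == held.age
        and record.sex == held.sex
        and normalise_postcode(record.postcode) == normalise_postcode(held.postcode)
    )


# Made-up register entry and records, for illustration only.
register = {"0000000001": Demographics(age=67, sex="F", postcode="AB1 2CD")}
hospital_record = ClinicalRecord("0000000001", 67, "F", "ab12cd")
gp_record = ClinicalRecord("0000000001", 45, "F", "AB1 2CD")

print(is_verified(hospital_record, register))  # True: all details agree
print(is_verified(gp_record, register))        # False: age disagrees
```

In this sketch a verified record would need no further processing, matching the rule stated in the text; anything else would be passed on to whatever later matching steps the service applies.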
