The International Genome Sample Resource (IGSR; http://www.internationalgenome.org) expands in data type and population diversity the resources from the 1000 Genomes Project. IGSR represents the largest open collection of human variation data and provides easy access to these resources. IGSR was established in 2015 to maintain and extend the 1000 Genomes Project data, which has been widely used as a reference set of human variation and by researchers developing analysis methods. IGSR has mapped all of the 1000 Genomes sequence to the newest human reference (GRCh38), and will release updated variant calls to ensure maximal usefulness of the existing data. IGSR is collecting new structural variation data on the 1000 Genomes samples from long read sequencing and other technologies, and will collect relevant functional data into a single comprehensive resource. IGSR is extending coverage with new populations sequenced by collaborating groups. Here, we present the new data and analysis that IGSR has made available. We have also introduced a new data portal that increases discoverability of our data—previously only browseable through our FTP site—by focusing on particular samples, populations or data sets of interest.
AbstractsBackgroundIndividual-level data pooling of large population-based studies across research centres in international research projects faces many hurdles. The BioSHaRE (Biobank Standardisation and Harmonisation for Research Excellence in the European Union) project aims to address these issues by building a collaborative group of investigators and developing tools for data harmonization, database integration and federated data analyses.MethodsEight population-based studies in six European countries were recruited to participate in the BioSHaRE project. Through workshops, teleconferences and electronic communications, participating investigators identified a set of 96 variables targeted for harmonization to answer research questions of interest. Using each study’s questionnaires, standard operating procedures, and data dictionaries, harmonization potential was assessed. Whenever harmonization was deemed possible, processing algorithms were developed and implemented in an open-source software infrastructure to transform study-specific data into the target (i.e. harmonized) format. Harmonized datasets located on server in each research centres across Europe were interconnected through a federated database system to perform statistical analysis.ResultsRetrospective harmonization led to the generation of common format variables for 73% of matches considered (96 targeted variables across 8 studies). Authenticated investigators can now perform complex statistical analyses of harmonized datasets stored on distributed servers without actually sharing individual-level data using the DataSHIELD method.ConclusionNew Internet-based networking technologies and database management systems are providing the means to support collaborative, multi-center research in an efficient and secure manner. The results from this pilot project show that, given a strong collaborative relationship between participating studies, it is possible to seamlessly co-analyse internationally harmonized research databases while allowing each study to retain full control over individual-level data. We encourage additional collaborative research networks in epidemiology, public health, and the social sciences to make use of the open source tools presented herein.
The incidence of cancer is continuing to rise and risk-tailored early diagnostic and/or primary prevention strategies are urgently required. The ideal risk-predictive test should: integrate the effects of both genetic and nongenetic factors and aim to capture these effects using an approach that is both biologically stable and technically reproducible; derive a score from easily accessible biological samples that acts as a surrogate for the organ in question; and enable the effectiveness of risk-reducing measures to be monitored. Substantial evidence has accumulated suggesting that the epigenome and, in particular, DNA methylation-based tests meet all of these requirements. However, the development and implementation of DNA methylation-based risk-prediction tests poses considerable challenges. In particular, the cell type specificity of DNA methylation and the extensive cellular heterogeneity of the easily accessible surrogate cells that might contain information relevant to less accessible tissues necessitates the use of novel methods in order to account for these confounding issues. Furthermore, the engagement of the scientific community with health-care professionals, policymakers and the public is required in order to identify and address the organizational, ethical, legal, social and economic challenges associated with the routine use of epigenetic testing.
Sudden cardiac death (SCD) accounts for 10–20% of total mortality, i.e., one in five individuals will eventually die suddenly. Given the substantial genetic component of SCD in younger cases, postmortem genetic testing may be particularly useful in elucidating etiological factors in the cause of death in this subset. The identification of genes responsible for inherited cardiac diseases have led to the organization of cardiogenetic consultations in many countries worldwide. Expert recommendations are available, emphasizing the importance of genetic testing and appropriate information provision of affected individuals, as well as their relatives. However, the context of postmortem genetic testing raises some particular ethical, legal, and practical (including economic or financial) challenges. The Public and Professional Policy Committee of the European Society of Human Genetics (ESHG), together with international experts, developed recommendations on management of SCD after a workshop sponsored by the Brocher Foundation and ESHG in November 2016. These recommendations have been endorsed by the ESHG Board, the European Council of Legal Medicine, the European Society of Cardiology working group on myocardial and pericardial diseases, the ERN GUARD-HEART, and the Association for European Cardiovascular Pathology. They emphasize the importance of increasing the proportion of both medical and medicolegal autopsies and educating the professionals. Multidisciplinary collaboration is of utmost importance. Public funding should be allocated to reach these goals and allow public health evaluation.
Motivation: The data that put the ‘evidence’ into ‘evidence-based medicine’ are central to developments in public health, primary and hospital care. A fundamental challenge is to site such data in repositories that can easily be accessed under appropriate technical and governance controls which are effectively audited and are viewed as trustworthy by diverse stakeholders. This demands socio-technical solutions that may easily become enmeshed in protracted debate and controversy as they encounter the norms, values, expectations and concerns of diverse stakeholders. In this context, the development of what are called ‘Data Safe Havens’ has been crucial. Unfortunately, the origins and evolution of the term have led to a range of different definitions being assumed by different groups. There is, however, an intuitively meaningful interpretation that is often assumed by those who have not previously encountered the term: a repository in which useful but potentially sensitive data may be kept securely under governance and informatics systems that are fit-for-purpose and appropriately tailored to the nature of the data being maintained, and may be accessed and utilized by legitimate users undertaking work and research contributing to biomedicine, health and/or to ongoing development of healthcare systems.Results: This review explores a fundamental question: ‘what are the specific criteria that ought reasonably to be met by a data repository if it is to be seen as consistent with this interpretation and viewed as worthy of being accorded the status of ‘Data Safe Haven’ by key stakeholders’? We propose 12 such criteria.Contact: paul.burton@bristol.ac.uk
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.