Enabling the secondary use of research data to advance scientific discoveries while respecting participant privacy has been a priority for both NIH and the public. How can we strike the right balance of maximizing public benefit from research while remaining consistent among the many important scientific and ethical considerations?
NIH has been evaluating the best way for researchers to access genomic summary results (GSR), which, as the name implies, are ‘aggregated’ summary statistics from all participants in a genomic research study or set of studies. GSR have an important distinction from some other types of genomic research data. This is because GSR do not include individual-level information, in contrast to individual genome sequences. Instead, GSR come from pooling genomic data from multiple individuals together, yielding information like genotype frequencies and other statistics. This information can help researchers determine which genomic variants might or might not contribute to a disease or disorder.
Before 2008, these types of GSR were publicly available in the NIH Database of Genotypes and Phenotypes (dbGaP). However, in 2008, an article was published showing that statistical methods using GSR could possibly be used to determine if an individual participated in a specific research study (if they also had access to that individual’s genomic data). Because of this concern, NIH decided that until it had a better appreciation of the state of the science and the actual risks to research participants, it was best to have GSR available through controlled-access.
Since that time, NIH has convened two workshops to bring together leaders in the field to consider a wide range of issues, including those directly related to GSR. One of the workshops, held in 2016, focused specifically on the risks and benefits of different levels of access to GSR. NIH also solicited broad input in a Request for Information earlier in 2017. Based on the recommendations from the workshops and public comments received through the RFI, NIH has come to realize that many stakeholders believe that there is little risk when GSR are maintained through unrestricted access (i.e., in an open and public way). However, they also suggested that additional protections should be in place for sensitive studies where there might be additional concerns, such as studies that include populations from isolated geographic areas or with rare or stigmatizing traits.
Based on this input, NIH has developed a proposed update to the access process for GSR under the NIH Genomic Data Sharing Policy, and is now seeking public comment. This update would allow GSR from most studies to be provided via a public, rapid-access model. GSR from sensitive studies would remain in controlled-access.
To view the request for comments and for instructions on how to comment, please visit: Previously Compiled Public Comments.
NIH encourages comments from all stakeholders, and is especially interested in hearing from members of the general public, research participants, and the broader patient community. Comments will be accepted until October 20, 2017. In addition, during the comment period, experts from both OSP and NHGRI will also be hosting a webinar on GSR on October 4. More details on this webinar will be provided shortly.
NIH is committed to maximizing the value of government-funded research while ensuring that participant privacy is protected, and we want to take all stakeholder thoughts into account. We look forward to hearing from you!
This blog was co-authored by Dr. Eric Green, Director of the Human Genome Research Institute. More information about NHGRI can be found at https://www.genome.gov/.