New Vulnerabilities Exposed In The Security Of Personal Genetic Information

Main Category: Genetics
Also Included In: Public Health
Article Date: 20 Jan 2013 - 0:00 PST

Current ratings for:
New Vulnerabilities Exposed In The Security Of Personal Genetic Information

Patient / Public:not yet rated

Healthcare Prof:5 stars

5 (1 votes)


Using only a computer, an Internet connection, and publicly accessible online resources, a team of Whitehead Institute researchers has been able to identify nearly 50 individuals who had submitted personal genetic material as participants in genomic studies.

Intent on conducting an exercise in "vulnerability research" - a common practice in the field of information security - the team took a multi-step approach to prove that under certain circumstances, the full names and identities of genomic research participants can be determined, even when their genetic information is held in databases in de-identified form.

"This is an important result that points out the potential for breaches of privacy in genomics studies," says Whitehead Fellow Yaniv Erlich, who led the research team. A description of the group's work is published in this week's Science magazine.

Erlich and colleagues began by analyzing unique genetic markers known as short tandem repeats on the Y chromosomes (Y-STRs) of men whose genetic material was collected by the Center for the Study of Human Polymorphisms (CEPH) and whose genomes were sequenced and made publicly available as part of the 1000 Genomes Project. Because the Y chromosome is transmitted from father to son, as are family surnames, there is a strong correlation between surnames and the DNA on the Y chromosome.

Recognizing this correlation, genealogists and genetic genealogy companies have established publicly accessible databases that house Y-STR data by surname. In a process known as "surname inference," the Erlich team was able to discover the family names of the men by submitting their Y-STRs to these databases. With surnames in hand, the team queried other information sources, including Internet record search engines, obituaries, genealogical websites, and public demographic data from the National Institute of General Medical Sciences (NIGMS) Human Genetic Cell Repository at New Jersey's Coriell Institute, to identify nearly 50 men and women in the United States who were CEPH participants.

Previous studies have contemplated the possibility of genetic identification by matching the DNA of a single person, assuming the person's DNA were cataloged in two separate databases. This work, however, exploits data between distant paternally-related individuals. As a result, the team notes that the posting of genetic data from a single individual can reveal deep genealogical ties and lead to the identification of a distantly-related person who may have no acquaintance with the person who released that genetic data.

"We show that if, for example, your Uncle Dave submitted his DNA to a genetic genealogy database, you could be identified," says Melissa Gymrek, a member of the Erlich lab and first author of the Science paper. "In fact, even your fourth cousin Patrick, whom you've never met, could identify you if his DNA is in the database, as long as he is paternally related to you."

Aware of the sensitivity of his work, Erlich emphasizes that he has no intention of revealing the names of those identified, nor does he wish to see public sharing of genetic information curtailed.

"Our aim is to better illuminate the current status of identifiability of genetic data," he says. "More knowledge empowers participants to weigh the risks and benefits and make more informed decisions when considering whether to share their own data. We also hope that this study will eventually result in better security algorithms, better policy guidelines, and better legislation to help mitigate some of the risks described."

To that end, Erlich shared his findings with officials at the National Human Genome Research Institute (NHGRI) and NIGMS prior to publication. In response, NIGMS and NHGRI moved certain demographic information from the publicly-accessible portion the NIGMS cell repository to help reduce the risk of future breaches. In the same issue of Science in which the Erlich study appears, Judith H. Greenberg and Eric D. Green, the Directors of NIGMS and NHGRI, and colleagues author a perspective on this latest research in which they advocate for an examination of approaches to balance research participants' privacy rights with the societal benefits to be realized from the sharing of biomedical research data.

"Yaniv's work is a timely reminder that in this era in which massive amounts of genomic data are being generated rapidly and shared in the interest of scientific advancement, there is an increasing likelihood of privacy breaches," says Whitehead Institute Director David Page. "I'm delighted that, thanks to Yaniv's overture to NIH, we at Whitehead Institute have the opportunity to join policymakers at NHGRI and elsewhere in what will be a critical, ongoing dialog about the importance of safeguarding data, of sharing data, and the implications of failure in either endeavor."

Article adapted by Medical News Today from original press release. Click 'references' tab above for source.
Visit our genetics section for the latest news on this subject.


This work was supported by the National Defense Science & Engineering Graduate Fellowship, the Edmond J. Safra Center for Bioinformatics at Tel-Aviv University, and a gift from James and Cathleen Stone.

Written by Matt Fearer

Yaniv Erlich is the Andria and Paul Heafy Fellow of Whitehead Institute for Biomedical Research, where his laboratory is located and all his research is conducted.

Full Citation:
"Identifying Personal Genomes by Surname Inference"
Science, January 18, 2012
Melissa Gymrek (1,2,3,4), Amy L. McGuire (5), David Golan (6), Eran Halperin (7,8,9), and Yaniv Erlich (1)

1. Whitehead Institute for Biomedical Research, Nine Cambridge Center, Cambridge, MA 02142, USA.
2. Harvard-MIT Division of Health Sciences and Technology, MIT, Cambridge, MA 02139, USA.
3. Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.
4. Department of Molecular Biology and Diabetes Unit, Massachusetts General Hospital, Boston, MA 02114, USA.
5. Center for Medical Ethics and Health Policy, Baylor College of Medicine, Houston, TX 77030, USA.
6. Department of Statistics and Operations Research, Tel Aviv University, Tel Aviv 69978, Israel.
7. School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel.
8. Department of Molecular Microbiology and Biotechnology, Tel-Aviv University, Tel Aviv 69978, Israel.
9. The International Computer Science Institute, Berkeley, CA 94704, USA.

Whitehead Institute for Biomedical Research
Please use one of the following formats to cite this article in your essay, paper or report:

MLA
Whitehead Institute for Biomedical Research. "New Vulnerabilities Exposed In The Security Of Personal Genetic Information." Medical News Today. MediLexicon, Intl., 20 Jan. 2013. Web.
19 May. 2013. <http://www.medicalnewstoday.com/releases/255086.php>

APA
Whitehead Institute for Biomedical Research. (2013, January 20). "New Vulnerabilities Exposed In The Security Of Personal Genetic Information." Medical News Today. Retrieved from
http://www.medicalnewstoday.com/releases/255086.php.

Please note: If no author information is provided, the source is cited instead.



Add Your Opinion On This Article

'New Vulnerabilities Exposed In The Security Of Personal Genetic Information'

Please note that we publish your name, but we do not publish your email address. It is only used to let you know when your message is published. We do not use it for any other purpose. Please see our privacy policy for more information.

If you write about specific medications or operations, please do not name health care professionals by name.

All opinions are moderated before being included (to stop spam)

Your Name:*
E-mail Address:*
Your Opinion Title:*
Opinion:*
This is to help prevent SPAM submissions. Please enter the words exactly as they appear, including capital letters and punctuation.*

* Fields marked with a * need to be filled in before you hit the submit button.

Contact Our News Editors

For any corrections of factual information, or to contact the editors please use our feedback form.

Please send any medical news or health news press releases to:

Note: Any medical information published on this website is not intended as a substitute for informed medical advice and you should not take any action before consulting with a health care professional. For more information, please read our terms and conditions.


Genetics

Most Popular Articles



Follow Our Genetics News On Twitter

Follow Us On Twitter
Get the latest news for this category delivered straight to your Twitter account. Simply visit our Genetics Twitter account and select the 'follow' option.



View list of all 'What Is...' articles »