Genetic privacy
   HOME

TheInfoList



OR:

Genetic privacy involves the concept of personal
privacy Privacy (, ) is the ability of an individual or group to seclude themselves or information about themselves, and thereby express themselves selectively. The domain of privacy partially overlaps with security, which can include the concepts of ...
concerning the storing, repurposing, provision to third parties, and displaying of information pertaining to one's
genetic information A nucleic acid sequence is a succession of bases signified by a series of a set of five different letters that indicate the order of nucleotides forming alleles within a DNA (using GACT) or RNA (GACU) molecule. By convention, sequences are u ...
. This concept also encompasses privacy regarding the ability to identify specific individuals by their
genetic sequence A nucleic acid sequence is a succession of bases signified by a series of a set of five different letters that indicate the order of nucleotides forming alleles within a DNA (using GACT) or RNA (GACU) molecule. By convention, sequences are us ...
, and the potential to gain information on specific characteristics about that person via portions of their genetic information, such as their propensity for specific diseases or their immediate or distant ancestry. With the public release of genome sequence information of participants in large-scale research studies, questions regarding participant privacy have been raised. In some cases, it has been shown that it is possible to identify previously anonymous participants from large-scale genetic studies that released gene sequence information. Genetic privacy concerns also arise in the context of criminal law because the government can sometimes overcome criminal suspects' genetic privacy interests and obtain their DNA sample. Due to the shared nature of genetic information between family members, this raises privacy concerns of relatives as well.Ram, Natalie (July 2018). "Incidental Informants: Police Can Use Genealogy Databases to Help Identify Criminal Relatives--But Should They?". ''Maryland Bar Journal''. As concerns and issues of genetic privacy are raised, regulations and policies have been developed in the United States both at a federal and state level.


Significance of genetic information

In the majority of cases, an individual's
genetic sequence A nucleic acid sequence is a succession of bases signified by a series of a set of five different letters that indicate the order of nucleotides forming alleles within a DNA (using GACT) or RNA (GACU) molecule. By convention, sequences are us ...
is considered unique to that individual. One notable exception to this rule in humans is the case of
identical twins Twins are two offspring produced by the same pregnancy.MedicineNet > Definition of TwinLast Editorial Review: 19 June 2000 Twins can be either ''monozygotic'' ('identical'), meaning that they develop from one zygote, which splits and forms two em ...
, who have nearly identical genome sequences at birth. In the remainder of cases, one's genetic fingerprint is considered specific to a particular person and is regularly used in the identification of individuals in the case of establishing innocence or guilt in legal proceedings via
DNA profiling DNA profiling (also called DNA fingerprinting) is the process of determining an individual's DNA characteristics. DNA analysis intended to identify a species, rather than an individual, is called DNA barcoding. DNA profiling is a forensic t ...
. Specific gene variants one's genetic code, known as
allele An allele (, ; ; modern formation from Greek ἄλλος ''állos'', "other") is a variation of the same sequence of nucleotides at the same place on a long DNA molecule, as described in leading textbooks on genetics and evolution. ::"The chrom ...
s, have been shown to have strong predictive effects in the occurrences of diseases, such as the
BRCA1 Breast cancer type 1 susceptibility protein is a protein that in humans is encoded by the ''BRCA1'' () gene. Orthologs are common in other vertebrate species, whereas invertebrate genomes may encode a more distantly related gene. ''BRCA1'' is a ...
and
BRCA2 ''BRCA2'' and BRCA2 () are a human gene and its protein product, respectively. The official symbol (BRCA2, italic for the gene, nonitalic for the protein) and the official name (originally breast cancer 2; currently BRCA2, DNA repair associated) ...
mutant genes in
Breast Cancer Breast cancer is cancer that develops from breast tissue. Signs of breast cancer may include a lump in the breast, a change in breast shape, dimpling of the skin, milk rejection, fluid coming from the nipple, a newly inverted nipple, or ...
and
Ovarian Cancer Ovarian cancer is a cancerous tumor of an ovary. It may originate from the ovary itself or more commonly from communicating nearby structures such as fallopian tubes or the inner lining of the abdomen. The ovary is made up of three different ...
, or PSEN1, PSEN2, and APP genes in
Early-Onset Alzheimer's Disease Early-onset Alzheimer's disease, also called younger-onset Alzheimer's, is Alzheimer's disease diagnosed before the age of 65. It is an uncommon form of Alzheimer's, accounting for only 5–10% of all Alzheimer's cases. About 60% have a positive ...
. Additionally, gene sequences are passed down with a regular pattern of inheritance between generations, and can therefore reveal one's ancestry via
genealogical DNA test A genealogical DNA test is a DNA-based test used in genetic genealogy that looks at specific locations of a person's genome in order to find or verify ancestral genealogical relationships, or (with lower reliability) to estimate the ethnic mixt ...
ing. Additionally with knowledge of the sequence of one's biological relatives, traits can be compared that allow relationships between individuals, or the lack thereof, to be determined, as is often done in
DNA paternity testing DNA paternity testing is the use of DNA profiles to determine whether an individual is the biological parent of another individual. Paternity testing can be especially important when the rights and duties of the father are in issue and a child' ...
. As such, one's genetic code can be used to infer many characteristics about an individual, including many potentially sensitive subjects such as: * Parentage / Non-paternity *
Consanguinity Consanguinity ("blood relation", from Latin '' consanguinitas'') is the characteristic of having a kinship with another person (being descended from a common ancestor). Many jurisdictions have laws prohibiting people who are related by blood fr ...
* Adoptive Status * Ancestry * Propensity for Disease * Predicted Physical Characteristics


Sources of genetic information

Common specimen types for direct-to-consumer genetic testing are cheek swabs and saliva samples. One of the most popular reasons for at-home genetic testing is to obtain information on an individual's ancestry via genealogical DNA testing and is offered by many companies such as 23andMe,
AncestryDNA Ancestry.com LLC is an American genealogy company based in Lehi, Utah. The largest for-profit genealogy company in the world, it operates a network of genealogical, historical records, and related genetic genealogy websites. In November 2018 ...
,
Family Tree DNA FamilyTreeDNA is a division of Gene by Gene, a commercial genetic testing company based in Houston, Texas. FamilyTreeDNA offers analysis of autosomal DNA, Y-DNA, and mitochondrial DNA to individuals for genealogical purpose. With a database of ...
, or
MyHeritage MyHeritage is an online genealogy platform with web, mobile, and software products and services, introduced by the Israeli company MyHeritage in 2003. Users of the platform can obtain their family trees, upload and browse through photos, and sea ...
. Other tests are also available which provide consumers with information on genes which influence the risk of specific diseases, such as the risk of developing late-onset Alzheimer's disease or
celiac disease Coeliac disease (British English) or celiac disease (American English) is a long-term autoimmune disorder, primarily affecting the small intestine, where individuals develop intolerance to gluten, present in foods such as wheat, rye and barle ...
.


Risks


Privacy Breaches

Studies have shown that genomic data is not immune to adversary attacks. A study conducted in 2013 revealed vulnerabilities in the security of public databases that contain genetic data. As a result, research subjects could sometimes be identified by their DNA alone. Although reports of premeditated breaches outside of experimental research are disputed, researchers suggest the liability is still important to study. While accessible genomic data has been pivotal in advancing biomedical research, it also escalates the possibility of exposing sensitive information. A common practice in genomic medicine to protect patient anonymity involves removing patient identifiers. However, de-identified data is not subject to the same privileges as the research subjects. Furthermore, there is an increasing ability to re-identify patients and their genetic relatives from their genetic data. One study demonstrated re-identification by piecing together genomic data from
short tandem repeats A microsatellite is a tract of repetitive DNA in which certain DNA motifs (ranging in length from one to six or more base pairs) are repeated, typically 5–50 times. Microsatellites occur at thousands of locations within an organism's genome. ...
(e.g.
CODIS The Combined DNA Index System (CODIS) is the United States national DNA database created and maintained by the Federal Bureau of Investigation. CODIS consists of three levels of information; Local DNA Index Systems (LDIS) where DNA profiles ori ...
), SNPallele frequencies (e.g.
ancestry An ancestor, also known as a forefather, fore-elder or a forebear, is a parent or ( recursively) the parent of an antecedent (i.e., a grandparent, great-grandparent, great-great-grandparent and so forth). ''Ancestor'' is "any person from w ...
testing), and whole-genome sequencing. They also hypothesize using a patient's genetic information, ancestry testing, and social media to identify relatives. Other studies have echoed the risks associated with linking genomic information with public data like social media, including voter registries, web searches, and personal demographics, or with controlled data, like personal medical records. There is also controversy regarding the responsibility a
DNA testing Genetic testing, also known as DNA testing, is used to identify changes in DNA sequence or chromosome structure. Genetic testing can also include measuring the results of genetic changes, such as RNA analysis as an output of gene expression, ...
company has to ensure that leaks and breaches do not happen. Determining who legally owns the genomic data, the company or the individual, is of legal concern. There have been published examples of personal genome information being exploited, as well as indirect identification of family members. Additional privacy concerns, related to, e.g.,
genetic discrimination Genetic discrimination occurs when people treat others (or are treated) differently because they have or are perceived to have a gene mutation(s) that causes or increases the risk of an inherited disorder. It may also refer to any and all discr ...
, loss of anonymity, and psychological impacts, have been increasingly pointed out by the academic community as well as government agencies.


Law Enforcement

Additionally, for criminal justice and privacy advocates, the use of genetic information in identifying suspects for criminal investigations proves worrisome under the Fourth Amendment—especially when an indirect genetic link connects an individual to crime scene evidence. Since 2018, law enforcement officials have been harnessing the power of genetic data to revisit
cold case A cold case is a crime, or a suspected crime, that has not yet been fully resolved and is not the subject of a current criminal investigation, but for which new information could emerge from new witness testimony, re-examined archives, new or r ...
s with DNA evidence.Murphy, Heather (2019-04-25). "Sooner or Later Your Cousin's DNA Is Going to Solve a Murder". ''The New York Times''.
ISSN An International Standard Serial Number (ISSN) is an eight-digit serial number used to uniquely identify a serial publication, such as a magazine. The ISSN is especially helpful in distinguishing between serials with the same title. ISSNs a ...
0362-4331. Retrieved 2020-05-19.
Suspects discovered through this process are not directly identified by the input of their DNA into established criminal databases, like CODIS. Instead, suspects are identified as the result of familial genetic sleuthing by law enforcement, submitting crime scene DNA evidence to genetic database services that link users whose DNA similarity indicates a family connection.  Officers can then track the newly identified suspect in person, waiting to collect discarded trash that might carry DNA in order to confirm the match. Despite the privacy concerns of suspects and their relatives, this procedure is likely to survive Fourth Amendment scrutiny. Much like donors of biological samples in cases of genetic research, criminal suspects do not retain property rights in abandoned waste; they can no longer assert an expectation of privacy in the discarded DNA used to confirm law enforcement suspicions, thereby eliminating their Fourth Amendment protection in that DNA. Additionally, the genetic privacy of relatives is likely irrelevant under current caselaw since Fourth Amendment protection is “personal” to criminal defendants.


Psychological Impact

In a systematic review of perspectives toward genetic privacy, researchers highlight some of the concerns individuals hold regarding their genetic information, such as the potential dangers and effects on themselves and family members. Academics note that participating in biomedical research or genetic testing has implications beyond the participant; it can also reveal information about genetic relatives. The study also found that people expressed concerns as to which body controls their information and if their genetic information could be used against them. Additionally, the
American Society of Human Genetics The American Society of Human Genetics (ASHG), founded in 1948, is a professional membership organization for specialists in human genetics. As of 2009, the organization had approximately 8,000 members. The Society's members include researchers, a ...
has expressed issues about genetic tests in children. They infer that testing could lead to negative consequences for the child. For example, if a child's likelihood for adoption was influenced by genetic testing, the child might suffer from self esteem issues. A child's well-being might also suffer due to paternity testing or custody battles that require this type of information.


Regulations

When the access of genetic information is regulated, it can prevent
insurance companies Insurance is a means of protection from financial loss in which, in exchange for a fee, a party agrees to compensate another party in the event of a certain loss, damage, or injury. It is a form of risk management, primarily used to hedge ...
and employers from reaching such data. This could avoid issues of discrimination, which oftentimes leaves an individual whose information has been breached without a job or without insurance.


In the United States


Federal Regulation

In the United States, biomedical research containing human subjects is governed by a baseline standard of ethics known as The Common Rule, which aims to protect a subject's privacy by requiring "identifiers" such as name or address to be removed from collected data. A 2012 report by the
Presidential Commission for the Study of Bioethical Issues The Presidential Commission for the Study of Bioethical Issues (the Bioethics Commission) was created by on November 24, 2009. Executive Order 13521 - ''Establishing the Presidential Commission for the Study of Bioethical Issues'', November  ...
stated, however, that "what constitutes 'identifiable' and 'de-identified' data is fluid and that evolving technologies and the increasing accessibility of data could allow de-identified data to become re-identified". In fact, research has already shown that it is "possible to discover a study participant's identity by cross-referencing research data about him and his DNA sequence … ithgenetic genealogy and public-records databases". This has led to calls for policy-makers to establish consistent guidelines and best practices for the accessibility and usage of individual genomic data collected by researchers. Privacy protections for genetic research participants were strengthened by provisions of the 21st Century Cures Act (H.R.34) passed on 7 December 2016 for which the American Society of Human Genetics (ASHG) commended Congress, Senator Warren and Senator Enzi. The
Genetic Information Nondiscrimination Act The Genetic Information Nondiscrimination Act of 2008 (, GINA ), is an Act of Congress in the United States designed to prohibit some types of genetic discrimination. The act bars the use of genetic information in health insurance and employment ...
of 2008 (GINA) protects the genetic privacy of the public, including research participants. The passage of GINA makes it illegal for health insurers or employers to request or require genetic information of an individual or of family members (and further prohibits the discriminatory use of such information). This protection does not extend to other forms of insurance such as life insurance. The Health Insurance Portability and Accountability Act of 1996 (HIPAA) also provides some genetic privacy protections. HIPAA defines health information to include genetic information, which places restrictions on who health providers can share the information with.


State Regulation

Three kinds of laws are frequently associated with genetic privacy: those relating to informed consent and property rights, those preventing insurance discrimination, and those prohibiting employment discrimination. According to the National Human Genome Research Institute, forty-one states have enacted genetic privacy laws as of January 2020. However, those privacy laws vary in the scope of protection offered; while some laws "apply broadly to any person" others apply "narrowly to certain entities such as insurers, employers, or researchers." Arizona, for example, falls in the former category and offers broad protection. Currently, Arizona's genetic privacy statutes focus on the need for informed consent to create, store, or release genetic testing results, but a pending bill would amend the state genetic privacy law framework to grant exclusive property rights in genetic information derived from genetic testing to all persons tested. In expanding privacy rights by including property rights, the bill would grant persons who undergo genetic testing greater control over their genetic information. Arizona also prohibits insurance and employment discrimination on the basis of genetic testing results. New York State also has strong legislative measures protecting individuals from genetic discrimination. Section 79-I of the New York Civil Rights Law places strict restrictions on the usage of genetic data. The statute also outlines the proper conditions for consenting to genetic data collection or usage. California similarly offers a broad range of protection for genetic privacy, but it stops short of granting individuals property rights in their genetic information. While currently enacted legislation focuses on prohibiting genetic discrimination in employment and insurance, a piece of pending legislation would extend genetic privacy rights to provide individuals with greater control over genetic information obtained through direct-to-consumer testing services like 23andMe. Florida passed House Bill 1189, a DNA privacy law that prohibits insurers from using genetic data, in July 2020. On the other hand, Mississippi offers few genetic privacy protections beyond those required by the federal government. In the Mississippi Employment Fairness Act, the legislature recognized the applicability of the
Genetic Information Nondiscrimination Act The Genetic Information Nondiscrimination Act of 2008 (, GINA ), is an Act of Congress in the United States designed to prohibit some types of genetic discrimination. The act bars the use of genetic information in health insurance and employment ...
, which "prohibit discrimination on the basis of genetic information with respect to health insurance and employment."


Other

To balance data sharing with the need to protect the privacy of research subjects geneticists are considering to move more data behind controlled-access barriers, authorizing trusted users to access the data from many studies, rather than "having to obtain it piecemeal from different studies". In October 2005, IBM became the world's first major corporation to establish a genetics privacy policy. Its policy prohibits using employees' genetic information in employment decisions.


Breaching techniques

According to a 2014 study by Yaniv Erlich and Arvind Narayanan, genetic privacy breaching techniques fall into three categories:


Identity Tracing

:Here the aim is to link between an unknown genome and the concealed identity of the data originator by accumulating quasi-identifiers − residual pieces of information that are embedded in the dataset − and to gradually narrow down the possible individuals that match the combination of these quasi-identifiers.


Attribute Disclosure Attacks via DNA (ADAD)

:Here the adversary already has access to the identified
DNA sample Genetic testing, also known as DNA testing, is used to identify changes in DNA sequence or chromosome structure. Genetic testing can also include measuring the results of genetic changes, such as RNA analysis as an output of gene expression, or ...
of the target and to a database that links DNA-derived data to sensitive attributes without explicit identifiers, for example a public database of the genetic study of drug abuse. The ADAD techniques match the DNA data and associate the identity of the target with the sensitive attribute


Completion Techniques

:Here the adversary also knows the identity of a genomic dataset but has access only to a sanitized version without sensitive loci. The aim here is to expose the sensitive loci that are not part of the original data. However, more recent studies have indicated new avenues for breaching genetic privacy:


Phenotype Inferences

:Here, the goal is to use readily available phenotype information about an individual, such as physical features (or some combination thereof), to make genetic inferences. As genetic databases grow at unprecedented rates, providing larger and more comprehensive aggregates, the ability to make inferences with more probabilistic certainty greatly increases. Furthermore, the scope of potential inferences grows with expanding datasets.


Safeguards

According to a 2022 study by Zhiyu Wan et al., safeguards for genetic privacy fall into two categories:


Legal Safeguards

:Legal safeguards include the Genetic Information Nondiscrimination Act of 2008, the
Health Insurance Portability and Accountability Act of 1996 The Health Insurance Portability and Accountability Act of 1996 (HIPAA or the Kennedy– Kassebaum Act) is a United States Act of Congress enacted by the 104th United States Congress and signed into law by President Bill Clinton on August 21, 1 ...
, the
Common Rule The Common Rule is a 1981 rule of ethics in the United States regarding biomedical and behavioral research involving human subjects. A significant revision became effective July 2018. It governed Institutional Review Boards for oversight of huma ...
, the US
National Institutes of Health The National Institutes of Health, commonly referred to as NIH (with each letter pronounced individually), is the primary agency of the United States government responsible for biomedical and public health research. It was founded in the lat ...
(NIH) data sharing policy,
European Union The European Union (EU) is a supranational union, supranational political union, political and economic union of Member state of the European Union, member states that are located primarily in Europe, Europe. The union has a total area of ...
’s
General Data Protection Regulation The General Data Protection Regulation (GDPR) is a European Union regulation on data protection and privacy in the EU and the European Economic Area (EEA). The GDPR is an important component of EU privacy law and of human rights law, in par ...
(GDPR), US state privacy laws (e.g.,
California Consumer Privacy Act The California Consumer Privacy Act (CCPA) is a state statute intended to enhance privacy rights and consumer protection for residents of California, United States. The bill was passed by the California State Legislature and signed into law by Je ...
,
California Privacy Rights Act The California Privacy Rights Act of 2020 (CPRA), also known as Proposition 24, is a California ballot proposition that was approved by a majority of voters after appearing on the ballot for the general election on November 3, 2020. This propo ...
, or Virginia Consumer Data Protection Act), self-regulations (e.g., data use agreements,
privacy policies A privacy policy is a statement or legal document (in privacy law) that discloses some or all of the ways a party gathers, uses, discloses, and manages a customer or client's data. Personal information can be anything that can be used to identify ...
, or terms of service), and
informed consent Informed consent is a principle in medical ethics and medical law, that a patient must have sufficient information and understanding before making decisions about their medical care. Pertinent information may include risks and benefits of treat ...
s.


Technical Safeguards

:Technical safeguards include cryptographic tools, access control, and data perturbation approaches. Specifically, cryptographic approaches include
homomorphic encryption Homomorphic encryption is a form of encryption that permits users to perform computations on its encrypted data without first decrypting it. These resulting computations are left in an encrypted form which, when decrypted, result in an identical ...
,
secure multiparty computation Secure multi-party computation (also known as secure computation, multi-party computation (MPC) or privacy-preserving computation) is a subfield of cryptography with the goal of creating methods for parties to jointly compute a function over their ...
, trusted execution environment, and
Blockchain A blockchain is a type of distributed ledger technology (DLT) that consists of growing lists of records, called ''blocks'', that are securely linked together using cryptography. Each block contains a cryptographic hash of the previous block, ...
, whereas data perturbation approaches include ''k''-anonymity, Beacon services,
differential privacy Differential privacy (DP) is a system for publicly sharing information about a dataset by describing the patterns of groups within the dataset while withholding information about individuals in the dataset. The idea behind differential privacy is t ...
, and synthetic data generation.


See also

* *
Genetic discrimination Genetic discrimination occurs when people treat others (or are treated) differently because they have or are perceived to have a gene mutation(s) that causes or increases the risk of an inherited disorder. It may also refer to any and all discr ...
* Genetic exceptionalism *
Gene theft In bioethics and law, gene theft or DNA theft is the act of acquiring the genetic material of another individual, usually from public places, without his or her permission. The DNA may be harvested from a wide variety of common objects such as ...
*
Maryland v. King In ''Maryland v. King'', 569 U.S. 435 (2013), the United States Supreme Court decided that a cheek swab of an arrestee's DNA is comparable to fingerprinting and therefore, a legal police booking procedure that is reasonable under the Fourth Ame ...
*
List of data breaches This is a list of data breaches, using data compiled from various sources, including press reports, government news releases, and mainstream news articles. The list includes those involving the theft or compromise of 30,000 or more records, al ...
*
Comparative genomics Comparative genomics is a field of biological research in which the genomic features of different organisms are compared. The genomic features may include the DNA sequence, genes, gene order, regulatory sequences, and other genomic structural ...
* DNA encryption


References


Further reading

* The Right to Know and the Right Not to Know: Genetic Privacy and Responsibility, {{ISBN, 9781107076075


External links


Genetic Information Privacy
EFF
Medical and Genetic Privacy
ACLU The American Civil Liberties Union (ACLU) is a nonprofit organization founded in 1920 "to defend and preserve the individual rights and liberties guaranteed to every person in this country by the Constitution and laws of the United States". ...

Genetic Privacy Laws
NCSL Information privacy Human genetics Privacy law