The Combined DNA Index System (CODIS) is the United States national DNA database created and maintained by the Federal Bureau of Investigation. CODIS consists of three levels of information: Local DNA Index Systems (LDIS), where DNA profiles originate; State DNA Index Systems (SDIS), which allow laboratories within a state to share information; and the National DNA Index System (NDIS), which allows states to compare DNA information with one another. The CODIS software contains several databases, and the one searched depends on the type of information involved. Examples include missing persons, convicted offenders, and forensic samples collected from crime scenes. Each state, and the federal system, has its own laws governing the collection, upload, and analysis of the information in its database. For privacy reasons, however, the CODIS database does not contain any personally identifying information, such as the name associated with a DNA profile. The uploading agency is notified of any hits to its samples and is responsible for disseminating personal information pursuant to its laws.


Establishment

The creation of a national DNA database within the U.S. was first mentioned by the Technical Working Group on DNA Analysis Methods (TWGDAM) in 1989.[1] In 1990, the FBI began a pilot DNA databasing program with 14 state and local laboratories.[2] In 1994, Congress passed the DNA Identification Act, which authorized the FBI to create a national DNA database of convicted offenders as well as separate databases for missing persons and forensic samples collected from crime scenes. The Act also required that laboratories participating in the CODIS program maintain accreditation from an independent nonprofit organization that is actively involved in the forensic fields, and that scientists processing DNA samples for submission into CODIS maintain proficiency and be routinely tested to ensure the quality of the profiles uploaded into the database.[3] The national level of CODIS (NDIS) was implemented in October 1998. Today, all 50 states, the District of Columbia, federal law enforcement, the Army Laboratory, and Puerto Rico participate in the national sharing of DNA profiles.[4]

Database structure

The CODIS database contains several indexes for storing DNA profile information. Three indexes assist in criminal investigations: the offender index, which contains DNA profiles of those convicted of crimes; the arrestee index, which contains profiles of those arrested for crimes pursuant to the laws of the particular state; and the forensic index, which contains profiles collected from crime scenes.[5] Additional indexes, such as the unidentified human remains index, the missing persons index, and the biological relatives of missing persons index, are used to assist in identifying missing persons.[5] Specialty indexes also exist for specimens that do not fall into the other categories.
These indexes include the staff index, for profiles of employees who work with the samples, and the multi-allelic offender index, for single-source samples that have three or more alleles at two or more loci.[6]

Statistics

As of February 2017, NDIS contains more than 12 million offender profiles, more than 2.5 million arrestee profiles, and more than 750,000 forensic profiles.[7] The effectiveness of CODIS is measured by the number of investigations aided through database hits. As of February 2017, CODIS has aided in over 350,000 investigations and produced more than 365,000 hits.[7] Each state has its own SDIS database and can set its own inclusion standards, which can be less strict than those at the national level. For this reason, a number of profiles that are present in state-level databases are not in the national database and are not routinely searched across state lines.[8]

Scientific basis

The bulk of identifications using CODIS rely on short tandem repeats (STRs), which are scattered throughout the human genome, and on statistics that are used to calculate the rarity of a specific profile in the population.[9] STRs are a type of copy-number variation and comprise a sequence of nucleotide base pairs that is repeated many times in a row. At each location tested during DNA analysis, known as a locus (plural loci), a person has two sets of repeats, one from the father and one from the mother. Each set is measured and the number of repeat copies is recorded.[10] If both copies, one inherited from each parent, contain the same number of repeats at a locus, the person is said to be homozygous at that locus. If the repeat numbers differ, the person is said to be heterozygous. Each distinct repeat number that can occur at a locus is an allele.[11] This repeat determination is performed across a number of loci, and the resulting repeat values make up the DNA profile that is uploaded to CODIS.
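As a rough illustration of the description above, a profile can be modeled as a mapping from each locus to the pair of repeat counts observed there. The sketch below is in Python. The type name `Profile`, the function `zygosity`, and the repeat values are all hypothetical and chosen only for illustration. They do not reflect how the CODIS software stores profiles.

```python
from typing import Dict, Tuple

# Hypothetical representation: locus name -> (repeat count from one parent,
# repeat count from the other parent). Not the actual CODIS storage format.
Profile = Dict[str, Tuple[float, float]]


def zygosity(alleles: Tuple[float, float]) -> str:
    """Classify a locus as homozygous or heterozygous from its two repeat counts."""
    first, second = alleles
    return "homozygous" if first == second else "heterozygous"


# Made-up repeat counts, for illustration only.
example_profile: Profile = {
    "D3S1358": (15, 17),
    "TPOX": (8, 8),
    "FGA": (21, 24),
}

# Summarize the zygosity at every locus in the example profile.
summary = {locus: zygosity(pair) for locus, pair in example_profile.items()}
```

Here `summary` records one classification per locus, so a locus with two equal repeat counts is reported as homozygous and any other locus as heterozygous.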
As of January 1, 2017, known offender profiles must contain 20 loci to be uploaded to the national level.[4] Alternatively, CODIS allows mitochondrial DNA (mtDNA) information to be uploaded into the missing persons indexes. Because mtDNA is passed down from mother to offspring, it can be used to link remains to living relatives who share the same mtDNA.[4]

Loci

The original 13 core loci and their locations on the genome plus the sex determining locus Amelogenin (AMEL).

Prior to January 1, 2017, the national level of CODIS required that known offender profiles have a set of 13 loci called the "CODIS core". Since then, the requirement has expanded to include seven additional loci. Partial profiles are also allowed in CODIS, in separate indexes, and are common in crime scene samples that are degraded or are mixtures from multiple individuals. Upload of these profiles to the national level of CODIS requires at least eight of the core loci to be present as well as a profile rarity of 1 in 10 million (calculated using population statistics).[4] Loci that fall within a gene are named after the gene. For example, TPOX is named after the human thyroid peroxidase gene.[12] Loci that do not fall within genes follow a standard naming scheme for uniformity. These loci are named D + the chromosome the locus is on + S + the order in which the location on that chromosome was described. For example, D3S1358 is on the third chromosome and is the 1358th location described.[13] The CODIS core loci are listed below; loci marked with an asterisk were added to the list in January 2017.[14]

CSF1PO D3S1358 D5S818 D7S820 D8S1179 D13S317 D16S539 D18S51 D21S11 FGA TH01 TPOX vWA D1S1656* D2S441* D2S1338* D10S1248* D12S391* D19S433* D22S1045*
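The naming scheme and the partial-profile upload rule described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the logic of the CODIS software. The names `parse_locus_name` and `meets_partial_upload_threshold` are hypothetical. Counting against the full 20-locus list and reading "1 in 10 million" as a minimum rarity are also assumptions.

```python
import re
from typing import Iterable, Optional, Set, Tuple

# Core loci as listed in this article, including the January 2017 additions.
CORE_LOCI: Set[str] = set(
    "CSF1PO D3S1358 D5S818 D7S820 D8S1179 D13S317 D16S539 D18S51 D21S11 "
    "FGA TH01 TPOX vWA D1S1656 D2S441 D2S1338 D10S1248 D12S391 D19S433 "
    "D22S1045".split()
)

# D + chromosome + S + order of description. Accepting X or Y as a
# chromosome label is an assumption beyond the examples in the text.
_D_NAME = re.compile(r"D(\d{1,2}|X|Y)S(\d+)")


def parse_locus_name(name: str) -> Optional[Tuple[str, int]]:
    """Return (chromosome, order described) for a D-style name, else None."""
    match = _D_NAME.fullmatch(name)
    if match is None:
        return None  # gene-based name such as TPOX
    return match.group(1), int(match.group(2))


def meets_partial_upload_threshold(
    loci_present: Iterable[str],
    rarity_one_in: float,
    core: Set[str] = CORE_LOCI,
    min_core: int = 8,
    min_rarity_one_in: float = 10_000_000,
) -> bool:
    """Check the two stated conditions: enough core loci and a rare enough profile.

    rarity_one_in is the N in "1 in N"; larger values mean a rarer profile.
    """
    core_count = len(set(loci_present) & set(core))
    return core_count >= min_core and rarity_one_in >= min_rarity_one_in
```

For example, `parse_locus_name("D3S1358")` yields chromosome `"3"` and order `1358`, while a gene-based name such as `"TPOX"` yields `None`. The thresholds are exposed as parameters because the rule is only summarized in the text.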

The loci used in CODIS were chosen because they are in regions of noncoding DNA, sections that do not code for proteins. These sections should not give investigators any additional information about the person, such as hair color, eye color, or race.[15] However, advances in the understanding of genetic markers and ancestry have indicated that the CODIS loci may contain phenotypic information.[16][17]

International use

While the U.S. database is not directly connected to that of any other country, the underlying CODIS software is used by other agencies around the world. As of April 2016, the CODIS software is used by 90 international laboratories in 50 countries.[2][16] International police agencies that want to search the U.S. database can submit a request to the FBI for review. If the request is reasonable and the profile being searched would meet the inclusion standards for a U.S. profile, such as the number of loci, the profile can be searched at the national level or the request forwarded to any state where there is reasonable suspicion that a matching profile may be present at that level of the database.[4]

Controversies

Arrestee collection

Map: Current arrestee collection laws as of 2017. Legend: collection upon conviction only; collection from some felony arrests; collection from all felony arrests.

The original purpose of the CODIS database was to build upon the sex offender registry through the DNA collection of convicted sex offenders.[18] Over time, that scope has expanded. Currently, all 50 states collect DNA from those convicted of felonies. A number of states also collect samples from juveniles as well as from those who are arrested for, but not yet convicted of, a crime.[7][19] The collection of arrestee samples raised constitutional issues, specifically under the Fourth Amendment's prohibition of unreasonable search and seizure. It was argued that collecting DNA from those who had not been convicted of a crime, without an explicit order to collect, was a warrantless search and therefore unlawful.[20] In 2013, the United States Supreme Court ruled in Maryland v. King that the collection of DNA from those arrested for a crime, but not yet convicted, is part of the police booking procedure and therefore a reasonable collection.[21]

Familial searching

The inheritance pattern of DNA means that close relatives share a higher percentage of alleles with each other than with random members of society.[22] This allows for the searching of close matches within CODIS when an exact match is not found. By focusing on close matches, investigators can potentially find a close relative whose profile is in CODIS, narrowing their search to one specific family. Familial searching has led to several convictions after the exhaustion of all other leads, including that of the Grim Sleeper serial killer.[23] This practice also raised Fourth Amendment challenges, as the individual who ends up being charged with a crime was implicated only because someone else's DNA was in the CODIS database.[24] So far, courts have allowed familial searching results to be used and entered into evidence during court proceedings, and as of March 2017, eleven states have approved the use of familial searching in CODIS.[25]

See also

Debbie Smith Act
Integrated Automated Fingerprint Identification System (IAFIS)


