Yaniv Erlich

Yaniv Erlich is an Israeli-American scientist. He formerly served as an Associate Professor of Computer Science at Columbia University and was the Chief Science Officer of MyHeritage. Erlich's work combines computer science and genomics.


Biography

Erlich was born in Israel. He earned a BSc in Brain Sciences in 2006 from Tel Aviv University and a PhD in bioinformatics in 2010 from the Watson School of Biological Sciences at Cold Spring Harbor Laboratory. From 2010 to 2015, Erlich was a Fellow at the Whitehead Institute, MIT. From 2015 to 2019, he led a lab in computational genomics at Columbia University. Since 2020, he has served as CEO of Eleven Therapeutics.


Scientific work


Crowdsourcing genomic information

Erlich's team published a study in the journal Science that reported the crowdsourcing of tens of millions of genealogical records from the website Geni.com. The team was able to create a single family tree of 13 million connected people that spans tens of generations and over 600 years of history. The study used the data to analyze the genetics of longevity and familial dispersion. In a different line of studies, Erlich and Joe Pickrell put together a website called DNA.Land to crowdsource genomic datasets from participants in consumer genomics. The website had collected over 130,000 datasets by November 2018.
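
The idea of a single family tree in which all people are connected can be illustrated with a small sketch. The code below is not the study's pipeline. It is a minimal example, assuming the records are reduced to pairs of related people, that groups people into connected components with a union-find structure and returns the largest one. The function name `largest_connected_tree` and the toy names are hypothetical.

```python
def largest_connected_tree(relations):
    """Return the largest set of people linked by the given relations.

    `relations` is an iterable of (person_a, person_b) pairs, for example
    parent-child or spouse links. This is an illustrative assumption about
    the input format, not the format used in the study.
    """
    parent = {}

    def find(x):
        # Register unseen people as their own root, then walk up to the root.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving keeps trees shallow
            x = parent[x]
        return x

    # Merge the components of every related pair.
    for a, b in relations:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b

    # Collect people by their component root and pick the biggest group.
    groups = {}
    for person in list(parent):
        groups.setdefault(find(person), set()).add(person)
    return max(groups.values(), key=len) if groups else set()


# Toy records: two separate families.
toy_relations = [("ann", "bob"), ("bob", "cat"), ("dan", "eve")]
print(sorted(largest_connected_tree(toy_relations)))
```

On the toy records, the three people linked through "bob" form the largest component, while "dan" and "eve" remain a separate, smaller tree.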


Genetic privacy

The Erlich group published several studies on the subject of genetic privacy. In 2013, they reported the possibility of recovering the surname of a male from his supposedly anonymous genomic dataset, which can lead to tracing his full identity. The technique exploits the co-inheritance of surnames and Y-chromosomes in most societies. Thus, by comparing the Y-chromosome of the person of interest to genetic genealogy databases of Y-chromosomes, it is possible in some cases to infer the surname. The team estimated that 12% of males in the US are subject to successful surname recovery. The team also demonstrated that after recovering the surname, basic demographic identifiers such as age and state of residence can permit tracing back the identity of the individual. To demonstrate the power of the technique, they recovered the identities of multiple 1000 Genomes participants by surname inference. In 2014, Erlich and Arvind Narayanan published a survey of techniques for attacking genomic datasets. They predicted that autosomal searches in GEDmatch could be used to trace back the identity of anonymous people once the GEDmatch user base reached a certain size. This indeed happened in 2018, when the website was used to capture the Golden State Killer. In 2018, the Erlich team published a study in Science that reported that about 60% of US individuals of European descent have at least a third-cousin match in GEDmatch, which can theoretically permit their identification. The study projected that within two to three years, virtually any person in this group could theoretically be traced using this technique if GEDmatch continued to grow at its then-current rate. The team suggested a cryptographic signature technique to reduce the chance that direct-to-consumer websites are misused for police searches.
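
The surname inference described above can be sketched in a few lines. This is a toy illustration and not the published method: it assumes each Y-chromosome is summarized as a profile of short tandem repeat (Y-STR) markers, and it simply lets database entries with near-identical profiles vote for their surname. The function names, thresholds, surnames, and repeat counts are all hypothetical.

```python
from collections import Counter


def count_mismatches(profile_a, profile_b):
    """Compare two Y-STR profiles (dicts of marker name -> repeat count).

    Returns (number of differing markers, number of markers present in both).
    """
    shared = profile_a.keys() & profile_b.keys()
    mismatches = sum(1 for marker in shared if profile_a[marker] != profile_b[marker])
    return mismatches, len(shared)


def infer_surname(query, database, max_mismatches=1, min_shared=3):
    """Return the most common surname among close matches, or None.

    `database` is a list of (surname, profile) pairs. The thresholds are
    arbitrary values chosen for this example.
    """
    votes = Counter()
    for surname, profile in database:
        mismatches, shared = count_mismatches(query, profile)
        if shared >= min_shared and mismatches <= max_mismatches:
            votes[surname] += 1
    if not votes:
        return None
    return votes.most_common(1)[0][0]


# Made-up genealogy database entries and query profile.
toy_database = [
    ("Smith", {"DYS19": 14, "DYS390": 24, "DYS391": 10, "DYS393": 13}),
    ("Smith", {"DYS19": 14, "DYS390": 24, "DYS391": 11, "DYS393": 13}),
    ("Jones", {"DYS19": 15, "DYS390": 22, "DYS391": 10, "DYS393": 12}),
]
toy_query = {"DYS19": 14, "DYS390": 24, "DYS391": 10, "DYS393": 13}
print(infer_surname(toy_query, toy_database))
```

In this toy case both "Smith" entries fall within one mismatch of the query, so "Smith" is returned. A profile with no close match yields `None`, which mirrors the point that surname recovery succeeds only in some cases.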

