Genome sequencing of endangered species is the application of next-generation sequencing (NGS) technologies in the field of conservation biology, with the aim of generating life history, demographic and phylogenetic data of relevance to the management of endangered wildlife.


Background

In the context of conservation biology, genomic technologies such as large-scale DNA sequencing can be used to highlight aspects of the biology of wildlife species for which management actions may be required. This may involve estimating recent demographic events, genetic variation, divergence between species and population structure. Genome-wide association studies (GWAS) are useful for examining the role of natural selection at the genome level and for identifying loci associated with fitness, local adaptation, inbreeding depression or disease susceptibility. Access to these data and the interrogation of genome-wide variation at SNP markers can help identify the genetic changes that influence the fitness of wild species, and are also important for evaluating their potential to respond to changing environments. NGS projects are expected to rapidly increase the number of threatened species for which assembled genomes and detailed information on sequence variation are available, and the data will advance investigations relevant to the conservation of biological diversity.


Methodology


Non-computational methods

The traditional approaches to the preservation of endangered species are captive breeding and private farming. In some cases these methods have produced good results, but problems remain. For example, when only a few individuals are bred, the gene pool of a subpopulation remains limited or may shrink.


Phylogenetic analysis and gene family estimation

Genetic analyses can remove subjective elements from the determination of the phylogenetic relationships between organisms. Given the great variety of information provided by living organisms, the type of data affects both the method of treatment and the validity of the results: the more closely the data correlate with genotype, the more valid the results are likely to be. Data analysis can be used to compare different sequence databases and find similar sequences, or similar proteins, in different species. The comparison can be carried out with alignment-based software to estimate the divergence between species and evaluate their similarities.
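As a minimal sketch of how divergence can be read from an alignment, the following Python example computes the proportion of differing sites (the p-distance) between sequences that are assumed to be already aligned. The function name `p_distance`, the toy sequences and the rule of skipping gaps and ambiguous bases are illustrative assumptions, not the procedure of any particular study or tool.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of differing sites between two pre-aligned sequences.

    Columns with a gap ('-') or an ambiguous base ('N') in either
    sequence are skipped (an assumption of this sketch).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    mismatches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in "-N" or b in "-N":
            continue  # column not comparable
        compared += 1
        if a != b:
            mismatches += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return mismatches / compared


# Hypothetical toy alignment (not real data)
alignment = {
    "species_A": "ATGCCGTA-GCTA",
    "species_B": "ATGCTGTAAGCTA",
    "species_C": "ATGATGTA-GTTA",
}

names = sorted(alignment)
for i, first in enumerate(names):
    for second in names[i + 1:]:
        d = p_distance(alignment[first], alignment[second])
        print(f"{first} vs {second}: p-distance = {d:.3f}")
```

In practice the alignment itself would come from dedicated software, and raw p-distances are usually corrected with a substitution model before they are used to build trees.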


NGS/Advanced sequencing methodologies

Because whole-genome sequencing is generally very data-intensive, reduced-representation genomic approaches are sometimes used in practical applications. Examples include restriction site-associated DNA sequencing (RADseq) and double digest RADseq (ddRADseq). With these techniques researchers can target different numbers of loci. Combined with statistical and bioinformatic analysis, they allow inferences about large genomes from a small representative fraction of the sequence.
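The idea of sampling a genome through restriction sites can be illustrated with an in silico digest. The sketch below, in Python, locates the recognition sequences of two enzymes (EcoRI, `GAATTC`, and MspI, `CCGG`) in a made-up sequence and keeps the fragments that are flanked by one site of each enzyme and fall within a size window, which is the selection principle behind double digest RADseq. The function names, the toy genome and the size window are assumptions, and for simplicity the start of the recognition site is used as the cut position.

```python
import re


def site_positions(genome: str, site: str) -> list[int]:
    """Start positions of every occurrence of a recognition site."""
    # A lookahead is used so that overlapping occurrences are also found.
    pattern = f"(?={re.escape(site)})"
    return [m.start() for m in re.finditer(pattern, genome.upper())]


def ddrad_fragments(genome, site_a, site_b, min_len, max_len):
    """Fragments between adjacent cuts made by two different enzymes.

    Simplification: the cut is placed at the start of the site.
    """
    cuts = sorted(
        [(p, "A") for p in site_positions(genome, site_a)]
        + [(p, "B") for p in site_positions(genome, site_b)]
    )
    fragments = []
    for (start, enz_1), (end, enz_2) in zip(cuts, cuts[1:]):
        length = end - start
        if enz_1 != enz_2 and min_len <= length <= max_len:
            fragments.append((start, end))
    return fragments


# Hypothetical toy sequence with two EcoRI and two MspI sites
genome = (
    "ACGT" * 5 + "GAATTC" + "ACGT" * 10 + "CCGG"
    + "ACGT" * 3 + "CCGG" + "ACGT" * 20 + "GAATTC" + "ACGT" * 2
)

print("EcoRI sites:", site_positions(genome, "GAATTC"))
print("MspI sites:", site_positions(genome, "CCGG"))
print("Selected fragments:", ddrad_fragments(genome, "GAATTC", "CCGG", 40, 60))
```

Changing the enzyme pair or the size window changes how many fragments are retained, which is how the number of targeted loci can be tuned.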


Statistical and computational methods

When addressing biological problems, researchers encounter multiple types of genomic data, or aggregates of the same type of data across multiple studies, and interpreting such large amounts of data manually is infeasible. Integrated analysis of genomic data using statistical methods has therefore become popular. Rapid advances in high-throughput technologies allow researchers to answer more complex biological questions, enabling the development of statistical methods in integrated genomics to establish more effective therapeutic strategies for human disease.


Genome crucial features

When studying a genome, several crucial features should be taken into consideration. Gene prediction is the identification of genetic elements in a genomic sequence. It relies on a combination of approaches: de novo prediction, homology-based prediction, and transcript-based prediction. Tools such as EvidenceModeler are used to merge the different results. Gene structures have also been compared, including mRNA length, exon length, intron length, exon number, and non-coding RNA. Analysis of repeated sequences has been found useful in reconstructing species divergence timelines.


Application and case studies


Genomic approach in sex determination

To preserve a species, knowledge of its mating system is crucial: scientists can stabilize wild populations through captive breeding, followed by the release of new individuals into the environment. This task is particularly difficult for species with homomorphic sex chromosomes and a large genome. In amphibians, for example, there have been multiple transitions between male and female heterogamety. Variation in sex chromosomes has sometimes even been reported within amphibian populations of the same species.


Japanese giant salamander

The multiple transitions between XY and ZW systems that occur in amphibians make sex chromosome systems labile in salamander populations. By understanding the chromosomal basis of sex in these species, it is possible to reconstruct the phylogenetic history of these families and use more efficient strategies in their conservation. Using the ddRADseq method, scientists found new sex-related loci in a 56 Gb genome of the family Cryptobranchidae. Their results support the hypothesis of female heterogamety in this species. These loci were confirmed through bioinformatic analysis of the presence or absence of each locus in individuals of known sex, whose sex had been established previously by ultrasound, laparoscopy and measurement of serum calcium level differences. The candidate sex-linked loci were determined so as to test hypotheses of both female heterogamety and male heterogamety. Finally, to evaluate the validity of these loci, they were amplified by PCR directly from samples of known-sex individuals. This final step demonstrated female heterogamety in several divergent populations of the family Cryptobranchidae.
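The presence/absence test described above can be sketched in a few lines of Python. A locus found in every female and in no male is consistent with female heterogamety (a W-linked locus in a ZW system), while the reverse pattern is consistent with male heterogamety (a Y-linked locus in an XY system). The function name, the sample labels and the strict all-or-none criterion are assumptions made for illustration; the filtering rules of the original study are not given here.

```python
def sex_specific_loci(presence, sex):
    """Classify loci by their presence/absence pattern across sexed individuals.

    presence: dict mapping locus name -> set of individuals carrying the locus
    sex:      dict mapping individual -> "F" or "M"
    """
    females = {ind for ind, s in sex.items() if s == "F"}
    males = {ind for ind, s in sex.items() if s == "M"}
    result = {"female_specific": [], "male_specific": []}
    for locus in sorted(presence):
        # Ignore carriers whose sex is unknown
        carriers = presence[locus] & (females | males)
        if females and carriers == females:
            result["female_specific"].append(locus)  # consistent with ZW
        elif males and carriers == males:
            result["male_specific"].append(locus)  # consistent with XY
    return result


# Hypothetical individuals and loci (not real data)
sex = {"ind1": "F", "ind2": "F", "ind3": "F", "ind4": "M", "ind5": "M"}
presence = {
    "locus_001": {"ind1", "ind2", "ind3"},
    "locus_002": {"ind1", "ind2", "ind3", "ind4", "ind5"},
    "locus_003": {"ind1", "ind4"},
    "locus_004": {"ind1", "ind2", "ind3"},
}

print(sex_specific_loci(presence, sex))
```

With real sequencing data, missing reads can make a locus appear absent, so a tolerance for a few discordant individuals and a follow-up PCR validation, as in the study described above, are usually needed.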


Genomic approach in genetic variability


Dryas monkey and golden snub-nosed monkey

A recent study used whole-genome sequencing data to demonstrate the sister-lineage relationship between the Dryas monkey and the vervet monkey, and their divergence with additional bidirectional gene flow approximately 750,000 to 500,000 years ago. Although fewer than 250 adult individuals remain, the study showed high genetic diversity and low levels of inbreeding and genetic load in the studied Dryas monkey individuals. Another study used several techniques, including single-molecule real-time sequencing, paired-end sequencing, optical maps, and high-throughput chromosome conformation capture, to obtain a high-quality chromosome-level assembly from an existing incomplete and fragmented genome assembly of the golden snub-nosed monkey. The modern techniques used in this study yielded a 100-fold improvement in the genome assembly, with 22,497 protein-coding genes, the majority of which were functionally annotated. The reconstructed genome showed a close relationship between the species and the rhesus macaque, indicating a divergence approximately 13.4 million years ago.
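Genetic diversity and inbreeding are often summarized with heterozygosity statistics. The Python sketch below computes observed heterozygosity (Ho), expected heterozygosity under Hardy-Weinberg proportions (He), and the inbreeding coefficient F = 1 - Ho/He from biallelic SNP genotypes coded as the number of alternate alleles (0, 1 or 2). The function name and the genotype matrix are hypothetical, and this is a generic textbook estimator rather than the method used in the studies described above.

```python
def diversity_summary(genotypes):
    """Return (Ho, He, F) for a matrix of biallelic SNP genotypes.

    genotypes: list of rows, one per individual; each entry is the
    number of alternate alleles at a locus (0, 1 or 2).
    """
    n_ind = len(genotypes)
    n_loci = len(genotypes[0])
    ho_sum = 0.0
    he_sum = 0.0
    for j in range(n_loci):
        calls = [row[j] for row in genotypes]
        # Observed fraction of heterozygous individuals at this locus
        ho_sum += sum(1 for g in calls if g == 1) / n_ind
        # Alternate allele frequency and expected heterozygosity 2p(1 - p)
        p = sum(calls) / (2 * n_ind)
        he_sum += 2 * p * (1 - p)
    ho = ho_sum / n_loci
    he = he_sum / n_loci
    f = 1 - ho / he if he > 0 else float("nan")
    return ho, he, f


# Hypothetical genotypes: 4 individuals x 4 loci (not real data)
genotypes = [
    [0, 1, 2, 1],
    [1, 1, 0, 2],
    [0, 2, 1, 1],
    [1, 0, 1, 0],
]

ho, he, f = diversity_summary(genotypes)
print(f"Ho = {ho:.3f}, He = {he:.3f}, F = {f:.3f}")
```

An F close to zero or slightly negative indicates no excess of homozygotes, whereas clearly positive values point to inbreeding. Whole-genome studies typically complement such summaries with other measures, for example runs of homozygosity.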


Genomic approach in preservation


Plants

Plant species identified as PSESP ("plant species with extremely small populations") have been the focus of genomic studies aimed at determining the most endangered populations. Genomic DNA can be extracted from fresh leaves and sequenced. Combining different sequencing techniques can yield high-quality data for assembling the genome. RNA extraction is essential for transcriptome assembly, and the extraction starts from stems, roots, fruits, buds and leaves. The ''de novo'' genome assembly can be performed using software that optimizes assembly and scaffolding. Software can also be used to fill gaps and reduce the interaction between chromosomes. The combined data can be used to identify orthologous genes across species, construct phylogenetic trees, and compare genomes between species.


Limits and future perspectives

The development of indirect sequencing methods to some degree mitigated the earlier lack of efficient DNA sequencing technologies. These techniques allowed researchers to increase scientific knowledge in fields such as ecology and evolution. Several genetic markers, more or less well suited to the purpose, were developed, helping researchers to address many issues, including demography and mating systems, population structure and phylogeography, speciation processes and species differences, hybridization and introgression, and phylogenetics at many temporal scales. However, all these approaches shared a primary deficiency: they were limited to a fraction of the genome, so that genome-wide parameters were inferred from a tiny amount of genetic material. The invention and rise of DNA sequencing methods greatly increased the data available to conservation biology. The ongoing development of cheaper, high-throughput sequencing has produced a wide array of information in several disciplines, providing conservation biologists with a powerful databank from which to extract useful information about, for example, population structure, genetic connectivity, and potential risks due to demographic changes and inbreeding, through population-genomic approaches that rely on the detection of SNPs, indels or CNVs. On the one hand, data derived from high-throughput sequencing of whole genomes are potentially a massive advance for species conservation, opening the door to future challenges and opportunities. On the other hand, these data confront researchers with two main issues: first, how to process all this information; second, how to translate it into conservation strategies and practice or, in other words, how to close the gap between genomic research and conservation application. There are many analytical and practical problems to consider when using approaches involving genome-wide sequencing.
Availability of samples is a major limiting factor: sampling procedures may disturb an already fragile population or may have a large impact on individual animals, which limits sample collection. For these reasons several alternative strategies were developed. Constant monitoring, for example with radio collars, makes it possible to understand behaviour and to develop strategies for obtaining genetic samples and managing endangered populations. Samples taken from these species are then used to produce primary cell cultures from biopsies. This material allows cells to be grown in vitro and genetic material to be extracted and studied without repeatedly sampling the endangered populations. Despite faster and easier data production and continuous improvement of sequencing technologies, data analysis and processing techniques still lag markedly behind. Genome-wide analyses and studies of large genomes require advances in bioinformatics and computational biology. At the same time, improvements in statistical software and in population genetics are required to design better conservation strategies. This last aspect works in parallel with prediction strategies, which should take into consideration all features that determine the fitness of a species.


See also

* Endangered species

