
Multiomics, multi-omics, integrative omics, "panomics" or "pan-omics" is a biological analysis approach in which the data consists of multiple "
omes", such as the
genome
A genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding genes, other functional regions of the genome such as ...
,
epigenome
In biology, the epigenome of an organism is the collection of chemical changes to its DNA and histone proteins that affects when, where, and how the DNA is expressed; these changes can be passed down to an organism's offspring via transgenerat ...
,
transcriptome
The transcriptome is the set of all RNA transcripts, including coding and non-coding, in an individual or a population of cells. The term can also sometimes be used to refer to all RNAs, or just mRNA, depending on the particular experiment. The ...
,
proteome
A proteome is the entire set of proteins that is, or can be, expressed by a genome, cell, tissue, or organism at a certain time. It is the set of expressed proteins in a given type of cell or organism, at a given time, under defined conditions. P ...
,
metabolome
The metabolome refers to the complete set of small-molecule chemicals found within a biological sample. The biological sample can be a cell, a cellular organelle, an organ, a tissue, a tissue extract, a biofluid or an entire organism. The ...
,
exposome, and
microbiome
A microbiome () is the community of microorganisms that can usually be found living together in any given habitat. It was defined more precisely in 1988 by Whipps ''et al.'' as "a characteristic microbial community occupying a reasonably wel ...
(i.e., a
meta-genome and/or
meta-transcriptome, depending upon how it is sequenced);
in other words, ''the use of multiple
omics
Omics is the collective characterization and quantification of entire sets of biological molecules and the investigation of how they translate into the structure, function, and dynamics of an organism or group of organisms. The branches of scien ...
technologies to study life in a concerted way''. By combining these "omes", scientists can analyze complex biological
big data
Big data primarily refers to data sets that are too large or complex to be dealt with by traditional data processing, data-processing application software, software. Data with many entries (rows) offer greater statistical power, while data with ...
to find novel associations between biological entities, pinpoint relevant
biomarkers
In biomedical contexts, a biomarker, or biological marker, is a measurable indicator of some biological state or condition. Biomarkers are often measured and evaluated using blood, urine, or soft tissues to examine normal biological processes, p ...
and build elaborate markers of disease and physiology. In doing so, multiomics integrates diverse omics data to find a coherently matching geno-pheno-envirotype relationship or association. The OmicTools service lists more than 99 pieces of software related to multiomic data analysis, as well as more than 99 databases on the topic.
Systems biology
Systems biology is the computational modeling, computational and mathematical analysis and modeling of complex biological systems. It is a biology-based interdisciplinary field of study that focuses on complex interactions within biological system ...
approaches are often based upon the use of multiomic analysis data. The
American Society of Clinical Oncology
The American Society of Clinical Oncology (ASCO) is a professional organization representing physicians of all oncology sub-specialties who care for people with cancer. Founded in 1964 by Fred Ansfield, Harry Bisel, Herman Freckman, Arnoldus G ...
(ASCO) defines panomics as referring to "the interaction of all biological
functions within a cell and with other body functions, combining data collected by targeted tests ... and global assays (such as genome sequencing) with other patient-specific information."
Combined multiomic data collection
Combined multiomic data collection approaches have evolved to address the limitations of traditional multiomics research, which typically requires separate sample processing for different molecular classes then subsequent computational integration, introducing variability and increasing costs. Early advances in this field include sequential extraction, TRIzol-based sequential isolation methods, which demonstrated that a reagent traditionally used for RNA isolation could simultaneously extract DNA, RNA, proteins, metabolites, and lipids from a single sample. Similar approaches like the Metabolite, Protein, and Lipid extraction (MPLEx) and the "Three-in-One" method adapted biphasic fractionation to extract proteins, metabolites, and lipids for LC-MS/MS analysis. More recent technological developments include the Multi-Omic Single-Shot Technology (MOST), which integrates proteome and lipidome analysis in a single LC-MS run using one reverse-phase column and a binary mobile phase system, and the Bead-enabled Accelerated Monophasic Multi-omics (BAMM) method that combines n-butanol-based monophasic extraction with magnetic beads and accelerated protein digestion for the separate analysis of metabolites, lipids, and proteins. One of the most comprehensive technologies in this space is Dalton Bioanalytics Inc.'s Omni-MSĀ®, a multiomic assay that uses its proprietary method to simultaneously profile proteins, lipids, electrolytes, metabolites, and other small molecules in a single preparation and single LC-MS analysis. This platform has been applied to biomarker discovery, identifying potential biomarkers across multiple molecular classes and across various conditions and diseases including COVID severity during pregnancy, 22q11.2 deletion syndrome, and hereditary angioedema. These integrated approaches significantly reduce sample requirements, processing time, and technical variation while improving correlation analysis across different molecular classes, making them increasingly valuable for precision medicine and systems biology research.
Single-cell multiomics
A branch of the field of multiomics is the analysis of multilevel
single-cell data, called single-cell multiomics.
This approach gives us an unprecedented resolution to look at multilevel transitions in health and disease at the single cell level. An advantage in relation to bulk analysis is to mitigate confounding factors derived from cell to cell variation, allowing the uncovering of heterogeneous tissue architectures.
Methods for parallel single-cell genomic and transcriptomic analysis can be based on simultaneous amplification or physical separation of RNA and genomic DNA. They allow insights that cannot be gathered solely from transcriptomic analysis, as RNA data do not contain
non-coding genomic regions and information regarding
copy-number variation
Copy number variation (CNV) is a phenomenon in which sections of the genome are repeated and the number of repeats in the genome varies between individuals. Copy number variation is a type of structural variation: specifically, it is a type of ...
, for example. An extension of this methodology is the integration of single-cell transcriptomes to single-cell methylomes, combining single-cell
bisulfite sequencing to single cell RNA-Seq. Other techniques to query the epigenome, as single-cell
ATAC-Seq
ATAC-seq (Assay for Transposase-Accessible Chromatin using sequencing) is a laboratory technique used in molecular biology to assess genome-wide chromatin, chromatin accessibility. The technique was first described in 2013 as an alternative approa ...
and single-cell
Hi-C also exist.
A different, but related, challenge is the integration of proteomic and transcriptomic data.
One approach to perform such measurement is to physically separate single-cell lysates in two, processing half for RNA, and half for proteins.
The protein content of lysates can be measured by proximity extension assays (PEA), for example, which use DNA-barcoded antibodies. A different approach uses a combination of heavy-metal RNA probes and protein antibodies to adapt
mass cytometry
Mass cytometry is a mass spectrometry technique based on inductively coupled plasma mass spectrometry and time of flight mass spectrometry used for the determination of the properties of cells (cytometry). In this approach, antibodies are Conjugat ...
for multiomic analysis.
Related to Single-cell multiomics is the field of Spatial Omics which assays tissues through omics readouts that preserve the relative spatial orientation of the cells in the tissue. The number of Spatial Omics methods published still lags behind the number of methods published for Single-Cell multiomics, but the numbers are catching up
Single-cell and Spatial methods.
Multiomics and machine learning
In parallel to the advances in high-throughput biology,
machine learning
Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...
applications to biomedical data analysis are flourishing. The integration of multi-omics data analysis and machine learning has led to the discovery of new
biomarker
In biomedical contexts, a biomarker, or biological marker, is a measurable indicator of some biological state or condition. Biomarkers are often measured and evaluated using blood, urine, or soft tissues to examine normal biological processes, ...
s. For example, one of the methods of th
mixOmicsproject implements a method based on sparse
Partial Least Squares
Partial least squares (PLS) regression is a statistics, statistical method that bears some relation to principal component regression, principal components regression and is a reduced rank regression; instead of finding hyperplanes of maximum vari ...
regression for selection of features (putative biomarkers).
A unified and flexible statistical framewok for heterogeneous data integration called "Regularized Generalized Canonical Correlation Analysis" (RGCCA ) enables identifying such putative biomarkers. This framework is implemented and made freely available within th
RGCCA R package.
Multiomics in health and disease

Multiomics currently holds a promise to fill gaps in the understanding of human health and disease, and many researchers are working on ways to generate and analyze disease-related data. The applications range from understanding host-pathogen interactions and infectious diseases, cancer, to understanding better chronic and complex
non-communicable disease
A non-communicable disease (NCD) is a disease that is not transmission (medicine), transmissible directly from one person to another. NCDs include Parkinson's disease, autoimmune diseases, strokes, heart diseases, cancers, Diabetes mellitus, diab ...
s and improving personalized medicine.
Integrated Human Microbiome Project
The second phase of the $170 million
Human Microbiome Project
The Human Microbiome Project (HMP) was a United States National Institutes of Health (NIH) research initiative to improve understanding of the microbiota involved in human health and disease. Launched in 2007, the first phase (HMP1) focused on i ...
was focused on integrating patient data to different omic datasets, considering host genetics, clinical information and microbiome composition. The phase one focused on characterization of communities in different body sites. Phase 2 focused in the integration of multiomic data from host &
microbiome
A microbiome () is the community of microorganisms that can usually be found living together in any given habitat. It was defined more precisely in 1988 by Whipps ''et al.'' as "a characteristic microbial community occupying a reasonably wel ...
to human diseases. Specifically, the project used multiomics to improve the understanding of the interplay of gut and nasal microbiomes with
type 2 diabetes
Type 2 diabetes (T2D), formerly known as adult-onset diabetes, is a form of diabetes mellitus that is characterized by high blood sugar, insulin resistance, and relative lack of insulin. Common symptoms include increased thirst, frequent ...
, gut microbiomes and inflammatory bowel disease and vaginal microbiomes and pre-term birth.
Systems Immunology
The complexity of interactions in the human
immune system
The immune system is a network of biological systems that protects an organism from diseases. It detects and responds to a wide variety of pathogens, from viruses to bacteria, as well as Tumor immunology, cancer cells, Parasitic worm, parasitic ...
has prompted the generation of a wealth of immunology-related multi-scale omic data. Multi-omic data analysis has been employed to gather novel insights about the immune response to infectious diseases, such as pediatric
chikungunya
Chikungunya is an infection caused by the chikungunya virus (CHIKV). The disease was first identified in 1952 in Tanzania and named based on the Kimakonde words for "to become contorted". Chikungunya has become a global health concern due to ...
, as well as noncommunicable
autoimmune disease
An autoimmune disease is a condition that results from an anomalous response of the adaptive immune system, wherein it mistakenly targets and attacks healthy, functioning parts of the body as if they were foreign organisms. It is estimated tha ...
s. Integrative omics has also been employed strongly to understand effectiveness and side effects of
vaccine
A vaccine is a biological Dosage form, preparation that provides active acquired immunity to a particular infectious disease, infectious or cancer, malignant disease. The safety and effectiveness of vaccines has been widely studied and verifi ...
s, a field called systems vaccinology. For example, multiomics was essential to uncover the association of changes in plasma metabolites and immune system transcriptome on response to vaccination against
herpes zoster
Shingles, also known as herpes zoster or zona, is a viral disease characterized by a painful skin rash with blisters in a localized area. Typically the rash occurs in a single, wide mark either on the left or right side of the body or face. T ...
.
List of software used for multi-omic analysis
The
Bioconductor project curates a variety of R packages aimed at integrating omic data:
omicade4 for multiple co-inertia analysis of multi omic datasets
offering a bioconductor interface for overlapping samples
a package focused on using multi omic data for evaluating
alternative splicing
Alternative splicing, alternative RNA splicing, or differential splicing, is an alternative RNA splicing, splicing process during gene expression that allows a single gene to produce different splice variants. For example, some exons of a gene ma ...
bioCancer a package for visualization of multiomic cancer data
a suite of multivariate methods for data integration
a package for encapsulating multiple data sets
Th
RGCCA packageimplements a versatile framework for data integration. This package is freely available on th
Comprehensive R Archive Network (CRAN)
The OmicTools
database further highlights R packages and othertools for multi omic data analysis:
PaintOmics a web resource for visualization of multi-omics datasets
* SIGMA, a Java program focused on integrated analysis of cancer datasets
* iOmicsPASS, a tool in C++ for multiomic-based phenotype prediction
Grimon an R graphical interface for visualization of multiomic data
Omics Pipe a framework in Python for reproducibly automating multiomic data analysis
Multiomic Databases
A major limitation of classical omic studies is the isolation of only one level of biological complexity. For example, transcriptomic studies may provide information at the transcript level, but many different entities contribute to the biological state of the sample (
genomic variants,
post-translational modification
In molecular biology, post-translational modification (PTM) is the covalent process of changing proteins following protein biosynthesis. PTMs may involve enzymes or occur spontaneously. Proteins are created by ribosomes, which translation (biolog ...
s, metabolic products, interacting organisms, among others). With the advent of
high-throughput biology
High throughput biology (or high throughput cell biology) is the use of automation equipment with classical cell biology techniques to address biological questions that are otherwise unattainable using conventional methods. It may incorporate tec ...
, it is becoming increasingly affordable to make multiple measurements, allowing transdomain (e.g. RNA and protein levels) correlations and inferences. These correlations aid the construction or more complete
biological network
A biological network is a method of representing systems as complex sets of binary interactions or relations between various biological entities. In general, networks or graphs are used to capture relationships between entities or objects. A typ ...
s, filling gaps in our knowledge.
Integration of data, however, is not an easy task. To facilitate the process, groups have curated database and pipelines to systematically explore multiomic data:
*
Multi-Omics Profiling Expression Database (MOPED), integrating diverse animal models,
* The Pancreatic Expression Database, integrating data related to
pancreatic tissue,
LinkedOmics connecting data from
TCGA cancer datasets,
* OASIS, a web-based resource for general cancer studies,
* BCIP, a platform for
breast cancer
Breast cancer is a cancer that develops from breast tissue. Signs of breast cancer may include a Breast lump, lump in the breast, a change in breast shape, dimpling of the skin, Milk-rejection sign, milk rejection, fluid coming from the nipp ...
studies,
* C/VDdb, connecting data from several cardiovascular disease studies,
* ZikaVR, a multiomic resource for
Zika virus
Zika virus (ZIKV; pronounced or ) is a member of the virus family ''Flaviviridae''. It is spread by daytime-active ''Aedes'' mosquitoes, such as '' A. aegypti'' and '' A. albopictus''. Its name comes from the Ziika Forest of Uganda, where ...
data
* Ecomics, a normalized multi-omic database for ''
Escherichia coli
''Escherichia coli'' ( )Wells, J. C. (2000) Longman Pronunciation Dictionary. Harlow ngland Pearson Education Ltd. is a gram-negative, facultative anaerobic, rod-shaped, coliform bacterium of the genus '' Escherichia'' that is commonly fo ...
'' data,
* GourdBase, integrating data from studies with
gourd
Gourds include the fruits of some flowering plant species in the family Cucurbitaceae, particularly '' Cucurbita'' and '' Lagenaria''. The term refers to a number of species and subspecies, many with hard shells, and some without. Many gourds ha ...
,
* MODEM, a database for multilevel
maize
Maize (; ''Zea mays''), also known as corn in North American English, is a tall stout grass that produces cereal grain. It was domesticated by indigenous peoples in southern Mexico about 9,000 years ago from wild teosinte. Native American ...
data,
* SoyKB, a database for multilevel
soybean
The soybean, soy bean, or soya bean (''Glycine max'') is a species of legume native to East Asia, widely grown for its edible bean. Soy is a staple crop, the world's most grown legume, and an important animal feed.
Soy is a key source o ...
data,
ProteomicsDB a multi-omics and multi-organism resource for life science research
See also
*
DisGeNET
*
Pangenomics
*
Hologenomics
*
Omics
Omics is the collective characterization and quantification of entire sets of biological molecules and the investigation of how they translate into the structure, function, and dynamics of an organism or group of organisms. The branches of scien ...
**
List of omics topics in biology
Inspired by the terms genome and genomics, other words to describe complete Biology, biological datasets, mostly sets of biomolecules originating from one organism, have been coined with the suffix ''-ome'' and ''-omics''. Some of these terms are ...
*
Systems Biology
Systems biology is the computational modeling, computational and mathematical analysis and modeling of complex biological systems. It is a biology-based interdisciplinary field of study that focuses on complex interactions within biological system ...
*
Network Medicine
Network medicine is the application of network science towards identifying, preventing, and treating diseases. This field focuses on using network topology and network dynamics towards identifying diseases and developing medical drugs. Biological ...
References
{{reflist
Biology theories
Molecular biology