PrecisionFDA
   HOME

TheInfoList



OR:

PrecisionFDA (stylized precisionFDA) is a secure, collaborative, high-performance computing platform that has established a growing community of experts around the analysis of biological datasets in order to advance
precision medicine Precision, precise or precisely may refer to: Science, and technology, and mathematics Mathematics and computing (general) * Accuracy and precision, measurement deviation from true value and its scatter * Significant figures, the number of digit ...
, inform
regulatory science Regulatory science is the scientific and technical foundations upon which regulations are based in various industries – particularly those involving health or safety. Regulatory bodies employing such principles in the United States include, for ex ...
, and enable improvements in health outcomes. This
cloud-based Cloud computing is the on-demand availability of computer system resources, especially data storage (cloud storage) and computing power, without direct active management by the user. Large clouds often have functions distributed over multip ...
platform is developed and served by the
United States The United States of America (U.S.A. or USA), commonly known as the United States (U.S. or US) or America, is a country primarily located in North America. It consists of 50 states, a federal district, five major unincorporated territorie ...
Food and Drug Administration The United States Food and Drug Administration (FDA or US FDA) is a List of United States federal agencies, federal agency of the United States Department of Health and Human Services, Department of Health and Human Services. The FDA is respon ...
(FDA). PrecisionFDA connects experts, citizen scientists, and scholars from around the world and provides them with a library of computational tools, workflow features, and reference data. The platform allows researchers to upload and compare data against
reference genome A reference genome (also known as a reference assembly) is a digital nucleic acid sequence database, assembled by scientists as a representative example of the set of genes in one idealized individual organism of a species. As they are assemble ...
s, and execute
bioinformatic Bioinformatics () is an interdisciplinary field that develops methods and software tools for understanding biological data, in particular when the data sets are large and complex. As an interdisciplinary field of science, bioinformatics combine ...
pipelines. The variant call file (VCF) comparator tool also enables users to compare their genetic test results to reference genomes. The platform's code is open source and available on
GitHub GitHub, Inc. () is an Internet hosting service for software development and version control using Git. It provides the distributed version control of Git plus access control, bug tracking, software feature requests, task management, continuous ...
. The platform also features a
crowdsourcing Crowdsourcing involves a large group of dispersed participants contributing or producing goods or services—including ideas, votes, micro-tasks, and finances—for payment or as volunteers. Contemporary crowdsourcing often involves digita ...
model to sponsor community challenges in order to stimulate the development of innovative analytics that inform precision medicine and regulatory science. Community members from around the world come together to participate in scientific challenges, solving problems that demonstrate the effectiveness of their tools, testing the capabilities of the platform, sharing their results, and engaging the community in discussions. Globally, precisionFDA has more than 5,000 users. The precisionFDA team collaborates with multiple FDA Centers, the National Institutes of Health, and other government agencies to support the vision and intent of the American Innovation & Competitiveness Act and the
21st Century Cures Act The 21st Century Cures Act is a United States law enacted by the 114th United States Congress in December 2016 and then signed into law on December 13, 2016. It authorized $6.3 billion in funding, mostly for the National Institutes of Health. The ...
.


History

President Barack Obama announced the formation of the
Precision Medicine Initiative The All of Us Research Program (previously known as the Precision Medicine Initiative Cohort Program) is a research program created in 2015 during the tenure of Barack Obama with $130 million in funding that aims to make advances in tailoring med ...
during the
State of the Union Address The State of the Union Address (sometimes abbreviated to SOTU) is an annual message delivered by the president of the United States to a joint session of the United States Congress near the beginning of each calendar year on the current conditio ...
in January 2015. In August 2015, the FDA announced the launch of precisionFDA as a part of the initiative. In November 2015, the FDA launched a "closed beta" version of the platform, giving select groups and individuals access to the platform. An open beta version of the platform was released in December 2015. In February 2016, the FDA announced the first precisionFDA challenge, th
Consistency Challenge
which tasked users with testing the reliability and reproducibility of gene mapping and variant calling tools. Th
Truth Challenge
followed the Consistency Challenge and asked participants to assess the accuracy of bioinformatics tools for identifying genetic variants. Th
Hidden Treasures – Warm Up
challenge evaluated variant calling pipelines on a targeted set of ''in silico'' injected variants. Th
CFSAN Pathogen Detection Challenge
evaluated bioinformatics pipelines for accurate and rapid detection of foodborne pathogens in metagenomics samples. Th
CDRH ID-NGS Diagnostics Biothreat Challenge
addressed the issue of early detection during pathogen outbreaks by evaluating algorithms for identifying and quantifying emerging pathogens, such as the Ebola virus, from their genomic fingerprints. Subsequent challenges expanded beyond genomics into multi-omics and other data types. Th
NCI-CPTAC Multi-omics Enabled Sample Mislabeling Correction Challenge
addressed the issue of sample mislabeling, which contributes to irreproducible research results and invalid conclusions, by evaluating algorithms for accurate detection and correction of mislabeled samples using multi-omics to enabl
Rigor and Reproducibility
in biomedical research. Th
Brain Cancer Predictive Modeling and Biomarker Discovery Challenge
run in collaboration with Georgetown University, asked participants to develop
machine learning Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence. Machine ...
(ML) and
artificial intelligence Artificial intelligence (AI) is intelligence—perceiving, synthesizing, and inferring information—demonstrated by machines, as opposed to intelligence displayed by animals and humans. Example tasks in which this is done include speech re ...
(AI) models to identify biomarkers and predict brain cancer patient outcomes using gene expression, DNA copy number, and clinical data. Th
Gaining New Insights by Detecting Adverse Event Anomalies Using FDA Open Data Challenge
engaged data scientists to use unsupervised ML and AI techniques to identify anomalies in FDA adverse events, regulated product substances, and clinical trials data, essential for improving the mission of FDA. Th
Truth Challenge V2
assessed variant calling pipeline performance in difficult-to-map regions, segmental duplications, and Major Histocompatibility Complex (HMC) using Genome in a Bottle human genome benchmarks. Th
COVID-19 Risk Factor Modeling Challenge
in collaboration with the
Veterans Health Administration The Veterans Health Administration (VHA) is the component of the United States Department of Veterans Affairs (VA) led by the Under Secretary of Veterans Affairs for Health that implements the healthcare program of the VA through a national ...
, called upon the scientific and analytics community to develop and evaluate computational models to predict COVID-19 related health outcomes in Veterans. In total, ten community challenges have been completed on precisionFDA, which have generated a total of 562 responses from 240 participants. PrecisionFDA challenges have led to meaningful regulatory science advancements, including published best practices for benchmarking germline small-variant calls in human genomes. In addition, the challenges have incentivized the development and benchmarking of novel computational pipelines, including a pipeline that uses deep neural networks to identify genetic variants. In addition to challenges, in-person and virtual app-a-thon events, which promote the development and sharing of apps and tools, are hosted on precisionFDA. In August 2016, precisionFDA launche
App-a-Thon in a Box
which aimed to encourage the creation and sharing of Next Generation Sequencing (NGS) apps and executable Linux command wrappers. The most recent app-a-thon, th
BioCompute Object App-a-thon
sought to improve the reproducibility of bioinformatics pipelines. Participants were asked to create BioCompute Objects (BCOs), a standardized schema for reporting computational scientific workflows, and apps to develop BCOs and check their conformance to BioCompute Specifications. In April 2016, precisionFDA was awarded the top prize in the Informatics category at the Bio IT World Best Practices Awards. In 2018, th
DNAnexus
platform, which is leveraged by precisionFDA, was grante
Authority to Operate
(ATO) by Health and Human Services (HHS) fo
FedRAMP Moderate
In addition, the precisionFDA team received an FDA Commissioner’s Special Citation Award in 2019 for outstanding achievements and collaboration in the development of the precisionFDA platform promoting innovative regulatory science research to modernize the regulation of NGS-based genomic tests. In 2019, precisionFDA received a FedHealthIT Innovation Award and transitioned from a beta to a production release state.


Functionality

PrecisionFDA is an open-source, cloud-based platform for collaborating and testing bioinformatics pipelines and multi-omics data. PrecisionFDA is available to all innovators in the field of multi-omics, including members of the scientific community, diagnostic test providers, pharmaceutical and biotechnology companies, and other constituencies such as advocacy groups and patients. The platform allows researchers to upload and analyze data from both their own and other groups’ studies. The platform hosts files such as reference genomes and genomic data, comparisons (quantification of similarities between sets of genomic variants), and apps (bioinformatics pipelines) that scientists and researchers can upload and work with. The precisionFDA virtual lab environment provides users with their own secure private area to conduct their research, and with configurable shared spaces where the FDA and external parties can share data and tools. For challenge sponsors, the precisionFDA platform provides a comprehensive challenge development framework enabling presentation of challenge assets, grading of submissions, and publication of results. To get involved, visi
precision.fda.gov
and request access to become a member of a growing community that is informing the evolution of precision medicine, advancing regulatory science, and enabling improvements in health outcomes.


References

{{reflist, 2


External links


Official websiteprecisionFDA on GitHub
Supercomputing Free software Personalized medicine