Statistics Software
   HOME

TheInfoList



OR:

Statistical software are specialized
computer program A computer program is a sequence or set of instructions in a programming language for a computer to execute. Computer programs are one component of software, which also includes documentation and other intangible components. A computer program ...
s for analysis in
statistics Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...
and
econometrics Econometrics is the application of Statistics, statistical methods to economic data in order to give Empirical evidence, empirical content to economic relationships.M. Hashem Pesaran (1987). "Econometrics," ''The New Palgrave: A Dictionary of ...
.


Open-source

*
ADaMSoft ADaMSoft is a free and open-source statistical software developed in Java and can run on any platform supporting Java. History ADaMSoft was initially started by Marco Scarnò as a simple prototype of the statistical software developed by UNESC ...
– a generalized statistical software with data mining algorithms and methods for data management *
ADMB ADMB or AD Model Builder is a free and open source software suite for non-linear statistical modeling. It was created by David Fournier and now being developed by the ADMB Project, a creation of the non-profit ADMB Foundation. The "AD" in AD Model ...
– a software suite for non-linear statistical modeling based on
C++ C++ (pronounced "C plus plus") is a high-level general-purpose programming language created by Danish computer scientist Bjarne Stroustrup as an extension of the C programming language, or "C with Classes". The language has expanded significan ...
which uses
automatic differentiation In mathematics and computer algebra, automatic differentiation (AD), also called algorithmic differentiation, computational differentiation, auto-differentiation, or simply autodiff, is a set of techniques to evaluate the derivative of a function ...
*
Chronux Chronux is an open-source software package developed for the loading, visualization and analysis of a variety of modalities / formats of neurobiological time series data. Usage of this tool enables neuroscientists to perform a variety of analysis ...
– for neurobiological time series data *
DAP DAP or Dap may refer to: Science * DAP (gene), human gene that encodes death-associated proteins, which mediate programmed cell death * Diamidophosphate, phosphorylating compound * Diaminopimelic acid, amino acid derivative of lysine * Diamin ...
– free replacement for SAS *
Environment for DeveLoping KDD-Applications Supported by Index-Structures ELKI (for ''Environment for DeveLoping KDD-Applications Supported by Index-Structures'') is a data mining (KDD, knowledge discovery in databases) software framework developed for use in research and teaching. It was originally at the database sy ...
(ELKI) a
software framework In computer programming, a software framework is an abstraction in which software, providing generic functionality, can be selectively changed by additional user-written code, thus providing application-specific software. It provides a standard ...
for developing data mining algorithms in
Java Java (; id, Jawa, ; jv, ꦗꦮ; su, ) is one of the Greater Sunda Islands in Indonesia. It is bordered by the Indian Ocean to the south and the Java Sea to the north. With a population of 151.6 million people, Java is the world's List ...
*
Epi Info Epi Info is statistical software for epidemiology developed by Centers for Disease Control and Prevention (CDC) in Atlanta Atlanta ( ) is the capital and most populous city of the U.S. state of Georgia. It is the seat of Fulton County, ...
statistical software Statistical software are specialized computer programs for analysis in statistics and econometrics. Open-source * ADaMSoft – a generalized statistical software with data mining algorithms and methods for data management * ADMB – a software ...
for
epidemiology Epidemiology is the study and analysis of the distribution (who, when, and where), patterns and determinants of health and disease conditions in a defined population. It is a cornerstone of public health, and shapes policy decisions and evidenc ...
developed by
Centers for Disease Control and Prevention The Centers for Disease Control and Prevention (CDC) is the national public health agency of the United States. It is a United States federal agency, under the Department of Health and Human Services, and is headquartered in Atlanta, Georgi ...
(CDC). Apache 2 licensed *
Fityk Fityk is a curve fitting and data analysis application, predominantly used to fit analytical, bell-shaped functions to experimental data. It is positioned to fill the gap between general plotting software and programs specific for one field, e.g. c ...
– nonlinear regression software (GUI and command line) *
GNU Octave GNU Octave is a high-level programming language primarily intended for scientific computing and numerical computation. Octave helps in solving linear and nonlinear problems numerically, and for performing other numerical experiments using a langu ...
– programming language very similar to MATLAB with statistical features *
gretl gretl is an open-source statistical package, mainly for econometrics. The name is an acronym for ''G''nu ''R''egression, ''E''conometrics and ''T''ime-series ''L''ibrary. It has both a graphical user interface (GUI) and a command-line interfa ...
– gnu regression, econometrics and time-series library *
intrinsic Noise Analyzer Within bioinformatics, intrinsic Noise Analyzer (iNA) is an open source software for studying reaction kinetics in living cells. The software analyzes mathematical models of intracellular reaction kinetics such as gene expression, regulatory ne ...
(iNA) – For analyzing intrinsic fluctuations in biochemical systems *
jamovi Jamovi (stylized in all lower-case as jamovi) is a free and open-source computer program for data analysis and performing statistical tests. The core developers of Jamovi are Jonathon Love, Damian Dropmann, and Ravi Selker who are developers for ...
– A free software alternative to IBM
SPSS SPSS Statistics is a statistical software suite developed by IBM for data management, advanced analytics, multivariate analysis, business intelligence, and criminal investigation. Long produced by SPSS Inc., it was acquired by IBM in 2009. C ...
Statistics *
JASP JASP (Jeffreys’s Amazing Statistics Program) is a free and open-source program for statistical analysis supported by the University of Amsterdam. It is designed to be easy to use, and familiar to users of SPSS. It offers standard analysis proce ...
– A free software alternative to IBM
SPSS SPSS Statistics is a statistical software suite developed by IBM for data management, advanced analytics, multivariate analysis, business intelligence, and criminal investigation. Long produced by SPSS Inc., it was acquired by IBM in 2009. C ...
Statistics with additional option for Bayesian methods *
JMulTi JMulTi is an open-source interactive software for econometric analysis, specialised in univariate and multivariate time series analysis. It has a Java graphical user interface. The motivation for its designed was to provide the means by which so ...
– For
econometric analysis Econometrics is the application of statistical methods to economic data in order to give empirical content to economic relationships.M. Hashem Pesaran (1987). "Econometrics," '' The New Palgrave: A Dictionary of Economics'', v. 2, p. 8 p. 8 ...
, specialised in univariate and multivariate
time series In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. Exa ...
analysis *
Just another Gibbs sampler Just another Gibbs sampler (JAGS) is a program for simulation from Bayesian hierarchical models using Markov chain Monte Carlo (MCMC), developed by Martyn Plummer. JAGS has been employed for statistical work in many fields, for example ecology, m ...
(JAGS) – a program for analyzing Bayesian hierarchical models using
Markov chain Monte Carlo In statistics, Markov chain Monte Carlo (MCMC) methods comprise a class of algorithms for sampling from a probability distribution. By constructing a Markov chain that has the desired distribution as its equilibrium distribution, one can obtain ...
developed by Martyn Plummer. It is similar to WinBUGS *
KNIME KNIME (), the Konstanz Information Miner, is a free and open-source data analytics, reporting and integration platform. KNIME integrates various components for machine learning and data mining through its modular data pipelining "Building Blocks ...
– An open source analytics platform built with
Java Java (; id, Jawa, ; jv, ꦗꦮ; su, ) is one of the Greater Sunda Islands in Indonesia. It is bordered by the Indian Ocean to the south and the Java Sea to the north. With a population of 151.6 million people, Java is the world's List ...
and
Eclipse An eclipse is an astronomical event that occurs when an astronomical object or spacecraft is temporarily obscured, by passing into the shadow of another body or by having another body pass between it and the viewer. This alignment of three ce ...
using modular data pipeline workflows *
LIBSVM LIBSVM and LIBLINEAR are two popular open source machine learning libraries, both developed at the National Taiwan University and both written in C++ though with a C API. LIBSVM implements the Sequential minimal optimization (SMO) algorithm for ...
– C++ support vector machine libraries *
mlpack mlpack is a machine learning software library for C++, built on top of the Armadillo library and thensmallennumerical optimization library. mlpack has an emphasis on scalability, speed, and ease-of-use. Its aim is to make machine learning possibl ...
– open-source library for machine learning, exploits C++ language features to provide maximum performance and flexibility while providing a simple and consistent
application programming interface An application programming interface (API) is a way for two or more computer programs to communicate with each other. It is a type of software interface, offering a service to other pieces of software. A document or standard that describes how t ...
(API) * Mondrian – data analysis tool using interactive statistical graphics with a link to R *
Neurophysiological Biomarker Toolbox The Neurophysiological Biomarker Toolbox (NBT) is an open source MATLAB toolbox for the computation and integration of neurophysiological biomarkers (e.g., biomarkers based on EEG or MEG recordings). The NBT toolbox has so far been used in se ...
– Matlab toolbox for data-mining of neurophysiological biomarkers *
OpenBUGS OpenBUGS is a software application for the Bayesian analysis of complex statistical models using Markov chain Monte Carlo (MCMC) methods. OpenBUGS is the open source variant of WinBUGS (Bayesian inference Using Gibbs Sampling). It runs under Micr ...
*
OpenEpi OpenEpi is a free, web-based, open source, operating system-independent series of programs for use in epidemiology, biostatistics, public health, and medicine, providing a number of epidemiologic and statistical tools for summary data. OpenEpi w ...
– A web-based, open-source, operating-independent series of programs for use in epidemiology and statistics based on JavaScript and HTML *
OpenMx OpenMx is an open source program for extended structural equation modeling. It runs as a package under R. Cross platform, it runs under Linux, Mac OS and Windows.S. Boker, M. Neale, H. Maes, M. Wilde, M. Spiegel, T. Brick, J. Spies, R. Estabrook, ...
– A package for
structural equation modeling Structural equation modeling (SEM) is a label for a diverse set of methods used by scientists in both experimental and observational research across the sciences, business, and other fields. It is used most in the social and behavioral scienc ...
running in
R (programming language) R is a programming language for statistical computing and graphics supported by the R Core Team and the R Foundation for Statistical Computing. Created by statisticians Ross Ihaka and Robert Gentleman, R is used among data miners, bioinform ...
*
OpenNN OpenNN (Open Neural Networks Library) is a software library written in the C++ programming language which implements neural networks, a main area of deep learning research. The library is open-source, licensed under the GNU Lesser General Public L ...
– A
software library In computer science, a library is a collection of non-volatile resources used by computer programs, often for software development. These may include configuration data, documentation, help data, message templates, pre-written code and subr ...
written in the programming language
C++ C++ (pronounced "C plus plus") is a high-level general-purpose programming language created by Danish computer scientist Bjarne Stroustrup as an extension of the C programming language, or "C with Classes". The language has expanded significan ...
which implements
neural network A neural network is a network or circuit of biological neurons, or, in a modern sense, an artificial neural network, composed of artificial neurons or nodes. Thus, a neural network is either a biological neural network, made up of biological ...
s, a main area of
deep learning Deep learning (also known as deep structured learning) is part of a broader family of machine learning methods based on artificial neural networks with representation learning. Learning can be supervised, semi-supervised or unsupervised. De ...
research *
Orange Orange most often refers to: *Orange (fruit), the fruit of the tree species '' Citrus'' × ''sinensis'' ** Orange blossom, its fragrant flower *Orange (colour), from the color of an orange, occurs between red and yellow in the visible spectrum * ...
, a data mining,
machine learning Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence. Machine ...
, and
bioinformatics Bioinformatics () is an interdisciplinary field that develops methods and software tools for understanding biological data, in particular when the data sets are large and complex. As an interdisciplinary field of science, bioinformatics combi ...
software *
Pandas Pediatric autoimmune neuropsychiatric disorders associated with streptococcal infections (PANDAS) is a controversial hypothetical diagnosis for a subset of children with rapid onset of obsessive-compulsive disorder (OCD) or tic disorders. Sy ...
High-performance computing High-performance computing (HPC) uses supercomputers and computer clusters to solve advanced computation problems. Overview HPC integrates systems administration (including network and security knowledge) and parallel programming into a mult ...
(HPC) data structures and data analysis tools for
Python Python may refer to: Snakes * Pythonidae, a family of nonvenomous snakes found in Africa, Asia, and Australia ** ''Python'' (genus), a genus of Pythonidae found in Africa and Asia * Python (mythology), a mythical serpent Computing * Python (pro ...
in Python and
Cython Cython () is a programming language that aims to be a superset of the Python programming language, designed to give C-like performance with code that is written mostly in Python with optional additional C-inspired syntax. Cython is a compiled ...
(statsmodels,
scikit-learn scikit-learn (formerly scikits.learn and also known as sklearn) is a free software machine learning library for the Python programming language. It features various classification, regression and clustering algorithms including support-vector m ...
) *
Perl Data Language Perl Data Language (abbreviated PDL) is a set of free software array programming extensions to the Perl programming language. PDL extends the data structures built into Perl, to include large multidimensional arrays, and adds functionality to m ...
– Scientific computing with Perl *
Ploticus Ploticus is a free, open-source (GPL) computer program for producing plots and charts from data. It runs under Unix, Solaris, Mac OS X, Linux and Win32 systems.
– software for generating a variety of graphs from raw data *
PSPP PSPP is a free software application for analysis of sampled data, intended as a free alternative for IBM SPSS Statistics. It has a graphical user interface and conventional command-line interface. It is written in C and uses GNU Scientific Lib ...
– A free software alternative to IBM
SPSS SPSS Statistics is a statistical software suite developed by IBM for data management, advanced analytics, multivariate analysis, business intelligence, and criminal investigation. Long produced by SPSS Inc., it was acquired by IBM in 2009. C ...
Statistics * R
free Free may refer to: Concept * Freedom, having the ability to do something, without having to obey anyone/anything * Freethought, a position that beliefs should be formed only on the basis of logic, reason, and empiricism * Emancipate, to procur ...
implementation of the
S (programming language) S is a statistical programming language developed primarily by John Chambers and (in earlier versions) Rick Becker and Allan Wilks of Bell Laboratories. The aim of the language, as expressed by John Chambers, is "to turn ideas into software, quic ...
**
Programming with Big Data in R Programming with Big Data in R (pbdR) is a series of R packages and an environment for statistical computing with big data by using high-performance statistical computation. The pbdR uses the same programming language as R with S3/S4 classes ...
(pbdR) – a series of R packages enhanced by
SPMD In computing, single program, multiple data (SPMD) is a technique employed to achieve parallelism; it is a subcategory of MIMD. Tasks are split up and run simultaneously on multiple processors with different input in order to obtain results fast ...
parallelism for
big data Though used sometimes loosely partly because of a lack of formal definition, the interpretation that seems to best describe Big data is the one associated with large body of information that we could not comprehend when used only in smaller am ...
analysis **
R Commander R Commander (Rcmdr) is a GUI for the R programming language, licensed under the GNU General Public License, and developed and maintained by John Fox in the sociology department at McMaster University. Rcmdr looks and works similarly to SPSS GUI ...
– GUI interface for R **
Rattle GUI Rattle GUI is a free and open source software (GNU GPL v2) package providing a graphical user interface (GUI) for data mining using the R statistical programming language. Rattle is used in a variety of situations. Currently there are 15 differe ...
– GUI interface for R **
Revolution Analytics Revolution Analytics (formerly REvolution Computing) is a statistical software company focused on developing open source and "open-core" versions of the free and open source software R for enterprise, academic and analytics customers. Revolution ...
– production-grade software for the enterprise big data analytics **
RStudio RStudio is an integrated development environment for R, a programming language for statistical computing and graphics. It is available in two formats: RStudio Desktop is a regular desktop application while RStudio Server runs on a remote server ...
– GUI interface and development environment for R *
ROOT In vascular plants, the roots are the organs of a plant that are modified to provide anchorage for the plant and take in water and nutrients into the plant body, which allows plants to grow taller and faster. They are most often below the sur ...
– an open-source C++ system for data storage, processing and analysis, developed by CERN and used to find the
Higgs boson The Higgs boson, sometimes called the Higgs particle, is an elementary particle in the Standard Model of particle physics produced by the quantum excitation of the Higgs field, one of the fields in particle physics theory. In the Stand ...
*
Salstat Salstat is a free software application for the statistical analysis of numeric data with an emphasis on ease-of-use. Using both a graphical user interface The GUI ( "UI" by itself is still usually pronounced . or ), graphical user interface, ...
– menu-driven statistics software *
Scilab Scilab is a free and open-source, cross-platform numerical computational package and a high-level, numerically oriented programming language. It can be used for signal processing, statistical analysis, image enhancement, fluid dynamics simulat ...
– uses GPL-compatible
CeCILL CeCILL (from CEA CNRS INRIA Logiciel Libre) is a free software license adapted to both international and French legal matters, in the spirit of and retaining compatibility with the GNU General Public License (GPL). It was jointly developed by ...
license *
SciPy SciPy (pronounced "sigh pie") is a free and open-source Python library used for scientific computing and technical computing. SciPy contains modules for optimization, linear algebra, integration, interpolation, special functions, FFT, signal ...
Python Python may refer to: Snakes * Pythonidae, a family of nonvenomous snakes found in Africa, Asia, and Australia ** ''Python'' (genus), a genus of Pythonidae found in Africa and Asia * Python (mythology), a mythical serpent Computing * Python (pro ...
library for scientific computing that contains the ''stats'' sub-package which is partly based on the venerable '', STAT'' (a.k.a. ''PipeStat'', formerly ''UNIX, STAT'') software **
scikit-learn scikit-learn (formerly scikits.learn and also known as sklearn) is a free software machine learning library for the Python programming language. It features various classification, regression and clustering algorithms including support-vector m ...
– extends SciPy with a host of machine learning models (classification, clustering, regression, etc.) **
statsmodels Statsmodels is a Python package that allows users to explore data, estimate statistical models, and perform statistical tests. An extensive list of descriptive statistics, statistical tests, plotting functions, and result statistics are available ...
– extends SciPy with statistical models and tests (regression, plotting, example datasets,
generalized linear model In statistics, a generalized linear model (GLM) is a flexible generalization of ordinary linear regression. The GLM generalizes linear regression by allowing the linear model to be related to the response variable via a ''link function'' and b ...
(GLM),
time series analysis In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. Exa ...
,
autoregressive–moving-average model In the statistical analysis of time series, autoregressive–moving-average (ARMA) models provide a parsimonious description of a (weakly) stationary stochastic process in terms of two polynomials, one for the autoregression (AR) and the second ...
(ARMA),
vector autoregression Vector autoregression (VAR) is a statistical model used to capture the relationship between multiple quantities as they change over time. VAR is a type of stochastic process model. VAR models generalize the single-variable (univariate) autoregres ...
(VAR),
non-parametric statistics Nonparametric statistics is the branch of statistics that is not based solely on parametrized families of probability distributions (common examples of parameters are the mean and variance). Nonparametric statistics is based on either being distr ...
,
ANOVA Analysis of variance (ANOVA) is a collection of statistical models and their associated estimation procedures (such as the "variation" among and between groups) used to analyze the differences among means. ANOVA was developed by the statistician ...
,
empirical likelihood Empirical likelihood (EL) is a nonparametric method that requires fewer assumptions about the error distribution while retaining some of the merits in likelihood-based inference. The estimation method requires that the data are independent and ident ...
) *
Shogun (toolbox) Shogun is a Free software, free, open-source software, open-source machine learning software library written in C++. It offers numerous algorithms and data structures for machine learning problems. It offers interfaces for GNU Octave, Octave, Py ...
open-source Open source is source code that is made freely available for possible modification and redistribution. Products include permission to use the source code, design documents, or content of the product. The open-source model is a decentralized sof ...
, large-scale machine learning toolbox that provides several SVM (Support Vector Machine) implementations (like libSVM, SVMlight) under a common framework and interfaces to Octave, MATLAB, Python, R * Simfit – simulation, curve fitting, statistics, and plotting *
SOCR The Statistics Online Computational Resource (SOCR) is an online multi-institutional research and education organization. SOCR designs, validates and broadly shares a suite of online tools for statistical computing, and interactive materials for ...
*
SOFA Statistics SOFA Statistics is an open-source statistical package. The name stands for ''S''tatistics ''O''pen ''F''or ''A''ll. It has a graphical user interface and can connect directly to MySQL, PostgreSQL, SQLite, MS Access (map), and Microsoft SQL Ser ...
– desktop GUI program focused on ease of use, learn as you go, and beautiful output *
Stan (software) Stan is a probabilistic programming language for statistical inference written in C++.Stan Development Team. 2015Stan Modeling Language User's Guide and Reference Manual, Version 2.9.0/ref> The Stan language is used to specify a (Bayesian) stati ...
– open-source package for obtaining Bayesian inference using the No-U-Turn sampler, a variant of
Hamiltonian Monte Carlo The Hamiltonian Monte Carlo algorithm (originally known as hybrid Monte Carlo) is a Markov chain Monte Carlo method for obtaining a sequence of random samples which converge to being distributed according to a target probability distribution for whi ...
. It is somewhat like BUGS, but with a different language for expressing models and a different sampler for sampling from their posteriors * Statistical LabR-based and focusing on educational purposes *
TOPCAT (software) TOPCAT is an interactive graphical viewer and editor for tabular data. Although a general purpose tool capable of handling large and sparse datasets with correlation functionality its specialist application area is astronomy and it was initia ...
– interactive graphical analysis and manipulation package for astronomers that understands
FITS Flexible Image Transport System (FITS) is an open standard defining a digital file format useful for storage, transmission and processing of data: formatted as multi-dimensional arrays (for example a 2D image), or tables. FITS is the most com ...
, VOTable and CDF formats. *
Torch (machine learning) Torch is an open-source machine learning library, a scientific computing framework, and a script language based on the Lua programming language. It provides a wide range of algorithms for deep learning, and uses the scripting language LuaJIT, and ...
– a
deep learning Deep learning (also known as deep structured learning) is part of a broader family of machine learning methods based on artificial neural networks with representation learning. Learning can be supervised, semi-supervised or unsupervised. De ...
software library written in
Lua (programming language) Lua ( ; from meaning ''moon'') is a lightweight, high-level, multi-paradigm programming language designed primarily for embedded use in applications. Lua is cross-platform, since the interpreter of compiled bytecode is written in ANSI C, an ...
*
Weka (machine learning) Waikato Environment for Knowledge Analysis (Weka), developed at the University of Waikato, New Zealand, is free software licensed under the GNU General Public License, and the companion software to the book "Data Mining: Practical Machine Learning ...
– a suite of machine learning software written at the
University of Waikato The University of Waikato ( mi, Te Whare Wānanga o Waikato), is a Public university, public research university in Hamilton, New Zealand, Hamilton, New Zealand established in 1964. An additional campus is located in Tauranga. The university perfo ...


Public domain

*
CSPro CSPro, short for the Census and Survey Processing System, is a public domain data processing software package developed by the U.S. Census Bureau and ICF International. Serpro S.A. was involved in past development. Funding for development comes ...
(core is public domain but without publicly available source code; the web UI has been open sourced under Apache version 2 and the help system under GPL version 3) *
Dataplot Dataplot is a public domain software system for scientific visualization and statistical analysis. It was developed and is being maintained at the National Institute of Standards and Technology. Dataplot's source code In computing, source cod ...
(NIST) *
X-13ARIMA-SEATS X-13ARIMA-SEATS, successor to X-12-ARIMA and X-11, is a set of statistical methods for seasonal adjustment and other descriptive analysis of time series data that are implemented in the U.S. Census Bureau's software package. These methods are or ...
(public domain in the United States only; outside of the United States is under US government copyright)


Freeware

* BV4.1 *
GeoDA GeoDa is a free software package that conducts spatial data analysis, geovisualization, spatial autocorrelation and spatial modeling. It runs on different versions of Windows, Mac OS, and Linux. The package was initially developed by the Spatial ...
* MaxStat Lite – general statistical software *
MINUIT MINUIT, now MINUIT2, is a numerical minimization software library developed at the European Organization for Nuclear Research (CERN). It provides several algorithms that search for parameter values that minimize a user-defined function, and comput ...
*
WinBUGS WinBUGS is statistical software for Bayesian analysis using Markov chain Monte Carlo (MCMC) methods. It is based on the BUGS ( Bayesian inference Using Gibbs Sampling) project started in 1989. It runs under Microsoft Windows, though it can also ...
Bayesian analysis Bayesian inference is a method of statistical inference in which Bayes' theorem is used to update the probability for a hypothesis as more evidence or information becomes available. Bayesian inference is an important technique in statistics, and ...
using
Markov chain Monte Carlo In statistics, Markov chain Monte Carlo (MCMC) methods comprise a class of algorithms for sampling from a probability distribution. By constructing a Markov chain that has the desired distribution as its equilibrium distribution, one can obtain ...
methods *
Winpepi WinPepi is a freeware package of statistical programs for epidemiologists Epidemiology is the study and analysis of the distribution (who, when, and where), patterns and determinants of health and disease conditions in a defined population. It ...
– package of statistical programs for epidemiologists


Proprietary

*
Alteryx Alteryx is an American computer software company based in Irvine, California, with a development center in Broomfield, Colorado. The company's products are used for data science and analytics. The software is designed to make advanced analytic ...
– analytics platform with drag and drop statistical models; R and Python integration * Analytica – visual analytics and statistics package *
Angoss Angoss Software Corporation, headquartered in Toronto, Ontario, Canada, with offices in the United States and UK, acquired by Datawatch and now owned by Altair, was a provider of predictive analytics systems through software licensing and ser ...
– products KnowledgeSEEKER and KnowledgeSTUDIO incorporate several data mining algorithms *
ASReml ASReml is a statistical software package for fitting linear mixed models using restricted maximum likelihood, a technique commonly used in plant and animal breeding and quantitative genetics as well as other fields. It is notable for its ability to ...
– for restricted maximum likelihood analyses *
BMDP BMDP was a statistical package developed in 1965 by Wilfrid Dixon at the University of California, Los Angeles. The acronym stands for Bio-Medical Data Package, the word package was added by Dixon as the software consisted of a series of programs ...
– general statistics package *
DataGraph DataGraph is a graphing and data analysis software application for the macOS operating system, developed by Visual Data Tools in Chapel Hill, NC. DataGraph is used for creating publication quality graphics, particularly for research and scienc ...
– visual analysis with linear and nonlinear regression * DB Lytix – 800+ in-database models *
EViews EViews is a statistical package for Microsoft Windows, Windows, used mainly for time-series oriented econometrics, econometric analysis. It is developed by Quantitative Micro Software (QMS), now a part of IHS Inc., IHS. Version 1.0 was released ...
– for
econometric analysis Econometrics is the application of statistical methods to economic data in order to give empirical content to economic relationships.M. Hashem Pesaran (1987). "Econometrics," '' The New Palgrave: A Dictionary of Economics'', v. 2, p. 8 p. 8 ...
*
FAME (database) FAME (Forecasting Analysis and Modeling Environment) is a time series database released in 1981 and owned by FIS Global. History The FAME software environment had several development phases during its history. Lawrence C. Rafsky founded GemN ...
– a system for managing time-series databases *
GAUSS Johann Carl Friedrich Gauss (; german: Gauß ; la, Carolus Fridericus Gauss; 30 April 177723 February 1855) was a German mathematician and physicist who made significant contributions to many fields in mathematics and science. Sometimes refer ...
programming language A programming language is a system of notation for writing computer programs. Most programming languages are text-based formal languages, but they may also be graphical. They are a kind of computer language. The description of a programming ...
for statistics * Genedata – software for integration and interpretation of experimental data in the life science R&D *
GenStat Genstat (General Statistics) is a statistical software package with data analysis capabilities, particularly in the field of agriculture. It was developed in 1968 by the Rothamsted Research in the United Kingdom and was designed to provide modul ...
– general statistics package * GLIM – early package for fitting
generalized linear model In statistics, a generalized linear model (GLM) is a flexible generalization of ordinary linear regression. The GLM generalizes linear regression by allowing the linear model to be related to the response variable via a ''link function'' and b ...
s * GraphPad InStat – very simple with much guidance and explanations *
GraphPad Prism GraphPad Software Inc. was a privately held software development corporation until its acquisition by Insight Partners in 2017. The company was named Insightful Science, which itself merged with Dotmatics in 2021. The original software was writt ...
– biostatistics and nonlinear regression with clear explanations *
IMSL Numerical Libraries IMSL (International Mathematics and Statistics Library) is a commercial collection of software libraries of numerical analysis functionality that are implemented in the computer programming languages C, Java, C#.NET, and Fortran. A Python interf ...
– software library with statistical algorithms * JMP – visual analysis and statistics package *
LIMDEP LIMDEP is an econometric and statistical software package with a variety of estimation tools. In addition to the core econometric tools for analysis of cross sections and time series, LIMDEP supports methods for panel data analysis, frontier and e ...
– comprehensive statistics and econometrics package *
LISREL LISREL (linear structural relations) is a proprietary statistical software package used in structural equation modeling (SEM) for manifest and latent variables. It requires a "fairly high level of statistical sophistication". History LISREL was d ...
– statistics package used in structural equation modeling *
Maple ''Acer'' () is a genus of trees and shrubs commonly known as maples. The genus is placed in the family Sapindaceae.Stevens, P. F. (2001 onwards). Angiosperm Phylogeny Website. Version 9, June 2008 nd more or less continuously updated since http ...
programming language A programming language is a system of notation for writing computer programs. Most programming languages are text-based formal languages, but they may also be graphical. They are a kind of computer language. The description of a programming ...
with statistical features *
Mathematica Wolfram Mathematica is a software system with built-in libraries for several areas of technical computing that allow machine learning, statistics, symbolic computation, data manipulation, network analysis, time series analysis, NLP, optimizat ...
– a software package with statistical particularly ŋ features *
MATLAB MATLAB (an abbreviation of "MATrix LABoratory") is a proprietary multi-paradigm programming language and numeric computing environment developed by MathWorks. MATLAB allows matrix manipulations, plotting of functions and data, implementation ...
programming language A programming language is a system of notation for writing computer programs. Most programming languages are text-based formal languages, but they may also be graphical. They are a kind of computer language. The description of a programming ...
with statistical features * MaxStat Pro – general statistical software *
MedCalc __NOTOC__ MedCalc is a statistical software package designed for the biomedical sciences. It has an integrated spreadsheet for data input and can import files in several formats (Microsoft Excel, Excel, SPSS, Comma-separated values, CSV, ...). M ...
– for biomedical sciences * Microfit – econometrics package, time series *
Minitab Minitab is a statistics package developed at the Pennsylvania State University by researchers Barbara F. Ryan, Thomas A. Ryan, Jr., and Brian L. Joiner in conjunction with Triola Statistics Company in 1972. It began as a light version of OMNITA ...
– general statistics package *
MLwiN MLwiN is a statistical software package for fitting multilevel models. It uses both maximum likelihood estimation and Markov chain Monte Carlo (MCMC) methods. MLwiN is based on an earlier package, MLn, but with a graphical user interface (as well ...
– multilevel models (free to UK academics) *
Nacsport Video Analysis Software Nacsport Video Analysis Software has been designed for analysing sports through video. It is available for Windows and MacOS and was developed by the Spanish company Nacsport. Nacsport can be used for both live and retrospective analysis and ca ...
– software for analysing sports and obtaining statistical intelligence *
NAG Numerical Library The NAG Numerical Library is a software product developed and sold by The Numerical Algorithms Group Ltd. It is a software library of numerical analysis routines, containing more than 1,900 mathematical and statistical algorithms. Areas covered by ...
– comprehensive math and statistics library * NCSS – general statistics package *
Neural Designer Neural Designer is a software tool for machine learning based on neural networks, a main area of artificial intelligence research, and contains a graphical user interface which simplifies data entry and interpretation of results. In 2015, Neural ...
– commercial
deep learning Deep learning (also known as deep structured learning) is part of a broader family of machine learning methods based on artificial neural networks with representation learning. Learning can be supervised, semi-supervised or unsupervised. De ...
package *
NLOGIT NLOGIT is an extension of the econometric and statistical software package LIMDEP. In addition to the estimation tools in LIMDEP, NLOGIT provides programs for estimation, model simulation and analysis of multinomial choice data, such as brand cho ...
– comprehensive statistics and econometrics package *
nQuery Sample Size Software nQuery is a clinical trial design platform used for the design and monitoring of adaptive, group sequential, and fixed sample size trials. It is most commonly used by biostatisticians to calculate sample size and statistical power for adaptive c ...
– Sample Size and Power Analysis Software * O-Matrix
programming language A programming language is a system of notation for writing computer programs. Most programming languages are text-based formal languages, but they may also be graphical. They are a kind of computer language. The description of a programming ...
* OriginPro – statistics and graphing, programming access to NAG library *
PASS Sample Size Software PASS is a computer program for estimating sample size or determining the power of a statistical test or confidence interval. NCSS LLC is the company that produces PASS. NCSS LLC also produces NCSS (for statistical analysis). PASS includes over ...
(PASS) – power and sample size software from NCSS *
Plotly Plotly is a technical computing company headquartered in Montreal, Quebec, that develops online data analytics and data visualization, visualization tools. Plotly provides online graphing, analytics, and statistics tools for individuals and coll ...
– plotting library and styling interface for analyzing data and creating browser-based graphs. Available for R,
Python Python may refer to: Snakes * Pythonidae, a family of nonvenomous snakes found in Africa, Asia, and Australia ** ''Python'' (genus), a genus of Pythonidae found in Africa and Asia * Python (mythology), a mythical serpent Computing * Python (pro ...
,
MATLAB MATLAB (an abbreviation of "MATrix LABoratory") is a proprietary multi-paradigm programming language and numeric computing environment developed by MathWorks. MATLAB allows matrix manipulations, plotting of functions and data, implementation ...
,
Julia Julia is usually a feminine given name. It is a Latinate feminine form of the name Julio and Julius. (For further details on etymology, see the Wiktionary entry "Julius".) The given name ''Julia'' had been in use throughout Late Antiquity (e.g. ...
, and
Perl Perl is a family of two high-level, general-purpose, interpreted, dynamic programming languages. "Perl" refers to Perl 5, but from 2000 to 2019 it also referred to its redesigned "sister language", Perl 6, before the latter's name was offici ...
*
Primer-E Primer Plymouth Routines In Multivariate Ecological Research (PRIMER) is a statistical package that is a collection of specialist univariate, multivariate, and graphical routines for analyzing species sampling data for community ecology. Types of data an ...
– environmental and ecological specific *
PV-WAVE PV-WAVE (Precision Visuals - Workstation Analysis and Visualization Environment) is an array oriented fourth-generation programming language used by engineers, scientists, researchers, business analysts and software developers to build and deploy ...
programming language A programming language is a system of notation for writing computer programs. Most programming languages are text-based formal languages, but they may also be graphical. They are a kind of computer language. The description of a programming ...
comprehensive data analysis and visualization with IMSL statistical package *
Qlucore Qlucore was founded in early 2007. Qlucore started as a collaborative research project at Lund University , motto = Ad utrumque , mottoeng = Prepared for both , established = , type ...
Omics Explorer – interactive and visual data analysis software *
RapidMiner RapidMiner is a data science platform designed for enterprises that analyses the collective impact of organizations’ employees, expertise and data. Rapid Miner's data science platform is intended to support many analytics users across a broad AI ...
machine learning Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence. Machine ...
toolbox *
Regression Analysis of Time Series RATS, an abbreviation of Regression Analysis of Time Series, is a statistical package for time series analysis and econometrics. RATS is developed and sold by Estima, Inc., located in Evanston, IL. History The forerunner of RATS was a FORTRAN p ...
(RATS) – comprehensive
econometric analysis Econometrics is the application of statistical methods to economic data in order to give empirical content to economic relationships.M. Hashem Pesaran (1987). "Econometrics," '' The New Palgrave: A Dictionary of Economics'', v. 2, p. 8 p. 8 ...
package *
S-PLUS S-PLUS is a commercial implementation of the S programming language sold by TIBCO Software Inc. It features object-oriented programming capabilities and advanced analytical algorithms. Due to the increasing popularity of the open source S succ ...
– general statistics package *
SAS (software) SAS (previously "Statistical Analysis System") is a statistical software suite developed by SAS Institute for data management, advanced analytics, multivariate analysis, business intelligence, criminal investigation, and predictive analytics. ...
– comprehensive statistical package *
SHAZAM (Econometrics and Statistics Software) Shazam is a comprehensive econometrics and statistics package for estimating, testing, simulating and forecasting many types of econometrics and statistical models. SHAZAM was originally created in 1977 by Kenneth White. Data management All S ...
– comprehensive econometrics and statistics package *
SigmaStat SigmaStat is a statistical software package, which was originally developed by Jandel Scientific Software in the 1980s. As of October 1996, Systat Software is now based in San Jose, California San Jose, officially San José (; ; ), is a maj ...
– package for group analysis * Simul – econometric tool for multidimensional (multi-sectoral, multi-regional) modeling *
SmartPLS SmartPLS is a software with graphical user interface for variance-based structural equation modeling (SEM) using the partial least squares (PLS) path modeling method. Users can estimate models with their data by using basic PLS-SEM, weighted PL ...
– statistics package used in
partial least squares path modeling The partial least squares path modeling or partial least squares structural equation modeling (PLS-PM, PLS-SEM) is a method for structural equation modeling that allows estimation of complex cause-effect relationships in path models with latent var ...
(PLS) and PLS-based
structural equation modeling Structural equation modeling (SEM) is a label for a diverse set of methods used by scientists in both experimental and observational research across the sciences, business, and other fields. It is used most in the social and behavioral scienc ...
*
SOCR The Statistics Online Computational Resource (SOCR) is an online multi-institutional research and education organization. SOCR designs, validates and broadly shares a suite of online tools for statistical computing, and interactive materials for ...
– online tools for teaching
statistics Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...
and
probability theory Probability theory is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expressing it through a set o ...
*
Speakeasy (computational environment) Speakeasy was a numerical computing interactive environment also featuring an interpreted programming language. It was initially developed for internal use at the Physics Division of Argonne National Laboratory by the theoretical physicist ...
– numerical computational environment and programming language with many statistical and
econometric analysis Econometrics is the application of statistical methods to economic data in order to give empirical content to economic relationships.M. Hashem Pesaran (1987). "Econometrics," '' The New Palgrave: A Dictionary of Economics'', v. 2, p. 8 p. 8 ...
features *
SPSS Modeler IBM SPSS Modeler is a data mining and text analytics software application from IBM. It is used to build predictive models and conduct other analytic tasks. It has a visual interface which allows users to leverage statistical and data mining a ...
– comprehensive data mining and text analytics workbench *
SPSS SPSS Statistics is a statistical software suite developed by IBM for data management, advanced analytics, multivariate analysis, business intelligence, and criminal investigation. Long produced by SPSS Inc., it was acquired by IBM in 2009. C ...
Statistics – comprehensive statistics package *
Stata Stata (, , alternatively , occasionally stylized as STATA) is a general-purpose statistical software package developed by StataCorp for data manipulation, visualization, statistics, and automated reporting. It is used by researchers in many fie ...
– comprehensive statistics package *
StatCrunch StatCrunch is a web-based statistical software application from Pearson Education. StatCrunch was originally created for use in college statistics courses. As a full-featured statistics package, it is now also used for research and for other sta ...
– comprehensive statistics package, originally designed for college statistics courses *
Statgraphics Statgraphics is a statistics package that performs and explains basic and advanced statistical functions. History The software was created in 1980 by Dr. Neil W. Polhemus while on the faculty at the Princeton University School of Engineering ...
– general statistics package to include
cloud computing Cloud computing is the on-demand availability of computer system resources, especially data storage ( cloud storage) and computing power, without direct active management by the user. Large clouds often have functions distributed over mul ...
and Six Sigma for use in
business development Business development entails tasks and processes to develop and implement growth opportunities within and between organizations. It is a subset of the fields of business, commerce and organizational theory. Business development is the creation of ...
, process improvement,
data visualization Data and information visualization (data viz or info viz) is an interdisciplinary field that deals with the graphic representation of data and information. It is a particularly efficient way of communicating when the data or information is num ...
and statistical analysis,
design of experiment A design is a plan or specification for the construction of an object or system or for the implementation of an activity or process or the result of that plan or specification in the form of a prototype, product, or process. The verb ''to design' ...
, point processes, geospatial analysis, regression, and time series analysis are all included within this complete statistical package. *
Statistica Statistica is an advanced analytics software package originally developed by StatSoft and currently maintained by TIBCO Software Inc. Statistica provides data analysis, data management, statistics, data mining, machine learning, text analytics a ...
– comprehensive statistics package *
StatsDirect StatsDirect is a statistical software package designed for biomedical, public health, and general health science uses. The second generation of the software was reviewed in general medical and public health journals. Features and use StatsDire ...
– statistics package designed for biomedical, public health and general health science uses *
StatXact StatXact is a statistical software package for analyzing data using exact statistics. It calculates exact p-values and confidence intervals for contingency tables and non-parametric procedures. It is marketed by Cytel Cytel is a multinational s ...
– package for exact nonparametric and parametric statistics *
SuperCROSS The AMA Supercross Championship (commercially known as Monster Energy AMA Supercross) is an American motorcycle racing series. Founded by the American Motorcyclist Association (AMA) in 1974, the AMA Supercross Championship races are held from ...
– comprehensive statistics package with ad-hoc, cross tabulation analysis * Systat – general statistics package * The Unscrambler – free-to-try commercial
multivariate analysis Multivariate statistics is a subdivision of statistics encompassing the simultaneous observation and analysis of more than one outcome variable. Multivariate statistics concerns understanding the different aims and background of each of the dif ...
software for Windows *
Unistat The Unistat computer program is a statistical data analysis tool featuring two modes of operation: The stand-alone user interface is a complete workbench for data input, analysis and visualization while the Microsoft Excel add-in mode extends t ...
– general statistics package that can also work as Excel add-in *
WarpPLS WarpPLS is a software with graphical user interface for variance-based and factor-based structural equation modeling, structural equation modeling (SEM) using the partial least squares path modeling, partial least squares and factor-based methods. ...
– statistics package used in
structural equation modeling Structural equation modeling (SEM) is a label for a diverse set of methods used by scientists in both experimental and observational research across the sciences, business, and other fields. It is used most in the social and behavioral scienc ...
*
Wolfram Language The Wolfram Language ( ) is a general multi-paradigm programming language developed by Wolfram Research. It emphasizes symbolic computation, functional programming, and rule-based programming and can employ arbitrary structures and data. It is ...
– the computer language that evolved from the program
Mathematica Wolfram Mathematica is a software system with built-in libraries for several areas of technical computing that allow machine learning, statistics, symbolic computation, data manipulation, network analysis, time series analysis, NLP, optimizat ...
. It has similar statistical capabilities as Mathematica. *
World Programming System The World Programming System, also known as WPS Analytics or WPS, is a software product developed by a company called World Programming (acquired by Altair Engineering). WPS Analytics supports users of mixed ability to access and process data a ...
(WPS) – statistical package that supports the use of
Python Python may refer to: Snakes * Pythonidae, a family of nonvenomous snakes found in Africa, Asia, and Australia ** ''Python'' (genus), a genus of Pythonidae found in Africa and Asia * Python (mythology), a mythical serpent Computing * Python (pro ...
, R and
SAS SAS or Sas may refer to: Arts, entertainment, and media * ''SAS'' (novel series), a French book series by Gérard de Villiers * ''Shimmer and Shine'', an American animated children's television series * Southern All Stars, a Japanese rock ba ...
languages within in a single user program. *
XploRe Xplore may refer to: Science and technology * IEEE Xplore, an online database of IEEE research publications * XploRe, a statistical software environment * Xplore Technologies, designer, marketer and manufacturer of rugged tablet computers * Xplore ...


Add-ons

*
Analyse-it Analyse-it is a statistical analysis add-in for Microsoft Excel. Analyse-it is the successor to Astute, developed in 1992 for Excel 4 and the first statistical analysis add-in for Microsoft Excel. Analyse-it provides a range of standard parametric ...
– add-on to
Microsoft Excel Microsoft Excel is a spreadsheet developed by Microsoft for Microsoft Windows, Windows, macOS, Android (operating system), Android and iOS. It features calculation or computation capabilities, graphing tools, pivot tables, and a macro (comp ...
for statistical analysis *
Statgraphics Statgraphics is a statistics package that performs and explains basic and advanced statistical functions. History The software was created in 1980 by Dr. Neil W. Polhemus while on the faculty at the Princeton University School of Engineering ...
Sigma Express – add-on to
Microsoft Excel Microsoft Excel is a spreadsheet developed by Microsoft for Microsoft Windows, Windows, macOS, Android (operating system), Android and iOS. It features calculation or computation capabilities, graphing tools, pivot tables, and a macro (comp ...
for Six Sigma statistical analysis *
SUDAAN SUDAAN is a proprietary statistical software package for the analysis of correlated data, including correlated data encountered in complex sample surveys. SUDAAN originated in 1972 at RTI International (the trade name of Research Triangle Institu ...
– add-on to
SAS SAS or Sas may refer to: Arts, entertainment, and media * ''SAS'' (novel series), a French book series by Gérard de Villiers * ''Shimmer and Shine'', an American animated children's television series * Southern All Stars, a Japanese rock ba ...
and
SPSS SPSS Statistics is a statistical software suite developed by IBM for data management, advanced analytics, multivariate analysis, business intelligence, and criminal investigation. Long produced by SPSS Inc., it was acquired by IBM in 2009. C ...
for
statistical survey Survey methodology is "the study of survey methods". As a field of applied statistics concentrating on human-research surveys, survey methodology studies the sampling of individual units from a population and associated techniques of survey da ...
s *
XLfit XLfit is a Microsoft Excel add-in that can perform regression analysis, curve fitting, and statistical analysis. It is approved by the UK National Physical Laboratory and the US National Institute of Standards and Technology XLfit can generat ...
add-on to
Microsoft Excel Microsoft Excel is a spreadsheet developed by Microsoft for Microsoft Windows, Windows, macOS, Android (operating system), Android and iOS. It features calculation or computation capabilities, graphing tools, pivot tables, and a macro (comp ...
for curve fitting and statistical analysis


See also

*
Comparison of statistical packages The following tables compare general and technical information for a number of statistical analysis packages. General information Operating system support ANOVA Support for various ANOVA methods Regression Support for various Regression an ...
*
Free statistical software Free statistical software is a practical alternative to commercial packages. Many of the free to use programs aim to be similar in function to commercial packages, in that they are general statistical packages that perform a variety of statistica ...
*
List of computer algebra systems The following tables provide a comparison of computer algebra systems (CAS). A CAS is a package comprising a set of algorithms for performing symbolic manipulations on algebraic objects, a language to implement them, and an environment in which to ...
*
List of information graphics software This is a list of software to create any kind of information graphics: * either includes the ability to create one or more infographics from a provided data set * either it is provided specifically for information visualization Vector graphics V ...
*
List of numerical libraries This is a list of numerical libraries, which are libraries used in software development for performing numerical calculations. It is not a complete listing but is instead a list of numerical libraries with articles on Wikipedia, with few exceptio ...
*
List of numerical-analysis software Listed here are notable end-user computer applications intended for use with numerical or data analysis: Numerical-software packages General-purpose computer algebra systems Interface-oriented Language-oriented Historically significa ...
*
Mathematical software Mathematical software is software used to model, analyze or calculate numeric, symbolic or geometric data. Evolution of mathematical software Numerical analysis and symbolic computation had been in most important place of the subject, but other ki ...
*
Psychometric software Psychometric software is software that is used for psychometric analysis of data from tests, questionnaires, or inventories reflecting latent psychoeducational variables. While some psychometric analyses can be performed with standard statistical ...


References


External links

* {{Public health
Statistical packages Statistical software are specialized computer programs for analysis in statistics and econometrics. Open-source * ADaMSoft – a generalized statistical software with data mining algorithms and methods for data management * ADMB – a software ...
Software Software is a set of computer programs and associated documentation and data. This is in contrast to hardware, from which the system is built and which actually performs the work. At the lowest programming level, executable code consists ...