Dap is a statistics and graphics program based on the C programming language that performs data management, analysis, and C-style graphical visualization tasks without requiring complex syntax. Its name is an acronym for Data Analysis and Presentation.
Dap was written to be a free replacement for SAS, but users are assumed to have a basic familiarity with the C programming language in order to permit greater flexibility.
It has been designed to be used on large data sets and is primarily used in statistical consulting practices.
Despite these benefits, Dap has not been updated since 2014 and has not seen widespread use compared with other statistical analysis programs.
Features
Dap is a command-line driven program. Its main features are described below.
Dap can compute means, percentiles, correlations, and ANOVA from data sets, including unbalanced, crossed, and nested ANOVA. It can also be used to create scatterplots, line graphs, and histograms of data. This can include split plots, treatment combinations, and Latin squares.
Dap can perform linear regression and can use regressions to build linear models. It can also perform logistic regression analysis. Other supported analyses include building loglinear models and logit models for linear-by-linear association. In terms of models, Dap can create mixed balanced and unbalanced models as well as random unbalanced models.
Dap is designed to cope with very large data sets, even when the size of the data exceeds the size of the computer's memory, because the program processes files one line at a time rather than reading entire files into memory.
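The line-at-a-time approach can be illustrated with a minimal C sketch. It is not taken from Dap's source code and does not use Dap's own functions; the type `stream_summary` and the function `summarize_stream` are hypothetical names introduced only for this illustration. The sketch assumes a text input with one number per line and lines shorter than 256 characters, and it updates a running mean so that only the current line is ever held in memory.

```c
#include <assert.h>
#include <stdio.h>
#include <stdlib.h>

/* Running summary of one numeric column (hypothetical type, not part of Dap). */
typedef struct {
    long   count;  /* number of values read so far */
    double mean;   /* running mean of those values */
} stream_summary;

/*
 * Read one number per line from `in` and update the mean incrementally.
 * Only the current line is kept in memory, so the input may be far larger
 * than available RAM. Lines that do not start with a number are skipped.
 * Assumes each line fits in the 256-byte buffer.
 * Returns 0 on success and -1 if a read error occurred.
 */
int summarize_stream(FILE *in, stream_summary *out)
{
    char line[256];

    assert(in != NULL && out != NULL);
    out->count = 0;
    out->mean = 0.0;

    while (fgets(line, sizeof line, in) != NULL) {
        char *end;
        double x = strtod(line, &end);

        if (end == line)
            continue;  /* no number parsed on this line */

        out->count += 1;
        /* Incremental update: new_mean = old_mean + (x - old_mean) / n */
        out->mean += (x - out->mean) / (double)out->count;
    }
    return ferror(in) ? -1 : 0;
}
```

A caller would open a file with `fopen`, pass the resulting `FILE *` and a `stream_summary` to `summarize_stream`, and then read `count` and `mean` from the structure. Because the mean is updated per value, memory use stays constant regardless of file size.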
Applications
Industry Uses
* Statistical Consulting Practices
* Low-level Statistical Analysis
See also
* Comparison of statistical packages
* gretl
* PSPP
External links
* administrative page