
SPSS Statistics is a statistical software suite developed by IBM for data management, advanced analytics, multivariate analysis, business intelligence, and criminal investigation. Long produced by SPSS Inc., it was acquired by IBM in 2009. Versions released after 2015 carry the brand name IBM SPSS Statistics. The name originally stood for Statistical Package for the Social Sciences (SPSS), reflecting the original market, and was later changed to Statistical Product and Service Solutions.


Overview

SPSS is a widely used program for statistical analysis in social science. It is also used by market researchers, health researchers, survey companies, government, education researchers, marketing organizations, data miners, and others. The original SPSS manual (Nie, Bent & Hull, 1970) has been described as one of "sociology's most influential books" for allowing ordinary researchers to do their own statistical analysis. In addition to statistical analysis, data management (case selection, file reshaping, creating derived data) and data documentation (a metadata dictionary is stored in the data file) are features of the base software.

The many features of SPSS Statistics are accessible via pull-down menus or can be programmed with a proprietary 4GL command syntax language. Command syntax programming has the benefits of reproducible output, simplifying repetitive tasks, and handling complex data manipulations and analyses. Additionally, some complex applications can only be programmed in syntax and are not accessible through the menu structure. The pull-down menu interface also generates command syntax: this can be displayed in the output, although the default settings have to be changed to make the syntax visible to the user. The generated syntax can also be pasted into a syntax file using the "paste" button present in each menu. Programs can be run interactively or unattended, using the supplied Production Job Facility.

Additionally, a "macro" language can be used to write command language subroutines. A Python programmability extension can access the information in the data dictionary and data and dynamically build command syntax programs. The Python programmability extension, introduced in SPSS 14, replaced the less functional SAX Basic "scripts" for most purposes, although SAX Basic remains available. In addition, the Python extension allows SPSS to run any of the statistics in the free software package R. From version 14 onwards, SPSS can be driven externally by a Python or a VB.NET program using supplied "plug-ins". (From version 20 onwards, these two scripting facilities, as well as many scripts, are included on the installation media and are normally installed by default.)

SPSS Statistics places constraints on internal file structure, data types, data processing, and matching files, which together considerably simplify programming. SPSS datasets have a two-dimensional table structure, where the rows typically represent cases (such as individuals or households) and the columns represent measurements (such as age, sex, or household income). Only two data types are defined: numeric and text (or "string"). All data processing occurs sequentially case-by-case through the file (dataset). Files can be matched one-to-one and one-to-many, but not many-to-many. In addition to that cases-by-variables structure and processing, there is a separate Matrix session where one can process data as matrices using matrix and linear algebra operations.

The
graphical user interface has two views, which can be toggled by clicking on one of the two tabs in the bottom left of the SPSS Statistics window. The 'Data View' shows a spreadsheet view of the cases (rows) and variables (columns). Unlike spreadsheets, the data cells can only contain numbers or text, and formulas cannot be stored in these cells. The 'Variable View' displays the metadata dictionary, where each row represents a variable and shows the variable name, variable label, value label(s), print width, measurement type, and a variety of other characteristics. Cells in both views can be manually edited, defining the file structure and allowing data entry without using command syntax. This may be sufficient for small datasets. Larger datasets such as statistical surveys are more often created in data entry software, or entered during computer-assisted personal interviewing, by scanning and using optical character recognition and optical mark recognition software, or by direct capture from online questionnaires. These datasets are then read into SPSS.

SPSS Statistics can read data from and write data to ASCII text files (including hierarchical files), other statistics packages, spreadsheets and databases. It can also read and write to external relational database tables via ODBC and SQL.

Statistical output is to a
proprietary file format (*.spv file, supporting pivot tables) for which, in addition to the in-package viewer, a stand-alone reader can be downloaded. The proprietary output can be exported to text, Microsoft Word, PDF, Excel, and other formats. Alternatively, output can be captured as data (using the OMS command), as text, tab-delimited text, PDF, XLS, HTML, XML, SPSS dataset or a variety of graphic image formats (JPEG, PNG, BMP and EMF).

Several variants of SPSS Statistics exist. SPSS Statistics Gradpacks are highly discounted versions sold only to students. SPSS Statistics Server is a version of SPSS Statistics with a client/server architecture. Add-on packages can enhance the base software with additional features (examples include complex samples, which can adjust for clustered and stratified samples, and custom tables, which can create publication-ready tables). SPSS Statistics is available under either an annual or a monthly subscription license.

Version 25 of SPSS Statistics was launched on August 8, 2017. It added new and advanced statistics, such as random effects solution results (GENLINMIXED), robust standard errors (GLM/UNIANOVA), and profile plots with error bars within the Advanced Statistics and Custom Tables add-on. Version 25 also introduced Bayesian statistics capabilities (a method of statistical inference) and publication-ready charts, with new charting capabilities that include new default templates and the ability to share charts with Microsoft Office applications.
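The overview notes that the Python programmability extension can read the data dictionary and build command syntax dynamically. The sketch below illustrates only that idea in plain Python, outside SPSS. The `dictionary` mapping, the variable names, and the `build_syntax` helper are hypothetical stand-ins, and the generated text is printed rather than submitted to SPSS. DESCRIPTIVES and FREQUENCIES serve as example commands.

```python
# Hypothetical stand-in for an SPSS data dictionary: variable name -> type.
# SPSS defines only two data types, numeric and string.
dictionary = {
    "age": "numeric",
    "income": "numeric",
    "sex": "string",
    "region": "string",
}


def build_syntax(dictionary):
    """Build command syntax text from a dictionary (illustrative helper)."""
    numeric = [name for name, kind in dictionary.items() if kind == "numeric"]
    strings = [name for name, kind in dictionary.items() if kind == "string"]

    commands = []
    if numeric:
        # Summary statistics for the numeric variables.
        commands.append("DESCRIPTIVES VARIABLES=" + " ".join(numeric) + ".")
    if strings:
        # Frequency tables for the string variables.
        commands.append("FREQUENCIES VARIABLES=" + " ".join(strings) + ".")
    return "\n".join(commands)


syntax = build_syntax(dictionary)
print(syntax)
```

Because the commands are derived from the dictionary rather than typed by hand, the same helper keeps working when variables are added or renamed, which is the kind of repetitive task the overview says syntax programming simplifies.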


Versions and ownership history

* SPSS 1 - 1968
* SPSS 2 - 1983
* SPSS 5 - 1993
* SPSS 6.1 - 1995
* SPSS 7.5 - 1997
* SPSS 8 - 1998
* SPSS 9 - 1999
* SPSS 10 - 1999
* SPSS 11 - 2002
* SPSS 12 - 2004
* SPSS 13 - 2005
* SPSS 14 - 2006
* SPSS 15 - 2006
* SPSS 16 - 2007
* SPSS 17 - 2008
* PASW 17 - 2009
* PASW 18 - 2009
* SPSS 19 - 2010
* SPSS 20 - 2011
* SPSS 21 - 2012
* SPSS 22 - 2013
* SPSS 23 - 2015
* SPSS 24 - 2016, March
* SPSS 25 - 2017, July
* SPSS 26 - 2018
* SPSS 27 - 2019, June (and 27.0.1 in November 2020)
* SPSS 28 - 2021, May
* SPSS 29 - 2022, September

SPSS was first released in 1968 as the Statistical Package for the Social Sciences (SPSS), after being developed by Norman H. Nie, Dale H. Bent, and C. Hadlai Hull. Those principals incorporated as SPSS Inc. in 1975. Early versions of SPSS Statistics were written in Fortran and designed for batch processing on mainframes, including for example IBM and ICL versions, originally using punched cards for data and program input. A processing run read a command file of SPSS commands and either a raw input file of fixed-format data with a single record type, or a 'getfile' of data saved by a previous run. To save precious computer time, an 'edit' run could be done to check command syntax without analysing the data. From version 10 (SPSS-X) in 1983, data files could contain multiple record types.

Prior to SPSS 16.0, different versions of SPSS were available for
Windows, Mac OS X and Unix. SPSS Statistics version 13.0 for Mac OS X was not compatible with Intel-based Macintosh computers, due to the Rosetta emulation software causing errors in calculations. SPSS Statistics 15.0 for Windows needed a downloadable hotfix to be installed in order to be compatible with Windows Vista. From version 16.0, the same version runs under Windows, Mac, and Linux. The graphical user interface is written in Java. The Mac OS version is provided as a Universal binary, making it fully compatible with both PowerPC and Intel-based Mac hardware.

SPSS Inc. announced on July 28, 2009 that it was being acquired by IBM for US$1.2 billion. Because of a dispute about ownership of the name "SPSS", between 2009 and 2010 the product was referred to as PASW (Predictive Analytics SoftWare). As of January 2010, the company became "SPSS: An IBM Company". Complete transfer of business to IBM was done by October 1, 2010. By that date, SPSS: An IBM Company ceased to exist. IBM SPSS is now fully integrated into the IBM Corporation, and is one of the brands under IBM Software Group's Business Analytics Portfolio, together with IBM Algorithmics, IBM Cognos and IBM OpenPages.

Companion software in the "IBM SPSS" family is used for data mining and text analytics (IBM SPSS Modeler) and realtime credit scoring services (IBM SPSS Collaboration and Deployment Services). SPSS Data Collection and SPSS Dimensions were sold in 2015 to UNICOM Systems, Inc., a division of UNICOM Global, and merged into the integrated software suite UNICOM Intelligence (survey design, survey deployment, data collection, data management and reporting).


IDA (Interactive Data Analysis)

IDA (Interactive Data Analysis) was a software package that originated at what formerly was the National Opinion Research Center (NORC), at the University of Chicago. It was initially offered on the HP-2000; somewhat later, under the ownership of SPSS, it was also available on DEC's DECSYSTEM-20. Regression analysis was one of IDA's strong points.


SCSS - Conversational / Columnar SPSS

SCSS (the SPSS Conversational Statistical System) was a software product intended for online use of IBM mainframes. Although the "C" stood for Conversational, it also represented a distinction in how the data was stored: SCSS used a column-oriented rather than a row-oriented (internal) database. This gave good interactive response time for SCSS, whose strong point, as with SPSS, was cross-tabulation.
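The distinction between row-oriented and column-oriented storage can be illustrated with a small sketch. It does not reproduce the actual SCSS storage format, which is not described here; the sample data and the `crosstab` helper are made up for illustration. With column storage, a cross-tabulation reads only the two columns it needs instead of every full record.

```python
from collections import Counter

# Row-oriented: one record per case, all variables stored together.
rows = [
    {"id": 1, "sex": "F", "region": "North", "income": 42000},
    {"id": 2, "sex": "M", "region": "South", "income": 38000},
    {"id": 3, "sex": "F", "region": "South", "income": 51000},
    {"id": 4, "sex": "F", "region": "North", "income": 47000},
]

# Column-oriented: one list per variable, aligned by case position.
columns = {name: [row[name] for row in rows] for name in rows[0]}


def crosstab(columns, row_var, col_var):
    """Count cases in each (row_var, col_var) cell, touching two columns only."""
    return Counter(zip(columns[row_var], columns[col_var]))


table = crosstab(columns, "sex", "region")
for (sex, region), count in sorted(table.items()):
    print(sex, region, count)
```

In this toy layout the `id` and `income` columns are never read during the cross-tabulation, which hints at why a columnar layout can respond quickly to interactive tabulation requests.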


Project NX

In October 2020, IBM announced the start of an Early Access Program for the "New SPSS Statistics", codenamed Project NX. It contains "many of your favorite SPSS capabilities presented in a new easy to use interface, with integrated guidance, multiple tabs, improved graphs and much more". In December 2021, IBM opened the Early Access Program for the next generation of SPSS Statistics to more users and shared more visuals of it.


See also

* Comparison of statistical packages
* JASP and jamovi, both open-source and free-of-charge alternatives, offering frequentist and Bayesian models
* PSPP, a free SPSS replacement from the GNU Project
* SPSS Modeler




External links

* Official SPSS User Community
* 50 years of SPSS history
* Raynald Levesque's SPSS Tools - library of worked solutions for SPSS programmers (FAQ, command syntax, macros, scripts, Python)
* Archives of SPSSX-L Discussion - SPSS Listserv active since 1996; discusses programming, statistics and analysis
* UCLA ATS Resources to help you learn SPSS - resources for learning SPSS
* UCLA ATS Technical Reports - Report 1 compares Stata, SAS, and SPSS against R (R is a language and environment for statistical computing and graphics)
* SPSS Community - support for developers of applications using SPSS products, including materials and examples of the Python and R programmability features
* Biomedical Statistics - an educational website dedicated to statistical evaluation of biomedical data using SPSS software