SPSS
Developer(s)  IBM Corporation 

Initial release  1968 
Stable release  26.0 / April 9, 2019

Operating system  Windows, macOS, Linux on z Systems, Linux and UNIX 
Platform  Java 
Size  ~700 MB 
Type  Statistical analysis, data mining, text analytics, data collection 
License  Trialware or SaaS 
Website  www.ibm.com/products/spssstatistics 
SPSS Statistics is a software package used for interactive, or batched, statistical analysis. Long produced by SPSS Inc., it was acquired by IBM in 2009. The current versions (2015) are named IBM SPSS Statistics.
The software name originally stood for Statistical Package for the Social Sciences (SPSS),^{[1]} reflecting the original market, although the software is now popular in other fields as well, including the health sciences and marketing.
Overview
SPSS is a widely used program for statistical analysis in social science. It is also used by market researchers, health researchers, survey companies, government, education researchers, marketing organizations, data miners,^{[2]} and others. The original SPSS manual (Nie, Bent & Hull, 1970)^{[3]} has been described as one of "sociology's most influential books" for allowing ordinary researchers to do their own statistical analysis.^{[4]} In addition to statistical analysis, data management (case selection, file reshaping, creating derived data) and data documentation (a metadata dictionary is stored in the datafile) are features of the base software.
Statistics included in the base software:
 Descriptive statistics: Cross tabulation, Frequencies, Descriptives, Explore, Descriptive Ratio Statistics
 Bivariate statistics: Means, t-test, ANOVA, Correlation (bivariate, partial, distances), Nonparametric tests, Bayesian
 Prediction for numerical outcomes: Linear regression
 Prediction for identifying groups: Factor analysis, cluster analysis (two-step, K-means, hierarchical), Discriminant
 Geospatial analysis, simulation
 R extension (GUI), Python
The many features of SPSS Statistics are accessible via pull-down menus or can be programmed with a proprietary 4GL command syntax language. Command syntax programming has the benefits of reproducible output, simplifying repetitive tasks, and handling complex data manipulations and analyses. Additionally, some complex applications can only be programmed in syntax and are not accessible through the menu structure. The pull-down menu interface also generates command syntax: this can be displayed in the output, although the default settings have to be changed to make the syntax visible to the user. The syntax can also be pasted into a syntax file using the "paste" button present in each menu. Programs can be run interactively or unattended, using the supplied Production Job Facility.
Additionally, a "macro" language can be used to write command language subroutines. A Python programmability extension can access the information in the data dictionary and data and dynamically build command syntax programs. The Python programmability extension, introduced in SPSS 14, replaced the less functional SAX Basic "scripts" for most purposes, although SAX Basic remains available. In addition, the Python extension allows SPSS to run any of the statistics in the free software package R. From version 14 onwards, SPSS can be driven externally by a Python or a VB.NET program using supplied "plug-ins". (From version 20 onwards, these two scripting facilities, as well as many scripts, are included on the installation media and are normally installed by default.)
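As a rough illustration of what dynamically building command syntax can look like, the following sketch generates syntax text from a small, made-up variable dictionary. It is plain Python and does not call SPSS; the dictionary contents and the `build_syntax` helper are hypothetical. Inside an SPSS session, text generated this way would typically be handed to the Python extension for execution (for example through its `spss.Submit` function).

```python
# Hypothetical data dictionary: variable name -> type ("numeric" or "string").
# In a real session this information would come from the active dataset.
DICTIONARY = {
    "age": "numeric",
    "income": "numeric",
    "region": "string",
}


def build_syntax(dictionary):
    """Return SPSS command syntax chosen according to each variable's type."""
    numeric = [name for name, kind in dictionary.items() if kind == "numeric"]
    strings = [name for name, kind in dictionary.items() if kind == "string"]
    commands = []
    if numeric:
        # Summary statistics only make sense for numeric variables.
        commands.append("DESCRIPTIVES VARIABLES=" + " ".join(numeric) + ".")
    if strings:
        # Frequency tables suit string (categorical) variables.
        commands.append("FREQUENCIES VARIABLES=" + " ".join(strings) + ".")
    return "\n".join(commands)


print(build_syntax(DICTIONARY))
```

The point of the sketch is that the commands are derived from the dictionary rather than typed by hand, so the same program adapts to a dataset with different variables.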
SPSS Statistics places constraints on internal file structure, data types, data processing, and matching files, which together considerably simplify programming. SPSS datasets have a two-dimensional table structure, where the rows typically represent cases (such as individuals or households) and the columns represent measurements (such as age, sex, or household income). Only two data types are defined: numeric and text (or "string"). All data processing occurs sequentially case-by-case through the file (dataset). Files can be matched one-to-one and one-to-many, but not many-to-many. In addition to that cases-by-variables structure and processing, there is a separate Matrix session where one can process data as matrices using matrix and linear algebra operations.
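The cases-by-variables structure and a one-to-many match can be sketched in a few lines of Python. This is only a conceptual model, not how SPSS itself is implemented: the household and person data, the key name `hh_id`, and the `match_one_to_many` helper are all invented for illustration.

```python
# Hypothetical household-level table (one row per household), keyed by hh_id.
households = {
    1: {"hh_income": 52000},
    2: {"hh_income": 31000},
}

# Hypothetical person-level case file: many persons can share one household.
persons = [
    {"person_id": 10, "hh_id": 1, "age": 34},
    {"person_id": 11, "hh_id": 1, "age": 36},
    {"person_id": 12, "hh_id": 2, "age": 58},
    {"person_id": 13, "hh_id": 3, "age": 41},  # no matching household
]


def match_one_to_many(cases, table, key):
    """Attach table variables to each case in a single sequential pass."""
    table_vars = sorted({name for row in table.values() for name in row})
    merged = []
    for case in cases:  # cases are processed one at a time, in file order
        row = table.get(case[key], {})
        new_case = dict(case)
        for name in table_vars:
            # Unmatched cases get None, standing in for a missing value.
            new_case[name] = row.get(name)
        merged.append(new_case)
    return merged


for case in match_one_to_many(persons, households, "hh_id"):
    print(case)
```

Each household row is spread across every person who shares its key, while the output keeps exactly one row per person. A many-to-many match has no such unambiguous result, which is one way to see why the restriction simplifies programming.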
The graphical user interface has two views which can be toggled by clicking on one of the two tabs in the bottom left of the SPSS Statistics window. The 'Data View' shows a spreadsheet view of the cases (rows) and variables (columns). Unlike spreadsheets, the data cells can only contain numbers or text, and formulas cannot be stored in these cells. The 'Variable View' displays the metadata dictionary where each row represents a variable and shows the variable name, variable label, value label(s), print width, measurement type, and a variety of other characteristics. Cells in both views can be manually edited, defining the file structure and allowing data entry without using command syntax. This may be sufficient for small datasets. Larger datasets such as statistical surveys are more often created in data entry software, or entered during computer-assisted personal interviewing, by scanning and using optical character recognition and optical mark recognition software, or by direct capture from online questionnaires. These datasets are then read into SPSS.
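The split between data cells and the metadata dictionary can be modelled roughly as follows. The `Variable` class, its fields, and the sample values are hypothetical and cover only a few of the characteristics listed above; the sketch is not a description of the SPSS file format.

```python
from dataclasses import dataclass, field


@dataclass
class Variable:
    """One row of a hypothetical variable dictionary."""
    name: str
    label: str = ""
    measure: str = "scale"  # e.g. "nominal", "ordinal" or "scale"
    value_labels: dict = field(default_factory=dict)


# Hypothetical dictionary entries for two variables.
variables = [
    Variable("sex", "Sex of respondent", "nominal", {1: "Male", 2: "Female"}),
    Variable("age", "Age in years"),
]

# Data cells hold only plain values (numbers or text), never formulas.
cases = [
    {"sex": 1, "age": 34},
    {"sex": 2, "age": 58},
]


def display_value(variable, value):
    """Return the value label if one is defined, else the raw value as text."""
    return variable.value_labels.get(value, str(value))


for case in cases:
    print({v.name: display_value(v, case[v.name]) for v in variables})
```

Keeping labels in the dictionary rather than in the cells means the stored data stay compact and numeric, while output can still show readable category names.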
SPSS Statistics can read and write data from ASCII text files (including hierarchical files), other statistics packages, spreadsheets and databases. SPSS Statistics can read and write to external relational database tables via ODBC and SQL.
Statistical output is written to a proprietary file format (*.spv file, supporting pivot tables) for which, in addition to the in-package viewer, a stand-alone reader can be downloaded. The proprietary output can be exported to text or Microsoft Word, PDF, Excel, and other formats. Alternatively, output can be captured as data (using the OMS command), as text, tab-delimited text, PDF, XLS, HTML, XML, SPSS dataset or a variety of graphic image formats (JPEG, PNG, BMP and EMF).
Several variants of SPSS Statistics exist. SPSS Statistics Gradpacks are highly discounted versions sold only to students. SPSS Statistics Server is a version of SPSS Statistics with a client/server architecture. Add-on packages can enhance the base software with additional features (examples include complex samples, which can adjust for clustered and stratified samples, and custom tables, which can create publication-ready tables). SPSS Statistics is available under either an annual or a monthly subscription license.
SPSS Statistics version 25 was launched on August 8, 2017. It adds new and advanced statistics, such as random effects solution results (GENLINMIXED), robust standard errors (GLM/UNIANOVA), and profile plots with error bars, within the Advanced Statistics and Custom Tables add-ons. Version 25 also includes new Bayesian statistics capabilities (Bayesian statistics being a method of statistical inference) and publication-ready charts, with new charting capabilities that include new default templates and the ability to share charts with Microsoft Office applications.^{[5]}
Versions and ownership history
The software was released in its first version in 1968 as the Statistical Package for the Social Sciences (SPSS) after being developed by Norman H. Nie, Dale H. Bent, and C. Hadlai Hull. Those principals incorporated as SPSS Inc. in 1975. Early versions of SPSS Statistics were written in Fortran and designed for batch processing on mainframes, including for example IBM and ICL versions, originally using punched cards for data and program input. A processing run read a command file of SPSS commands and either a raw input file of fixed-format data with a single record type, or a 'getfile' of data saved by a previous run. To save precious computer time an 'edit' run could be done to check command syntax without analysing the data. From version 10 (SPSS-X) in 1983, data files could contain multiple record types.
Prior to SPSS 16.0, different versions of SPSS were available for Windows, Mac OS X and Unix.
SPSS Statistics version 13.0 for Mac OS X was not compatible with Intel-based Macintosh computers, due to the Rosetta emulation software causing errors in calculations. SPSS Statistics 15.0 for Windows needed a downloadable hotfix to be installed in order to be compatible with Windows Vista.
From version 16.0 the same version runs under Windows, Mac, and Linux. The graphical user interface is written in Java. The Mac OS version is provided as a Universal binary, making it fully compatible with both PowerPC and Intel-based Mac hardware.
SPSS Inc. announced on July 28, 2009 that it was being acquired by IBM for US$1.2 billion.^{[6]} Because of a dispute about ownership of the name "SPSS", between 2009 and 2010 the product was referred to as PASW (Predictive Analytics SoftWare).^{[7]} As of January 2010, the company became "SPSS: An IBM Company". The complete transfer of business to IBM was finished by October 1, 2010, at which date SPSS: An IBM Company ceased to exist. IBM SPSS is now fully integrated into the IBM Corporation, and is one of the brands under IBM Software Group's Business Analytics Portfolio, together with IBM Algorithmics, IBM Cognos and IBM OpenPages.
Companion software in the "IBM SPSS" family is used for data mining and text analytics (IBM SPSS Modeler), and real-time credit scoring services (IBM SPSS Collaboration and Deployment Services).
SPSS Data Collection and SPSS Dimensions were sold in 2015 to UNICOM Systems, Inc., a division of UNICOM Global, and merged into the integrated software suite UNICOM Intelligence (survey design, survey deployment, data collection, data management and reporting).^{[8]}^{[9]}^{[10]}
IDA (Interactive Data Analysis)
IDA (Interactive Data Analysis)^{[11]} was a software package that originated at what was formerly the National Opinion Research Center (NORC) at the University of Chicago. It was initially offered on the HP-2000;^{[12]} somewhat later, under the ownership of SPSS, it was also available on DEC's DECSYSTEM-20.^{[13]}
SCSS - Conversational / Columnar SPSS
SCSS was a software product intended for online use of IBM mainframes.^{[14]}
Although the "C" was for Conversational, it also represented a distinction regarding how the data was stored: it used a column-oriented rather than a row-oriented (internal) database.^{[citation needed]}
This gave good interactive response time for the SPSS Conversational Statistical System (SCSS), whose strong point, as with SPSS, was cross-tabulation.^{[15]}
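The difference between the two layouts, and why a column-oriented one suits cross-tabulation, can be sketched as follows. This is a generic illustration with invented data and helper names (`to_columns`, `crosstab`); it makes no claim about how SCSS actually stored or processed its files.

```python
from collections import Counter

# Row-oriented layout: one record per case.
rows = [
    {"sex": "F", "region": "North", "age": 34},
    {"sex": "M", "region": "South", "age": 58},
    {"sex": "F", "region": "South", "age": 41},
    {"sex": "F", "region": "North", "age": 29},
]


def to_columns(records):
    """Convert row-oriented records into one list per variable."""
    return {name: [rec[name] for rec in records] for name in records[0]}


def crosstab(columns, row_var, col_var):
    """Count cases for each (row_var, col_var) pair using only two columns."""
    return Counter(zip(columns[row_var], columns[col_var]))


columns = to_columns(rows)
table = crosstab(columns, "sex", "region")
for (sex, region), count in sorted(table.items()):
    print(sex, region, count)
```

In the column layout a two-way table touches only the two variables involved, whereas a row layout has to pass over every variable of every case. That is the usual argument for column storage in interactive tabulation.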
See also
 Comparison of statistical packages
 SPSS Modeler
 PSPP, a free SPSS replacement from the GNU Project
References
 ^ Quintero, Dino; et al. "Workload Optimized Systems: Tuning POWER7 for Analytics". Abstract.
 ^ "KDnuggets Annual Software Poll: Analytics/Data mining software used?". KDnuggets. May 2013.
 ^ Nie, Norman H; Bent, Dale H; Hadlai Hull, C (1970). "SPSS: Statistical package for the social sciences".
 ^ Wellman. 1998. pp. 71–78.
 ^ "What's New in SPSS Statistics 25 & Subscription  SPSS Predictive Analytics". SPSS Predictive Analytics. 18 July 2017. Retrieved 15 December 2017.
 ^ IBM press release. 2009.
 ^ Sachdev, Ameet (September 27, 2009). "IBM's $1.2 billion bid for SPSS Inc. helps resolve trademark dispute". Chicago Tribune.
 ^ "IBM SPSS Data Collection Divestiture". 2 February 2016. Retrieved 7 June 2017.
 ^ "UNICOM® Global Acquires IBM® Data Collection Suite from IBM Corp". 31 October 2015. Retrieved 7 June 2017.
 ^ https://teamblue.unicomsi.com/products/unicomintelligence/
 ^ or Analyzer
 ^ Ling, Robert F; Roberts, Harry V (1975). "IDA: An Approach to Interactive Data Analysis in Teaching". The Journal of Business. 48 (3): 411–451. JSTOR 2352233.
 ^ "IDA" (PDF).
 ^ SCSS: A User's Guide to the SPSS Conversational Statistical System. 1980. ISBN 9780070465336.
 ^ "SCSS from SPSS, Inc". ComputerWorld. September 26, 1977. p. 28.
Further reading
 Argyrous, G. (2005-11-23). Statistics for Research: With a Guide to SPSS. London: SAGE. ISBN 9781412919487.
 Bryman, Alan; Cramer, Duncan (2011). Quantitative Data Analysis with IBM SPSS 17, 18 and 19: A Guide for Social Scientists. New York: Routledge. ISBN 9780415579186.
 Levesque, R. (2007). SPSS Programming and Data Management: A Guide for SPSS and SAS Users (4th ed.). Chicago, Illinois: SPSS Inc. ISBN 9781568273907.
 SPSS 15.0 Command Syntax Reference. Chicago, Illinois: SPSS Inc. 2006.
 Wellman, B. (1998). "Doing It Ourselves: The SPSS Manual as Sociology's Most Influential Recent Book". In Clawson, Dan (ed.). Required Reading: Sociology's Most Influential Books. Amherst: University of Massachusetts Press. pp. 71–78. ISBN 9781558491533.
External links
 Official website
 Official SPSS User Community
 50 years of SPSS history
 Raynald Levesque's SPSS Tools – library of worked solutions for SPSS programmers (FAQ, command syntax; macros; scripts; Python)
 Archives of SPSSX-L Discussion – SPSS Listserv active since 1996. Discusses programming, statistics and analysis
 UCLA ATS Resources to help you learn SPSS – Resources for learning SPSS
 UCLA ATS Technical Reports – Report 1 compares Stata, SAS, and SPSS against R (R is a language and environment for statistical computing and graphics).
 SPSS Community – Support for developers of applications using SPSS products, including materials and examples of the Python and R programmability features
 Biomedical Statistics – An educational website dedicated to statistical evaluation of biomedical data using SPSS software