
IBM SPSS Split File

You can use the IBM SPSS Split File feature to split your data pond into separate groups for further analysis, based on the values of one or more grouping variables.

If you select multiple grouping variables, cases are grouped by each variable within categories of the preceding variable on the Groups Based On list.

For example, if you select gender as the first grouping variable and minority as the second grouping variable, your cases are grouped by minority classification within each gender category.
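The nesting order can be illustrated outside SPSS. The following is a minimal Python sketch, not SPSS syntax; the case records and the `split_groups` helper are hypothetical and made up for illustration. It sorts the cases by the grouping variables in list order and then groups them, so minority classification varies within each gender category.

```python
from itertools import groupby
from operator import itemgetter

# Hypothetical cases; the variable names follow the gender/minority example.
cases = [
    {"id": 1, "gender": "f", "minority": 1},
    {"id": 2, "gender": "m", "minority": 0},
    {"id": 3, "gender": "f", "minority": 0},
    {"id": 4, "gender": "m", "minority": 1},
    {"id": 5, "gender": "f", "minority": 0},
]


def split_groups(rows, by):
    """Sort rows by the grouping variables, then group them in that order."""
    key = itemgetter(*by)
    ordered = sorted(rows, key=key)
    return [(k, [r["id"] for r in g]) for k, g in groupby(ordered, key=key)]


groups = split_groups(cases, ["gender", "minority"])
for key, ids in groups:
    print(key, ids)
```

Swapping the order of the list passed as `by` would instead group by gender within each minority category.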

Good to know:

• You can specify up to eight grouping variables.

• Each eight bytes of a long string variable (string variables longer than eight bytes) counts as a variable toward the limit of eight grouping variables.

• Cases should be sorted by values of the grouping variables and in the same order that variables are listed in the Groups Based On list. If the data file isn’t already sorted, select Sort the file by grouping variables.
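The sorting requirement matters because split-file processing starts a new group whenever the values of the grouping variables change from one case to the next. Python's `itertools.groupby` treats consecutive rows the same way, so it makes a convenient stand-in. The sketch below is an analogy, not SPSS code, and the `marital` values are made-up sample data.

```python
from itertools import groupby

# Made-up values of one grouping variable, in file order (unsorted).
marital = [0, 1, 0, 0, 1]


def runs(values):
    """Return (value, count) for each run of consecutive equal values."""
    return [(v, len(list(g))) for v, g in groupby(values)]


unsorted_groups = runs(marital)        # a new group at every change in value
sorted_groups = runs(sorted(marital))  # one group per distinct value

print(unsorted_groups)
print(sorted_groups)
```

On the unsorted list the two values are broken into four fragments; after sorting, each value forms a single group.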

In my data pond, I have variables indicating marital status (a 0 or a 1) and retirement status (again, a 0 or a 1). Using the SPSS Split File dialog, I chose “Organize output by groups”, selected my two variables (Marital status and Retired) under “Groups Based on”, and then selected “Sort the file by grouping variables”.


When I click the OK button, the result is a file divided into two “split files”: single people sorted by their retirement status and married people sorted by their retirement status.

Comparing Your Groups

These “split files” are really groups, not physically separate files. With Compare groups, the groups are presented together for comparison purposes; with Organize output by groups, all results from each procedure are displayed separately for each split-file group.

Pivot Tables

A single pivot table is created, and each split-file variable can be moved between table dimensions.

Charts

A separate chart is created for each split-file group and the charts are displayed together in the Viewer.
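The difference between the two output options, Compare groups and Organize output by groups, can be sketched in Python. This is an illustration only, not how SPSS renders its output; the `income` values, the group keys, and both helper functions are hypothetical. `compare_groups` builds a single table with one row per group, while `organize_by_groups` produces a separate block of results for each group.

```python
from statistics import mean

# Hypothetical split-file groups: (marital, retired) -> income values.
groups = {
    (0, 0): [30, 50],
    (0, 1): [20],
    (1, 0): [60, 80],
    (1, 1): [40, 40],
}


def compare_groups(data):
    """One table for all groups: a header row plus one row per group."""
    rows = ["marital retired mean_income"]
    rows += [f"{m} {r} {mean(v)}" for (m, r), v in data.items()]
    return "\n".join(rows)


def organize_by_groups(data):
    """A separate block of output for each group."""
    return [f"marital={m}, retired={r}\nmean_income {mean(v)}"
            for (m, r), v in data.items()]


print(compare_groups(groups))
print()
for block in organize_by_groups(groups):
    print(block, end="\n\n")
```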

Organizing Your Output Using Groups

To split a data pond for analysis, choose from the menus:

1. Data > Split File…

2. Select Compare groups or Organize output by groups.

3. Select one or more grouping variables.

Remember! If your data isn’t already sorted by values of the grouping variables you’ll need to select Sort the file by grouping variables!

About the Author

Mr. Miller is an IBM certified and accomplished Senior Project Leader and Application/System Architect-Developer with over 30 years of extensive applications and system design and development experience. His current role is National FPM Practice Leader. His experience includes BI, Web architecture & design, systems analysis, GUI design and testing, database modeling and systems analysis, design, and development of Client/Server, Web and Mainframe applications and systems utilizing: Applix TM1 (including TM1 rules, TI, TM1Web and Planning Manager), dynaSight - ArcPlan, ASP, DHTML, XML, IIS, MS Visual Basic and VBA, Visual Studio, PERL, Websuite, MS SQL Server, ORACLE, SYBASE SQL Server, etc. His responsibilities have included all aspects of Windows and SQL solution development and design including: analysis; GUI (and Web site) design; data modeling; table, screen/form and script development; SQL (and remote stored procedures and triggers) development and testing; test preparation and management and training of programming staff. Other experience includes development of ETL infrastructure such as data transfer automation between mainframe (DB2, Lawson, Great Plains, etc.) systems and client/server SQL server and Web based applications and integration of enterprise applications and data sources. In addition, Mr. Miller has acted as Internet Applications Development Manager responsible for the design, development, QA and delivery of multiple Web Sites including online trading applications, warehouse process control and scheduling systems and administrative and control applications. Mr. Miller also was responsible for the design, development and administration of a Web based financial reporting system for a 450 million dollar organization, reporting directly to the CFO and his executive team. Mr. Miller has also been responsible for managing and directing multiple resources in various management roles including project and team leader, lead developer and applications development director. Specialties include: Cognos/TM1 Design and Development, Cognos Planning, IBM SPSS and Modeler, OLAP, Visual Basic, SQL Server, Forecasting and Planning; International Application Development, Business Intelligence, Project Development. IBM Certified Developer - Cognos TM1 (perfect score 100% on exam). IBM Certified Business Analyst - Cognos TM1.
