Skip to main content

Posts Tagged ‘IBM SPSS’

Data Mining with IBM SPSS Modeler v15

Having recently completed the course “IBM SPSS Modeler & Data Mining” offered by Global Knowledge, I was looking to find more opportunities to do some modeling with SPSS Modeler. So, when I read in the news recently, about college recruiters using predictive techniques to determine the probability of a particular recruit graduating on time, I […]

Transforming Data with Loop and Vector

As promised, today I am sharing the fundamentals of using the LOOP and VECTOR in a SPSS Syntax script. LOOP LOOP is a common programming concept – it simply allows you to apply or execute the same script commands for an identified number of times without having to recode them.   VECTOR A VECTOR is […]

IBM SPSS Syntax -using Functions to perform Data Transformations

I recently finished another productive mentoring session exploring Data Transformations using SPSS Syntax. We had so much fun using some basic SPSS, I just had to share, so: Getting back to my Predictive startup example, we now have a new version of the quarterly hours file. This one includes a few new columns (“variables” to […]

IBM SPSS Syntax for File Operations

My start-up predictive analytics organization “Predictive Performers” wants to do some internal planning. We receive extract files from an an accounting service each month that provide the total hours billed per each of our consultants, along with each consultants hourly rate. The files are saved to a folder on our network: The files also breakout […]

IBM SPSS Statistics Syntax Best Practice

I recently audited the IBM course IBM SPSS Statistics Syntax I – ILO 0L406. In that course, you are introduced to the scripting language that IBM SPSS Statistics offers. It’s well worth your time. SPSS Syntax is a scripting language composed of a library of functions that can be used to modify, manage and analyze […]

A Data Mining Workbench

Data mining provides organizations with a clearer view of current conditions as well deeper insight into the future. In many previous posts, I’ve talked about IBM SPSS Statistics and Modeler. Here is some basic information: IBM® SPSS® Modeler Professional is the data mining workbench for the analysis of structured numerical data to model outcomes and […]

SPSS – Making good on an investment

Research (for example, “The Real ROI from SPSS”, NUCLEUS Research 2005), shows that IBM SPSS is a safe investment and indicate that 94 percent of SPSS customers achieve a positive return within less than a year! These returns are found by way of reduced costs, increased productivity, increased employee and customer satisfaction and greater visibility. […]

Dummy Coding with IBM SPSS

Dummy Coding with IBM SPSS To understand what is meant by dummy coding, you need to understand 2 forms of data: Qualitative or Quantitative? “Qualitative data describes items in terms of some quality or categorization while Quantitative data are described in terms of quantity (and in which a range of numerical values are used without […]

IBM SPSS Split File

You can use the IBM SPSS Split File feature to split your data pond into separate groups for further analysis –based on the values of one or more grouping variables. If you select multiple grouping variables, cases are grouped by each variable within categories of the preceding variable on the Groups Based On list. For […]

Simple Inferential Statistics

 Simple Inferential Statistics “Inferential statistics” is a term used to describe the use of information regarding a sample of subjects to make: (1) Assumptions about the population at large and/or (2) Predictions about what might happen in the future   What’s your Batting Average? You can calculate the mean (or average) batting average of a […]

Computing New Variables in IBM SPSS

Computing New Variables IBM SPSS Statistics offers more than 70 built-in functions to compute new variables based on your corporate logic. These include: • Arithmetic functions • Statistical functions • Distribution functions • Logical functions • Date and time aggregation and extraction functions • Missing-value functions • Cross-case functions • String functions   These functions […]

Ranking your Cases: IBM SPSS Statistics

Ranking A ranking is a relationship between a set of items such that, for any two items, the first is either “ranked higher than”, “ranked lower than” or “ranked equal to” the second. – Wikipedia   Ranking in SPSS Statistics IBM SPSS Statistics ranks cases in your data pond by automatically defining new variables to […]