
Posts Tagged ‘TM1’

IBM SPSS and Frequencies Command

The Frequencies command is one of the simplest yet most useful descriptive techniques. Its objective is simply to count the number of instances within a particular category. For example, the following questions could be easily answered: How many males and females make up my data pond? What is the number of ethnic […]

SPSS Codebook

“A codebook is a type of document used for gathering and storing codes. Originally codebooks were often literally books, but today codebook is a byword for the complete record of a series of codes, regardless of physical format.” (Wikipedia) The Codebook command was introduced in IBM SPSS Statistics version 17. It provides information about the […]

Multiple Data Sources and Predictive Modeling

Knowledge Bottleneck? When building a predictive model, the more examples or “cases” considered, the better the model tends to be. Typically, these cases exist in multiple data files (or data sources) that must be “stitched together”. The task of accessing each data source, performing some analysis on the cases contained in the data and formatting […]

A Simple Analytical Architectural Strategy

Over the last month, I’ve been taking a tactical view of analytics by focusing on some of the specific features of IBM SPSS Statistics, so today I have decided to think a bit more “strategically”. If your organization wants to begin leveraging survey-response data, for example, what might be a reasonable approach? If I […]

SPSS Virtual Files

The power of SPSS allows the data scientist or predictive modeler to consume large data volumes. This data may come in smaller, manageable subsets or possibly huge “data ponds”. Depending upon the procedures you will be performing in your analysis, SPSS may reread the entire data set for each procedure. Of course, procedures that change […]

Automated Data Preparation (ADP) IBM SPSS Statistics Base

Automated Data Preparation (ADP): The seasoned data scientist knows that probably the single most important step in creating a predictive model is pinpointing the appropriate “data pond” and ensuring that it is properly “prepared”. I’ve written about the many “out of the box” tools that SPSS users can use to manage data, such as […]

IBM SPSS Time

IBM SPSS defines each variable with a “TYPE”. By default, all variables in SPSS are assumed to be numeric until you change them. SPSS V20 currently supports the following variable types: Numeric and String (the most common), Comma, Dot, Scientific Notation, Date, Dollar, Custom Currency, and Restricted Numeric. What day is it? Today I want […]

Predictive Model Engineering

Organizations interested in using analytics to predict outcomes will score data pools by applying an appropriate predictive model. Pre-built predictive models are becoming increasingly available in the marketplace. Data scientists who are knowledge experts in particular areas are developing models that have increasingly better success rates. However, the best approach may be for an […]

Lift Analysis and IBM SPSS

Defining what lift is: “Lift” is the measure used to determine how well your targeting model identifies cases that have a greater response than the population as a whole. Your model may be doing its job if the response (within the target) is better than the average response of the population […]

IBM SPSS Statistics – Data Management Toolset

IBM SPSS Statistics – Data Management Toolset (DMS): In a recent blog post, I listed some of the more helpful “data management tools” offered within IBM SPSS Statistics version 20 (Case Summaries, Replace Missing Values, Transform and Compute, Recode, Select Cases, Sort Cases, and Merge Files), and I would like to review them today. These tools […]

IBM SPSS Statistics – Continued Exploration

Getting Started…Again: Back to Statistics. I restart IBM SPSS and, from the startup/open dialog, locate my previously defined data file in the “Open an existing data source” list and click OK. My file opens in the Data Editor (just as I left it), and the Statistics Viewer shows the very first transaction, “GET” (and then […]

SPSS Collaboration and Deployment Services

Last time, I mentioned IBM SPSS Collaboration and Deployment Services and promised to talk more about it, so here we go. Analytical Assets: Organizations positioning themselves to take full advantage of analytics will look to separate the effort of developing analytical assets from actually using them, dividing the work between “creators” and “consumers”. Generally speaking, an […]
