Which Modeling Approach Should You Use with SPSS Modeler?

Jim Miller, Senior Solutions Architect at Perficient, recently wrote a blog post explaining the many options when using IBM SPSS Modeler. If you come from a TM1 background (more business than statistics), it is easy to get started with modeling once you determine your modeling objective, and Modeler can help with that. IBM SPSS Modeler offers an intuitive […]

IBM SPSS Modeler and Duplicate Data

Transactional datasets (especially those originating from databases) may contain duplicate records that must be removed before any modeling can begin. There are just two situations in which duplicate records occur: datasets ARE erroneous (causing the same record to appear multiple times), or datasets ARE NOT erroneous (but records appear multiple times because information is collected at different moments […]

Missing Data – “Nothing from nothing” – leaves something?

In TM1, missing data usually means that there is a defect in the logic of your ETL script or you need to check your SQL. In SPSS Modeler, missing values arise for a variety of reasons and they must be considered carefully. You might expect that missing values imply errors or should those records be […]

Data Indiscretions

Data loaded into a TM1 or SPSS model will, in most cases, include files consisting of thousands (or hundreds of thousands) of records. It is not reasonable, given the number of fields and records in files of this size, for you to visually inspect all fields in every record (of every file) for missing or […]

Primary Practices for Examining Data

SPSS Data Audit Node: Once data is imported into SPSS Modeler, the next step is to explore the data and to become “thoroughly acquainted” with its characteristics. Most (if not all) data will contain problems or errors such as missing information and/or invalid values. Before any real work can be done using […]

Data Consumption – Cognos TM1 vs. SPSS Modeler

In TM1, you may be used to “integer or string”; in SPSS Modeler, data gets much more interesting. In fact, you will need to be familiar with a concept known as “Field Measurement Level” and the practice of “Data Instantiation”. In TM1, data is transformed by aggregation, multiplication or division, concatenation or translation, and so […]

Importing Data into SPSS Modeler for the TM1 Developer

If you have a TM1 background, it is a quick step to using SPSS Modeler if you look for similarities in how the tools handle certain tasks, such as importing data. With TM1, source data is transformed and loaded into cube structures for consolidation, modeling and reporting using its ETL tool TurboIntegrator. In SPSS […]

Data Mining with IBM SPSS Modeler v15

Having recently completed the course “IBM SPSS Modeler & Data Mining” offered by Global Knowledge, I was looking to find more opportunities to do some modeling with SPSS Modeler. So, when I recently read in the news about college recruiters using predictive techniques to determine the probability of a particular recruit graduating on time, I […]

CFO Performance Insight – Déjà vu?

Recently, I attended the IBM Vision conference in Orlando. At the conference, I watched a presentation on what our friends at IBM are calling one of their “signature solutions”: CFO Performance Insight. This reminded me of various blog posts of mine, such as: Reengineering the Forecasting Process with Predictive Models (Nov 2nd 2012) and Forecasting […]

CRISP and IBM Cognos TM1

CRISP stands for Cross Industry Standard Process. It is a process model that describes commonly used approaches that experts use to tackle problems. Typically, you’ll hear of CRISP in the context of CRISP-DM, defining a process or methodology that breaks the process of data mining into six major phases. A little more about the CRISP […]

Transforming Data with Loop and Vector

As promised, today I am sharing the fundamentals of using LOOP and VECTOR in an SPSS Syntax script. LOOP: LOOP is a common programming concept – it simply allows you to apply or execute the same script commands a specified number of times without having to recode them. VECTOR: A VECTOR is […]

IBM Vision 2013 – 2 thumbs Up!

I just returned from the IBM Vision Conference in Orlando, Florida. I attended a session in every available timeslot from Monday morning to Wednesday afternoon and it was worth every single minute of my time! Although there were too many sessions and presenters to mention, here are my “top picks”: Designing Solutions with IBM Cognos […]