Posts Tagged ‘PMML’

Software Remediation Excellence

Overview The purpose of this post is to provide a strategy for approaching and resolving a “concern” that has been identified within a application. Overall, the idea is to “add structure” to the process of responding to defects (remediation) in your application. Steps to Remediation Using a resolution methodology referred to as CERT, the steps […]

Chi-Squared Challenging using SPSS

  Chi-Squared Challenging using SPSS A Chi-Square Challenge (or Test) procedure organizes your data pond variables into groups and computes a chi-square statistic. Here is the specific definition: “The chi-square (chi, the Greek letter pronounced “kye”) statistic is a statistical technique used to determine if a “distribution of observed frequencies” differs from the “theoretical expected […]

IBM SPSS Split File

You can use the IBM SPSS Split File feature to split your data pond into separate groups for further analysis –based on the values of one or more grouping variables. If you select multiple grouping variables, cases are grouped by each variable within categories of the preceding variable on the Groups Based On list. For […]

Metadata Attributes

IBM SPSS Statistics offers many ways to help save time when analyzing data, particularly if you are continually performing the same types of analysis on similar sets of pools of data. TIME SAVERS “Metadata Attributes” – Data attributes have properties associated with them, and these properties are defined in metadata.   During data analysis, documentation […]

Automated Data Preparation (ADP) IBM SPSS Statistics Base

Automated Data Preparation (ADP)   The seasoned data scientist knows that probably the single most import step in creating a predictive model is pinpointing the appropriate “data pond” and ensuring that it is properly “prepared”. I’ve written about the many “out of the box” tools that SPSS users can use to manage data, such as […]

IBM SPSS Add-On Modules

  Serious Analytical Architect? Any serious analytic architect will need to at least be aware of the individual SPSS products offered and have at least a basic understanding of what each of them can do. Here is the list: IBM Showcase Report Writer Quickly create professional-looking, presentation-quality reports using intuitive, word processor-like page layout and […]

Predictive Model Engineering

Organizations interested in using analytics to predict outcomes will score data pools by applying an appropriate predictive model. Pre-built predictive models are becoming increasingly available in the market place. Data scientists that are knowledge experts in particular areas are developing models that have increasingly better success rates. However the best approach may be for an […]

Lift Analysis and IBM SPSS

Defining what lift is “Lift” is the measure used to determine how well your targeting model does at prophesying cases as having a greater response with respect to the population as a whole. Your model may be doing its job if the response (within the target) is better than the average response of the population […]

IBM SPSS Statistics – Continued Exploration

Getting Started…Again Back to Statistics; I restart IBM SPSS and from the startup/open dialog, locate my previously defined data file from the “Open an existing data source” list and click OK. My file opens in the data editor (just as I left it) and the Statistics Viewer shows the very first transaction “GET” (and then […]

SPSS Collaboration and Deployment Services

Last time I mentioned IBM SPSS collaboration and deployment services and promised to talk more about it – so here we go: Analytical Assets Organizations positioning themselves to take full advantage of analytics will look to separate the effort of developing analytical assets and actually using them – between “creators” and “consumers”.  Generally speaking, an […]

Basic Data Analysis and IBM SPSS

    The basic steps in data analysis might be simplified into (1) Identifying data, (2) Selecting an analysis and summarization method and (3) Presenting the results. Over the next couple of weeks I will look at using IBM SPSS version 20 to accomplish these tasks. Today, I want to focus on loading a data […]

Interoperability and PMML

If you work within the rapidly expanding analytics space, you will need to think about defining and sharing statistical models between applications. PMML (or Predictive Model Markup Language) is an XML-based language developed by the Data Mining Group (DMG) for this purpose. I’d like to pass on some of the essentials: The Basics PMML provides […]