34
Chapter 4
have been resolved adequately. Similarly, the evaluation phase can lead you to reevaluate your
original business understanding, and you may decide that you have been trying to answer the
wrong question. At this point, you can revise your business understanding a ndp roceed through
the rest of the process again with a better target in mind.
The second key point is the iterative nature of data mining. Youwill rarely, if ever, simply
plan a data mining project, complete it, and then pack up your data and go home. Datamin ing to
address your customers’ demands is an ongoing endeavor. The knowledge gainedfrom one cycle
of data mining will almost invariably lead to new questions, new issues, and new opportunities
to identify and meet your customers’ needs. Those new questions, issues, and opp ortunities can
usually be addressed by mining your data once again. This process of mining and identifyin g new
opportunities should become part of the way you think about your business and a cor nerstone of
your overall business strategy.
This introduction provides only a brief overview of the CRISP-DM process model. For
complete details on the model, consult the following resources:
The CRISP-DM Guide, which can be accessed along with other documentation from the
\Documentation folder on the installation disk.
The CRISP-DM Help system, available from the Start menu or by clicking CRISP-DM Help on
the Help menu in IBM® SPSS® Modeler.
Types of Models
IBM® SPSS® Modeler offers a variety of modeling methods taken from machine learning,
articial intelligence, and statistics. The m ethods available on the Modeling palette allow you
to derive new information from your data and to develop predictive models. Each method has
certain strengths and is best suited for particular types of problems.
The SPSS Modeler Applications Guide provides examples for many of these methods, a long
with a general introduction to the modeling process. Th is guide is available as an online tutorial,
and also in PDF format. For more information, see the topic Application Examples in Chapter1
on p. 5.
Modeling methods are divided into three categories:
Classication
Association
Segmentation
Classification Models
Classication models use the values of one or more input elds to pre dict the value of one or
more output, or target,elds. Some examples of these techniqu es are: decision trees (C&R Tree,
QUEST,CHAID andC5.0 algorith ms), regression (linear, logistic, generalized linear, and Cox
regression algorithms), neural networks, support vector machines, and Bayesian networks.
Classication models helps organizations to predict a known result, such as whether a customer
will buy or leave or whether a transaction ts a known pattern of fraud. Modeling techniques
include machine learning, rule induction, subgroup identication, statistical methods, and m ultiple
model generation.