|
Typical Data Mining Process
for Predictive Modeling
|
The following diagram depicts the overall process for a typical data
mining project in which we are doing predictive
modeling using techniques such as linear or logistic regression, nominal or ordinal CHAID,
etc. As the diagram shows, we begin by defining the business problem to be solved. Then we go through a process of
data discovery, which might include examining not only the client's proprietary data, but also available syndicated
demographic and lifestyle data that we may be able to overlay onto the client's data to enhance the
information.
Before we begin the predictive modeling process, we split the data sample in half. One half will be used to
build the predictive model; the other half will be used as a "holdout" sample on which to validate the predictive
model. Finally, we apply the model parameters to the client's master data file using a scoring algorithm. This will
allow the client to achieve more profitable customer prospecting or customer cross-sell/up-sell.

Back to Marketing Analytics page
|