Sample Files
stroke_clean.sav. This hypothetical data file contains the state of a medical database after it
has been cleaned using procedures in the Data Preparation option.
stroke_invalid.sav. This hypothetical data file contains the initial state of a medical database
and contains several data entry errors.
stroke_survival. This hypothetical data file concerns survival times for patients exiting a
rehabilitation program after an ischemic stroke, who face a number of challenges. Post-stroke, the
occurrence of myocardial infarction, ischemic stroke, or hemorrhagic stroke is noted and the
time of the event recorded. The sample is left-truncated because it only includes patients who
survived through the end of the rehabilitation program administered post-stroke.
stroke_valid.sav. This hypothetical data file contains the state of a medical database after the
values have been checked using the Validate Data procedure. It still contains potentially
anomalous cases.
survey_sample.sav. This data file contains survey data, including demographic data and
various attitude measures. It is based on a subset of variables from the 1998 NORC General
Social Survey, although some data values have been modified and additional fictitious
variables have been added for demonstration purposes.
telco.sav. This is a hypothetical data file that concerns a telecommunications company’s
efforts to reduce churn in its customer base. Each case corresponds to a separate customer
and records various demographic and service usage information.
telco_extra.sav. This data file is similar to the telco.sav data file, but the “tenure” and
log-transformed customer spending variables have been removed and replaced by
standardized log-transformed customer spending variables.
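As a rough illustration of what a standardized log-transformed variable is, the sketch below takes the logarithm of each value and then converts the results to z-scores. It is not the procedure used to build telco_extra.sav: the function name, the sample values, the choice of natural logarithm, and the use of the sample standard deviation are all assumptions made for the example.

```python
import math
import statistics

def standardized_log(values):
    """Return z-scores of the natural log of each positive value.

    Assumptions for this sketch: natural log, and the sample
    standard deviation (n - 1 denominator) for scaling.
    """
    logs = [math.log(v) for v in values]
    mean = statistics.mean(logs)
    sd = statistics.stdev(logs)
    return [(x - mean) / sd for x in logs]

# Hypothetical spending values, not taken from any sample file.
spending = [12.5, 30.0, 8.75, 55.0, 21.0]
z = standardized_log(spending)
```

After the transformation the values have mean 0 and standard deviation 1 on the log scale, which puts spending variables with different ranges on a comparable footing.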
telco_missing.sav. This data file is a subset of the telco.sav data file, but some of the
demographic data values have been replaced with missing values.
testmarket.sav. This hypothetical data file concerns a fast food chain’s plans to add a new item
to its menu. There are three possible campaigns for promoting the new product, so the new
item is introduced at locations in several randomly selected markets. A different promotion
is used at each location, and the weekly sales of the new item are recorded for the first four
weeks. Each case corresponds to a separate location-week.
testmarket_1month.sav. This hypothetical data file is the testmarket.sav data file with the
weekly sales “rolled-up” so that each case corresponds to a separate location. Some of the
variables that changed weekly disappear as a result, and the sales recorded is now the sum of
the sales during the four weeks of the study.
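The roll-up from location-week cases to one case per location can be sketched in Python as follows. This is a minimal illustration rather than the procedure used to build the file: the field names (location, promotion, week, sales) and the data values are hypothetical and are not taken from testmarket.sav.

```python
from collections import defaultdict

def roll_up(weekly_rows):
    """Collapse location-week rows into one row per location.

    The hypothetical 'promotion' field is constant within a location
    and is kept; the weekly 'week' field is dropped, and 'sales'
    becomes the sum over all weeks for that location.
    """
    totals = defaultdict(float)
    promotion = {}
    for row in weekly_rows:
        loc = row["location"]
        totals[loc] += row["sales"]
        promotion[loc] = row["promotion"]
    return [
        {"location": loc, "promotion": promotion[loc], "sales": totals[loc]}
        for loc in sorted(totals)
    ]

# Hypothetical data: two locations, four weeks each.
weekly = [
    {"location": 1, "promotion": "A", "week": w, "sales": s}
    for w, s in enumerate([10.0, 12.0, 11.0, 9.0], start=1)
] + [
    {"location": 2, "promotion": "B", "week": w, "sales": s}
    for w, s in enumerate([20.0, 18.0, 22.0, 21.0], start=1)
]

monthly = roll_up(weekly)
```

Here the eight weekly rows collapse to two rows, and the week field no longer appears because it has no single value per location.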
tree_car.sav. This is a hypothetical data file containing demographic and vehicle purchase
price data.
tree_credit.sav. This is a hypothetical data file containing demographic and bank loan history
data.
tree_missing_data.sav. This is a hypothetical data file containing demographic and bank loan
history data with a large number of missing values.
tree_score_car.sav. This is a hypothetical data file containing demographic and vehicle
purchase price data.
tree_textdata.sav. A simple data file with only two variables intended primarily to show the
default state of variables prior to assignment of measurement level and value labels.