264
Appendix A
and sample weights. The additional variable Current value was collected and added to the
data le after the sample was taken.
recidivism.sav. Thisis a hypothetical data le that concerns a government law enforcement
agency’sefforts to understand recidivism rates in their area of jurisdiction. Each case
corresponds to a previous offender and records their demographic information, some details
of their rst crime, and then the time until their second arrest, if it occurred within two years
of the rst arrest.
recidivism_cs_sample.sav. Thisis a hypothetical data le that concerns a government law
enforcementagency’s efforts to understand recidivism rates in their area of jurisdiction. Each
case corresponds to a previous offender, released from their rst arrest during the month of
June, 2003, and records their demographic information, some details of their rst crime, and
thedata of their second arrest, if it occurred by the end of June, 2006. Offenderswere selected
from sampled departmentsaccording to the sampling plan specied in recidivism_cs.csplan;
because it makes use of aprobability-proportional-to-size (PPS) method, there is also a le
containing the jointselection probabilities (recidivism_cs_jointprob.sav).
rfm_transactions.sav. Ahypothetical data le containing purchase transaction data, including
date of purchase, item(s) purchased, and monetary amount of eachtransaction.
salesperformance.sav. This is a hypothetical data ���le that concerns the evaluation of two
new sales training courses. Sixty employees, divided into three groups, all receive standard
training. In addition, group 2 gets technical training; group 3, a hands-on tutorial. Each
employee was testedat the end of the training course and their score recorded. Eachcase in
the data le represents a separatet raineeand records the group to which they were assigned
and the score they received on the exam.
satisf.sav. Thisis a hypothetical data le that concerns a satisfaction survey conducted by
a retail company at 4 store locations. 582 customers were surveyed in all, and each case
represents the responses from a single customer.
screws.sav. Thisdata le contains information on the characteristics of screws, bolts, nuts,
and tacks (Hartigan, 1975).
shampoo_ph.sav. Thisis a hy pothetical data le that concerns the quality control at a factory
for hair products. At regular time intervals, six separate output batches are measured and their
pH recorded. The target range is 4.5–5.5.
ships.sav. Adataset presented and analyzed elsewhere (McCullagh et al., 1989) that concerns
damage to cargoships caused by waves. Theincident counts can be modeled as occurri ng at
a Poisson rate given the shiptype, construction period, and service period. Theaggregate
months of service for each cell of the table formed by the cross-classicationof factors
provides values for the exposure to risk.
site.sav. This isa hypothetical data le that concerns a company’s efforts to choose new
sites for their expandingbusiness. They have hired two consultantsto separately evaluate
the sites, who, in addition to an extended report, summarized each site as a “good,” “fair,”
or “poor” prospect.
smokers.sav. This data le is abstracted from the 1998 National Household
Survey of Drug Abuse and is a probability sample of American households.
(http://dx.doi.org/10.3886/ICPSR02934)Thus,therststepinananalysisofthisdatale
should be to weightthe data to reect population trends.