Sample Files
nhis2000_subset.sav. The National Health Interview Survey (NHIS) is a large, population-based
survey of the U.S. civilian population. Interviews are carried out face-to-face in a nationally
representative sample of households. Demographic information and observations about
health behaviors and status are obtained for members of each household. This data
file contains a subset of information from the 2000 survey. National Center for Health
Statistics. National Health Interview Survey, 2000. Public-use data file and documentation.
ftp://ftp.cdc.gov/pub/Health_Statistics/NCHS/Datasets/NHIS/2000/. Accessed 2003.
ozone.sav. The data include 330 observations on six meteorological variables for predicting
ozone concentration from the remaining variables. Previous researchers (Breiman and
Friedman, 1985; Hastie and Tibshirani, 1990), among others, found nonlinearities among
these variables, which hinder standard regression approaches.
pain_medication.sav. This hypothetical data file contains the results of a clinical trial for
anti-inflammatory medication for treating chronic arthritic pain. Of particular interest is the
time it takes for the drug to take effect and how it compares to an existing medication.
patient_los.sav. This hypothetical data file contains the treatment records of patients who were
admitted to the hospital for suspected myocardial infarction (MI, or “heart attack”). Each case
corresponds to a separate patient and records many variables related to their hospital stay.
patlos_sample.sav. This hypothetical data file contains the treatment records of a sample
of patients who received thrombolytics during treatment for myocardial infarction (MI, or
“heart attack”). Each case corresponds to a separate patient and records many variables
related to their hospital stay.
poll_cs.sav. This is a hypothetical data file that concerns pollsters’ efforts to determine the
level of public support for a bill before the legislature. The cases correspond to registered
voters. Each case records the county, township, and neighborhood in which the voter lives.
poll_cs_sample.sav. This hypothetical data file contains a sample of the voters listed in
poll_cs.sav. The sample was taken according to the design specified in the poll.csplan plan
file, and this data file records the inclusion probabilities and sample weights. Note, however,
that because the sampling plan makes use of a probability-proportional-to-size (PPS) method,
there is also a file containing the joint selection probabilities (poll_jointprob.sav). The
additional variables corresponding to voter demographics and their opinion on the proposed
bill were collected and added to the data file after the sample was taken.
property_assess.sav. This is a hypothetical data file that concerns a county assessor’s efforts to
keep property value assessments up to date on limited resources. The cases correspond to
properties sold in the county in the past year. Each case in the data file records the township
in which the property lies, the assessor who last visited the property, the time since that
assessment, the valuation made at that time, and the sale value of the property.
property_assess_cs.sav. This is a hypothetical data file that concerns a state assessor’s efforts
to keep property value assessments up to date on limited resources. The cases correspond
to properties in the state. Each case in the data file records the county, township, and
neighborhood in which the property lies, the time since the last assessment, and the valuation
made at that time.
property_assess_cs_sample.sav. This hypothetical data file contains a sample of the properties
listed in property_assess_cs.sav. The sample was taken according to the design specified in
the property_assess.csplan plan file, and this data file records the inclusion probabilities