R/data.R
nh0506_3groups.Rd
NHANES 2005-2006 data on smoking and homocysteine levels in adults, comparing daily smokers to never smokers and occasional smokers.
nh0506_3groups
A data frame with 4457 rows and 11 variables:
NHANES identification number.
smoking status treatment factor: 0 = never smoker, 1 = some smoking, 2 = daily smoker.
factor with levels "Male" and "Female".
age in years, 20-85, with 85 recorded for everyone >= 85 years.
factor with levels "Mexican American", "Other Hispanic", "Non-Hispanic White", "Non-Hispanic Black", and "Other Race - Including Multi-Racial".
factor with levels "< Grade 9", "9-11th grade", "High school grad/GED", "Some college or AA degree", "College graduate or above".
ratio of family income to the poverty level, capped at 5 times poverty, has missing entries.
BMI (body mass index), has missing entries.
cigarettes smoked per day, 0 for never smokers.
blood cotinine level, a biomarker of recent exposure to tobacco.
homocysteine level.
The code used to generate this data is documented in the source version of this package under `data-raw/`. This data is composed of adults aged at least 20 years. Individuals who have smoked at least 100 cigarettes but do not now smoke at least 10 cigarettes daily are excluded. Individuals with missing homocysteine values, cotinine values, or smoking information are excluded. After filtering for all these criteria, five individuals with unknown education remain and are also excluded. Missing values remain in the poverty ratio and bmi covariates.
data('nh0506_3groups')