Smoking and Homocysteine Data — SmokeH

SmokeH_df is a data frame of 2,475 observations on 15 variables. It covers biomarkers (homocysteine and cotinine levels), smoking type, demographics (sex, age, education), body mass index, and poverty ratio. Some observations contain missing values.

Usage

data(SmokeH_df)

Format

A data frame with 2,475 observations and 15 variables:

SEQN: Participant identifier (integer)
homocysteine: Homocysteine level (numeric)
z: Z score indicator (integer)
female: Sex indicator (integer, 1 = female, 0 = male)
age: Age in years (integer)
education: Education level (integer code)
povertyr: Poverty ratio (numeric)
bmi: Body mass index (numeric)
cotinine: Cotinine level (numeric)
st: Smoking type indicator (integer)
stf: Smoking type (character string)
age3: Age category (integer code)
ed3: Education category (integer code)
bmi3: BMI category (integer code)
pov2: Poverty category (logical)
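
The coding documented for female (1 = female, 0 = male) can be turned into a labelled factor for tables and plots. The following is a minimal sketch on made-up values rather than on SmokeH_df itself; the variable name sex is an assumption introduced for illustration only.

```r
# Made-up indicator values for illustration; not taken from SmokeH_df.
female <- c(1L, 0L, 0L, 1L)

# Map the documented integer coding (0 = male, 1 = female) to labels.
sex <- factor(female, levels = c(0L, 1L), labels = c("male", "female"))

print(table(sex))
```

The same call should apply to SmokeH_df$female once the data are loaded. The other integer-coded columns (education, age3, ed3, bmi3) would need their own labels, which are not documented here.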

Source

Data taken from the evident package, version 1.0.4.

Details

The dataset is named 'SmokeH_df' to avoid confusion with other datasets in the R ecosystem and to identify it as part of the ForCausality package. The suffix 'df' indicates that the object is a data frame. The original content has not been modified in any way.
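
Examples

Because some observations contain missing values, it can help to count them per column before any analysis. The sketch below is one possible way to do this. count_missing is a hypothetical helper, not part of ForCausality, and the toy data frame holds made-up values under three of the documented column names. The final block runs only if the ForCausality package is installed.

```r
# Hypothetical helper (not part of ForCausality): count NA values per column.
count_missing <- function(df) {
  stopifnot(is.data.frame(df))
  vapply(df, function(col) sum(is.na(col)), integer(1))
}

# Toy data frame with made-up values, reusing three documented column names.
toy <- data.frame(
  homocysteine = c(8.1, NA, 10.4),
  cotinine     = c(0.05, 210, NA),
  female       = c(1L, 0L, 1L)
)
print(count_missing(toy))

# With the real data, assuming the ForCausality package is installed.
if (requireNamespace("ForCausality", quietly = TRUE)) {
  data("SmokeH_df", package = "ForCausality")
  print(dim(SmokeH_df))            # documented as 2,475 rows and 15 columns
  print(count_missing(SmokeH_df))
}
```

Columns with many missing values may call for complete-case filtering (for example with complete.cases) or another strategy, depending on the analysis.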