Skip to contents

This dataset, SmokeH_df, is a data frame containing information on smoking, homocysteine levels, demographics, and socioeconomic indicators. The dataset consists of 2,475 observations and 15 variables, including biomarkers, smoking-related measures, age, education, and poverty ratio. Some observations contain missing values.

Usage

data(SmokeH_df)

Format

A data frame with 2,475 observations and 15 variables:

SEQN

Participant identifier (integer)

homocysteine

Homocysteine level (numeric)

z

Z score indicator (integer)

female

Sex indicator (integer, 1 = female, 0 = male)

age

Age in years (integer)

education

Education level (integer code)

povertyr

Poverty ratio (numeric)

bmi

Body mass index (numeric)

cotinine

Cotinine level (numeric)

st

Smoking type indicator (integer)

stf

Smoking type (character string)

age3

Age category (integer code)

ed3

Education category (integer code)

bmi3

BMI category (integer code)

pov2

Poverty category (logical)

Source

Data taken from the evident package version 1.0.4

Details

The dataset name has been kept as 'SmokeH_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ForCausality package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.