Skip to contents

This dataset, Stroke_df, contains fictional case-control data for ischemic stroke, including exposures, risk factors, and confounders. The dataset includes 16,623 observations and 21 variables, covering demographic details, lifestyle factors, biomarkers, and comorbidities. Some observations contain missing values.

Usage

data(Stroke_df)

Format

A data frame with 16,623 observations and 21 variables:

regionnn7

Geographic region (factor)

case

Case indicator for ischemic stroke (numeric)

esex

Sex of the participant (integer)

eage

Age of the participant (integer)

htnadmbp

Hypertension or blood pressure measure (numeric)

nevfcur

Smoking status (factor)

global_stress2

Perceived stress indicator (factor)

whrs2tert

Waist-to-hip ratio tertiles (factor)

phys

Physical activity indicator (factor)

alcohfreqwk

Weekly alcohol consumption frequency (factor)

dmhba1c2

Diabetes / HbA1c category (factor)

cardiacrfcat

Cardiac risk factor category (factor)

ahei3tert

Alternative Healthy Eating Index tertiles (factor)

apob_apoatert

ApoB/ApoA ratio tertiles (factor)

subeduc

Sub-education level (factor)

moteduc

Mother’s education level (factor)

fatduc

Father’s education level (factor)

subhtn

Sub-hypertension indicator (factor)

whr

Waist-to-hip ratio (numeric)

apob_apoa

ApoB/ApoA continuous ratio (numeric)

weights

Sample weights (numeric)

Source

Data taken from the causalPAF package version 1.2.5

Details

The dataset name has been kept as 'Stroke_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ForCausality package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.