This dataset, Stroke_df, contains fictional case-control data for ischemic stroke, including exposures, risk factors, and confounders. The dataset includes 16,623 observations and 21 variables, covering demographic details, lifestyle factors, biomarkers, and comorbidities. Some observations contain missing values.
Usage
data(Stroke_df)
Format
A data frame with 16,623 observations and 21 variables:
- regionnn7
Geographic region (factor)
- case
Case indicator for ischemic stroke (numeric)
- esex
Sex of the participant (integer)
- eage
Age of the participant (integer)
- htnadmbp
Hypertension or blood pressure measure (numeric)
- nevfcur
Smoking status (factor)
- global_stress2
Perceived stress indicator (factor)
- whrs2tert
Waist-to-hip ratio tertiles (factor)
- phys
Physical activity indicator (factor)
- alcohfreqwk
Weekly alcohol consumption frequency (factor)
- dmhba1c2
Diabetes / HbA1c category (factor)
- cardiacrfcat
Cardiac risk factor category (factor)
- ahei3tert
Alternative Healthy Eating Index tertiles (factor)
- apob_apoatert
ApoB/ApoA ratio tertiles (factor)
- subeduc
Participant's (subject's) education level (factor)
- moteduc
Mother’s education level (factor)
- fatduc
Father’s education level (factor)
- subhtn
Sub-hypertension indicator (factor)
- whr
Waist-to-hip ratio (numeric)
- apob_apoa
Continuous ApoB/ApoA ratio (numeric)
- weights
Sample weights (numeric)
Details
The dataset name has been kept as 'Stroke_df' to avoid confusion with other datasets in the R ecosystem and to identify it as part of the ForCausality package. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
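Because some observations contain missing values, it can help to check how they are distributed before any analysis. The following is a minimal sketch, not part of the package: `missing_summary()` is a hypothetical helper introduced here for illustration, and the loading step assumes the ForCausality package is installed and exposes `Stroke_df` through `data()`.

```r
# Hypothetical helper (not provided by ForCausality):
# count and percentage of missing values for each column of a data frame.
missing_summary <- function(df) {
  n_missing <- colSums(is.na(df))
  data.frame(
    variable    = names(n_missing),
    n_missing   = as.integer(n_missing),
    pct_missing = unname(round(100 * n_missing / nrow(df), 1)),
    row.names   = NULL,
    stringsAsFactors = FALSE
  )
}

# Assumption: ForCausality is installed; the block is skipped otherwise.
if (requireNamespace("ForCausality", quietly = TRUE)) {
  data("Stroke_df", package = "ForCausality")
  str(Stroke_df)                          # variable names and types
  table(Stroke_df$case, useNA = "ifany")  # case indicator counts
  print(missing_summary(Stroke_df))       # missingness per variable
}
```

The helper works on any data frame, so it can be tried on a small toy example before being applied to the full dataset.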
