Purpose: Stroke is the leading cause of death among Chinese population. The aim of this study was to explore the influencing factors for stroke, so as to provide theoretical basis for the study of the etiology of and the prevention and control of stroke.
Methods: We conducted a population-based case-control study using frequency matching, with 1141 stroke patients and 1141 controls selected from the Luzhou population health information platform. Unconditional logistic regression model, CHAID classification tree model and Support Vector Machine (SVM) were used to explore the influencing factors for stroke. We evaluated the accuracy of the three statistical techniques by using the area under the receiver operating characteristic curve (AUC).
Result: The results of multivariate unconditional logistic regression analysis showed that the influencing factors of stroke were age, exercise, hypercholesterolemia, low levels of high density lipoprotein cholesterol (HDL-C), hypertension, diabetes, coronary heart disease (CHD). Classification tree models screened six influencing factors for stroke: hypertension, high salt diet, insufficient exercise, low HDL-C, diabetes and age, and hypertension was the major risk factor for stroke. High risk groups are mainly distributed in people with hypertension, diabetes, age over 50 and lack of exercise. The results of the support vector machine model presented that there were nine factors affecting stroke, the influence levels from high to low were hypertension (42%), insufficient exercise (18%), age group (12%), alcohol consumption (4%), Hypoglycemia (4%), gender (4%), hypercholesterolemia (3%), high salt diet (3%) and diabetes mellitus (2%), respectively. The AUC of logistic regression, classification tree model and SVM model were 0.769, 0.753 and 0.880, respectively.
Conclusion: All three models were reliable and got similar but different results. Therefore, the results of the three models can complement each other. Hypertension, diabetes, CHD, smoking, insufficient exercise, hypercholesterolemia, low HDL-C, and high salt diet, fifty years of age or more are the important risk factors that affect the prevalence of stroke.