[1]赵小强,乔慧.基于不完整数据的IHB-LightGBM心脏病预测模型[J].中国医学物理学杂志,2023,40(4):512-520.[doi:DOI:10.3969/j.issn.1005-202X.2023.04.018]
 ZHAO Xiaoqiang,QIAO Hui.Heart disease prediction using IHB-LightGBM model based on incomplete?ata[J].Chinese Journal of Medical Physics,2023,40(4):512-520.[doi:DOI:10.3969/j.issn.1005-202X.2023.04.018]
点击复制

基于不完整数据的IHB-LightGBM心脏病预测模型()
分享到:

《中国医学物理学杂志》[ISSN:1005-202X/CN:44-1351/R]

卷:
40卷
期数:
2023年第4期
页码:
512-520
栏目:
其他(激光医学等)
出版日期:
2023-04-25

文章信息/Info

Title:
Heart disease prediction using IHB-LightGBM model based on incomplete?ata
文章编号:
1005-202X(2023)04-0512-09
作者:
赵小强123乔慧1
1.兰州理工大学电气工程与信息工程学院, 甘肃 兰州 730050; 2.兰州理工大学甘肃省工业过程先进控制重点实验室, 甘肃 兰州 730050; 3.兰州理工大学国家级电气与控制工程实验教学中心, 甘肃 兰州 730050
Author(s):
ZHAO Xiaoqiang123 QIAO Hui1
1. School of Electrical Engineering and Information Engineering, Lanzhou University of Technology, Lanzhou 730050, China 2. Key Laboratory of Gansu Advanced Control for Industrial Processes, Lanzhou University of Technology, Lanzhou 730050, China 3. National Experimental Teaching Center of Electrical and Control Engineering, Lanzhou University of Technology, Lanzhou 730050, China
关键词:
数据挖掘心脏病预测超参数优化算法LightGBM算法
Keywords:
Keywords: data mining heart disease prediction hyperparameter optimization algorithm LightGBM algorithm
分类号:
R318;R541
DOI:
DOI:10.3969/j.issn.1005-202X.2023.04.018
文献标志码:
A
摘要:
提出基于不完整数据的IHB-LightGBM(Improved Hyperband-Light Gradient Boosting Machine)心脏病预测模型。首先,在Hyperband算法超参数采样的基础上引入了权重值,并通过蓄水池法按特征权重对其进行排序,从而筛选出最优参数以提高算法的参数寻优能力;其次,针对心脏病数据样本小且属性缺失的问题,使用K近邻算法对不完整数据进行缺失值插补,再将处理得到的完整数据进行归一化,使数据映射至0~1范围内;最后,对LightGBM采用改进后的IHB优化算法进行全局参数寻优,建立IHB-LightGBM心脏病预测模型。使用UCI心脏病数据集进行实验,结果表明IHB算法的参数寻优效果优于贝叶斯、随机搜索等优化算法,IHB-LightGBM模型在各项评价指标也上明显高于随机森林、极端随机树等算法,可以获得更快的预测速度和更高的预测精度。
Abstract:
Abstract: An improved Hyperband-light gradient boosting machine (IHB-LightGBM) model for heart disease prediction based on incomplete data is presented in the study. Based on the sampling of Hyperband algorithm for hyperparameter estimation, the weight values are introduced, and the reservoir method is used to sort the parameters according to the feature weights, so as to screen out the optimal parameters to improve the parameter optimization ability of the algorithm. Subsequently, to overcome the problems of small sample size and missing attributes of heart disease data, K-nearest neighbor algorithm is used to perform the interpolation of missing values for incomplete data, and the obtained complete data are normalized and mapped to the range from 0 to 1. The IHB optimization algorithm is adopted for global parameter optimization, and the IHB-LightGBM model for heart disease prediction is established. Experiments are carried out using the UCI heart disease data sets, and the results show that IHB algorithm is superior to Bayesian, random search and other optimization algorithms in parameter optimization, and that the various evaluation indicators of IHB-LightGBM model are significantly higher than those of random forest, extreme random tree and other algorithms. The proposed model improves both efficiency and accuracy of prediction.

相似文献/References:

[1]马晶,蔡文杰,杨利.基于机器学习的心音识别分类研究[J].中国医学物理学杂志,2021,38(1):75.[doi:DOI:10.3969/j.issn.1005-202X.2021.01.013]
 MA Jing,CAI Wenjie,YANG Li.Research on heart sounds classification based on machine learning[J].Chinese Journal of Medical Physics,2021,38(4):75.[doi:DOI:10.3969/j.issn.1005-202X.2021.01.013]

备注/Memo

备注/Memo:
【收稿日期】2022-12-15 【基金项目】国家重点研发计划(2020YFB1713600);国家自然科学基金(61763029);甘肃省教育厅产业支撑计划项目(2021CYZC-02) 【作者简介】赵小强,博士,教授,博士生导师,主要研究方向为数据挖掘、图像处理、故障诊断,E-mail: xqzhao@lut.edu.cn
更新日期/Last Update: 2023-04-25