[1]徐淑坦,冷银辉,陈明.基于改进TF-IDF算法的基因通路富集方法[J].中国医学物理学杂志,2022,39(9):1173-1181.[doi:DOI:10.3969/j.issn.1005-202X.2022.09.019]
 XU Shutan,LENG Yinhui,et al.A gene pathway enrichment method based on improved TF-IDF algorithm[J].Chinese Journal of Medical Physics,2022,39(9):1173-1181.[doi:DOI:10.3969/j.issn.1005-202X.2022.09.019]
点击复制

基于改进TF-IDF算法的基因通路富集方法()
分享到:

《中国医学物理学杂志》[ISSN:1005-202X/CN:44-1351/R]

卷:
39卷
期数:
2022年第9期
页码:
1173-1181
栏目:
其他(激光医学等)
出版日期:
2022-11-02

文章信息/Info

Title:
A gene pathway enrichment method based on improved TF-IDF algorithm
文章编号:
1005-202X(2022)09-1173-09
作者:
徐淑坦12冷银辉12陈明12
1.上海海洋大学信息学院, 上海 201306; 2.农业农村部渔业信息重点实验室, 上海 201306
Author(s):
XU Shutan1 2 LENG Yinhui1 2 CHEN Ming1 2
1. College of Information Technology, Shanghai Ocean University, Shanghai 201306, China 2.Key Laboratory of Fisheries Information, Ministry of Agriculture and Rural Affairs of the Peoples Republic of China, Shanghai 201306, China
关键词:
通路富集基因影响力基因集富集分析
Keywords:
pathway enrichment genet impact gene set enrichment analysis
分类号:
R318
DOI:
DOI:10.3969/j.issn.1005-202X.2022.09.019
文献标志码:
A
摘要:
提出一种综合考虑通路局部和全局信息的基因通路富集分析(GIGSEA)方法。首先利用基因相互作用数据,通过基因在通路的局部重要性和在通路数据库的全局特异性计算基因的影响力;然后将基因影响力和表型相关性值融合在一起,计算通路的富集分数;最后通过置换基因富集出统计学显著的通路。将GIGSEA方法运用于肝细胞癌和结肠直肠癌数据集进行风险通路富集,与基因集富集分析方法相比,GIGSEA方法能富集出一些新的相关通路,并排除无关的通路,提高疾病相关通路的富集效果。
Abstract:
A gene pathway enrichment method (GIGSEA method) that comprehensively considers the local and global information of the pathways is proposed. The gene interaction data are used to calculate the gene impact based on the local importance of the gene in the pathway and its global specificity in the pathway database. Then the obtained gene impact is fused with the phenotypic correlation value to calculate enrichment score, and statistically significant pathways are identified by permutating gene. GIGSEA is applied to the data sets of hepatocellular carcinoma and colorectal cancer for the enrichment of risk pathways. Compared with the gene set enrichment analysis method, GIGSEA method can enrich some new related pathways and exclude irrelevant pathways, which improves the enrichment effect of disease-associated pathways.

备注/Memo

备注/Memo:
【收稿日期】2022-04-05 【基金项目】广东省重点领域研发计划(2021B0202070001) 【作者简介】徐淑坦,博士后,副教授,研究方向:生物信息,E-mail: stxu@shou.edu.cn 【通信作者】陈明,博士,教授,研究方向:生物信息,E-mail: mchen@shou.edu.cn
更新日期/Last Update: 2022-09-27