临床荟萃 ›› 2025, Vol. 40 ›› Issue (7): 608-618.doi: 10.3969/j.issn.1004-583X.2025.07.005

• 论著 • 上一篇    下一篇

基于生物信息学方法的多囊卵巢综合征分子机制

郭倩, 陈婕, 马蔚蓉, 张岩, 杨颖倩, 谈勇()   

  1. 南京中医药大学附属医院 妇科,江苏 南京 210029
  • 收稿日期:2025-04-22 出版日期:2025-07-20 发布日期:2025-07-17
  • 通讯作者: 谈勇 E-mail:xijun1025@163.com

Bioinformatics-driven investigation into the molecular pathways of polycystic ovary syndrome

Guo Qian, Chen Jie, Ma Weirong, Zhang Yan, Yang Yingqian, Tan Yong()   

  1. Department of Gynecology, Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing 210029, China
  • Received:2025-04-22 Online:2025-07-20 Published:2025-07-17
  • Contact: Tan Yong E-mail:xijun1025@163.com

摘要:

目的 利用微阵列数据集确定多囊卵巢综合征(PCOS)新的遗传靶点,并揭示导致PCOS发生发展的分子基础。方法 对来自基因表达综合数据库的3个独立PCOS数据集进行了全面分析,使用R软件进行数据处理和规范化。对差异表达基因 (DEG) 与PCOS之间关系的评估包括差异表达分析、表达数量性状位点分析和孟德尔随机化 (MR)分析。运用基因集变异分析和基因本体/京都基因和基因组百科全书富集分析来探索这些基因的功能作用和途径。借助Lasso回归、SVM机器学习、随机森林树法筛选PCOS交集特征基因,再与MR方法进行交叉验证。此外,采用CIBERSORT分析评估PCOS中22种免疫细胞的浸润水平。最后,构建单基因的ceRNA调控网络进行直观化展示。结果 本研究鉴定出59个显著的差异基因,包括28个上调的DEG和31个下调的DEG。Lasso回归、SVM机器学习、随机森林树法筛选出PCOS相关的6个交集特征基因,具体为DNAJC3、ASPH、TLR4、SEC24D、SGK1、AMFR。这些基因参与重要的生物学过程和信号通路,包括对内质网应激的反应、对化学应激的反应、自噬调节、内源性凋亡、脂质和动脉粥样硬化、神经退行性变等。MR分析方法与前述3种方法进行交叉验证,进一步筛选出了与PCOS存在因果关系的基因SEC24D,以及与SEC24D密切相关的miRNA37个和lncRNA15个。CIBERSORT 分析表明PCOS中存在独特的免疫细胞分布,尤其是中性粒细胞比列显著升高,强调了免疫过程在PCOS中的重要性。结论 本研究为PCOS的分子基础提供了新的见解,并强调了治疗干预的前景以及针对特定分子途径治疗PCOS的潜力,为进一步的研究和临床工作奠定了基础。

关键词: 多囊卵巢综合征, 差异表达基因, 微阵列数据, eQTL分析, 孟德尔随机化, 免疫细胞浸润

Abstract:

Objective To identify novel genetic targets for polycystic ovary syndrome (PCOS) and to reveal the molecular basis underlying the development of PCOS using microarray datasets. Methods A comprehensive analysis was conducted on three independent PCOS datasets from the Gene Expression Omnibus database, and data processing and normalization were performed using R software. The evaluation of the relationship between differentially expressed genes (DEGs) and PCOS included differential expression analysis, expression quantitative trait locus analysis, and Mendelian randomization (MR) analysis. Gene Set Variation Analysis and Gene Ontology/Kyoto Encyclopedia of Genes and Genomes enrichment analysis were performed to explore the functional roles and pathways of these genes. Lasso regression, SVM machine learning, and random forest tree method were used to screen PCOS intersection feature genes, and then a cross validation was conducted with MR method. In addition, CIBERSORT analysis was used to evaluate the infiltration levels of 22 immune cells in PCOS. Finally, a single gene ceRNA regulatory network was constructed for visualization. Results This study identified 59 significant differentially expressed genes (DEGs), including 28 upregulated DEGs and 31 downregulated DEGs. Lasso regression, SVM machine learning, and random forest tree method were used to screen six intersection feature genes related to PCOS, including DNAJC3, ASPH, TLR4, SEC24D, SGK1 and AMFR. These genes were involved in important biological processes and pathways, including responses to endoplasmic reticulum stress, chemical stress, autophagy regulation, endogenous apoptosis, lipid and atherosclerosis, neurodegeneration, etc. The MR analysis method for the cross validation of the aforementioned three methods further suggested that SEC24D had a causal relationship with PCOS, and 37 miRNAs and 15 lncRNAs were closely related to SEC24D. CIBERSORT analysis showed a unique distribution of immune cells in PCOS, particularly a significant increase in neutrophil count, emphasizing the importance of immune processes in PCOS. Conclusion This study provides new insights into the molecular basis of PCOS and emphasizes the potential for therapeutic interventions and targeted molecular pathways for treating PCOS, laying the foundation for further research and clinical work.

Key words: polycystic ovary syndrome, differentially expressed genes, microarray data, eQTL analysis, Mendelian randomization, immune cell infiltration

中图分类号: