生物技术通报 ›› 2025, Vol. 41 ›› Issue (9): 335-344.doi: 10.13560/j.cnki.biotech.bull.1985.2025-0418

• 研究报告 • 上一篇    

食管鳞状细胞癌早期进展风险的代谢物预警模型

张雅祺1(), 王芹芹1, 沈夏1, 李旭苗1, 高敏1, 李军2, 李辰1(), 王慧1()   

  1. 1.上海交通大学医学院公共卫生学院单细胞组学与疾病研究中心,上海 200025
    2.盐亭县肿瘤医院肿瘤防治办公室,绵阳 621600
  • 收稿日期:2025-04-21 出版日期:2025-09-26 发布日期:2025-09-24
  • 通讯作者: 王慧,女,教授,研究方向 :肿瘤分子靶点的应用基础研究;E-mail: huiwang@shsmu.edu.cn
    李辰,女,研究员,研究方向 :肿瘤多组学研究;E-mail: cli@shsmu.edu.cn
  • 作者简介:张雅祺,女,硕士研究生,研究方向 :消化道肿瘤代谢组学;E-mail: zhangyaqimiao@sjtu.edu.cn
    王芹芹同为本文第一作者
  • 基金资助:
    国家自然科学基金项目(82030099);国家自然科学基金项目(82373446);国家重点研发计划(2022YFD2101500);上海市科委“科技创新行动计划”自然科学基金项目(23ZR1435800);上海市科委“科技创新行动计划”技术标准项目(21DZ2201700);上海市公共卫生体系建设三年行动计划(GWVI-11. 1-43)

Metabolite Early Warning Model for the Risk of Early Progression in Esophageal Squamous Cell Carcinoma

ZHANG Ya-qi1(), WANG Qin-qin1, SHEN Xia1, LI Xu-miao1, GAO Min1, LI Jun2, LI Chen1(), WANG Hui1()   

  1. 1.Center for Single-cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai 200025
    2.Cancer Prevention and Treatment Office, Yanting Cancer Hospital, Mianyang 621600
  • Received:2025-04-21 Published:2025-09-26 Online:2025-09-24

摘要:

目的 基于代谢组学构建食管鳞状细胞癌(esophageal squamous cell carcinoma, ESCC)早期风险预警模型,精准识别高风险人群。 方法 纳入84例低级别上皮内瘤变患者,采集基线期血清,根据随访期间是否进展为高级别上皮内瘤变或ESCC分为进展组(N=28)和无进展组(N=56)。采用反相液相色谱和亲水相互作用液相色谱联合高分辨质谱开展非靶向代谢组学分析。结合单变量与多变量分析评估组间代谢特征差异,对差异代谢物进行通路富集分析。将样本按7∶3比例分为训练集与测试集,在训练集中采用单变量逻辑回归联合最小绝对收缩与选择算子回归筛选与病程进展相关的关键代谢物,基于回归系数构建风险预警模型。通过受试者工作特征曲线和曲线下面积(area under the curve, AUC)评估模型性能。 结果 共鉴定10类1 431种代谢物,差异代谢物在类固醇激素生物合成、初级胆汁酸合成及亚油酸代谢通路显著富集。最终筛选出18个与病程进展密切相关的关键代谢物,包括甘油-3-磷脂胆碱、棕榈酸、黄尿酸及N-脒基天冬氨酸等。风险预警模型在测试集中表现出良好的预测能力(AUC=0.812)。 结论 基于前瞻性随访队列,识别出多个关键代谢物及代谢通路,构建ESCC早期进展风险的代谢物预警模型。模型具有良好的预测鲁棒性和泛化能力,可为ESCC高风险人群的早期风险评估与干预策略优化提供理论支持。

关键词: 食管鳞状细胞癌, 代谢组学, 精准医学, 最小绝对收缩与选择算子, 风险预警模型, 早期筛查

Abstract:

Objective An early warning model for the risk of esophageal squamous cell carcinoma (ESCC) was constructed based on metabolomics to identify high-risk populations accurately. Method Eighty-four patients with low-grade intraepithelial neoplasia (LGIN) were included, and serum was collected at baseline. They were divided into a progress group (N=28) and a non-progress group (N=56) according to whether they progressed to high-grade intraepithelial neoplasia (HGIN) or ESCC during follow-up. Untargeted metabolomics analysis was performed using reversed-phase liquid chromatography and hydrophilic interaction liquid chromatography combined with high-resolution mass spectrometry. Differences in metabolic profiles between groups were assessed by combining univariate and multivariate analyses, and pathway enrichment analyses were performed for differential metabolites. The samples were divided into training and test sets in the ratio of 7:3. In the training set, univariate logistic regression combined with least absolute shrinkage and selection operator (LASSO) regression was used to screen key metabolites associated with disease progression, and a risk warning model was constructed based on the regression coefficients. Model performance was assessed by the receiver operating characteristic curve and area under the curve (AUC). Result A total of 1 431 metabolites from 10 classes were identified. Differential metabolites were significantly enriched in steroid hormone biosynthesis, primary bile acid synthesis, and linoleic acid metabolic pathways. Finally, 18 key metabolites closely related to the progression of the disease were selected, including sn-glycero-3-phosphocholine, hexadecanoic acid, xanthurenic acid, and N-amidino-aspartate. The risk warning model showed good predictive ability in the test set (AUC=0.812). Conclusion Based on the prospective follow-up cohort, multiple key metabolites and metabolic pathways are identified, and metabolite early warning models for the risk of early progression of ESCC are constructed. The model has good prediction robustness and generalization ability and may provide theoretical support for early risk assessment and intervention strategy optimization in people at high risk of ESCC.

Key words: esophageal squamous cell carcinoma, metabolomics, precision medicine, least absolute shrinkage and selection operator, risk prediction model, early screening