生物技术通报 ›› 2024, Vol. 40 ›› Issue (3): 202-214.doi: 10.13560/j.cnki.biotech.bull.1985.2023-0982

• 研究报告 • 上一篇    下一篇

甘蔗属种及其近缘属种蔗茅的全基因组密码子偏好性分析

田春艳(), 李旭娟, 李纯佳, 毛钧, 刘新龙()   

  1. 1.热带作物生物育种全国重点实验室,昆明 650205
    2.云南省农业科学院甘蔗研究所 云南省甘蔗遗传改良重点实验室,开远 661699
    3.农业农村部甘蔗生物学与遗传育种重点实验室,开远 661699
  • 收稿日期:2023-10-19 出版日期:2024-03-26 发布日期:2024-04-08
  • 通讯作者: 刘新龙,男,博士,研究员,研究方向:甘蔗分子育种;E-mail: lxlgood868@163.com
  • 作者简介:田春艳,女,硕士,助理研究员,研究方向:甘蔗分子育种;E-mail: tianchy89@126.com
  • 基金资助:
    云南种子种业联合实验室(202205AR070001-09);云南省科技人才与平台计划(202205AM070001)

Genome-wide Analysis of Codon Usage Bias in Saccharum Species and Its Phylogenetically Related Species Erianthus fulvus

TIAN Chun-yan(), LI Xu-juan, LI Chun-jia, MAO Jun, LIU Xin-long()   

  1. 1. National Key Laboratory for Biological Breeding of Tropical Crops, Kunming 650205
    2. Sugarcane Research Institute, Yunnan Academy of Agricultural Sciences, Yunnan Key Laboratory of Sugarcane Genetic Improvement, Kaiyuan 661699
    3. Key Laboratory of Sugarcane Biology and Genetic Breeding, Ministry of Agriculture and Rural Affairs, Kaiyuan 661699
  • Received:2023-10-19 Published:2024-03-26 Online:2024-04-08

摘要:

【目的】 为解析甘蔗基因组的密码子使用特征,提高异源基因在甘蔗中的表达效率。【方法】 以已发布的甘蔗属种(热带种LA-purple、割手密NP-X和AP85-441)及其近缘属种蔗茅(Yunnan2009-3)基因组为数据,利用Python、CodonW1.4.2进行密码子偏好性分析,同时通过中性绘图、ENC-plot、PR2-plot等分析探讨密码子偏好性形成的影响因素,并结合转录组测序数据分析密码子偏好性参数与基因表达水平的相关性。最后,基于RSCU均值与7个主要模式生物种(玉米、高粱、水稻、拟南芥、烟草、大肠杆菌、酿酒酵母)的密码子使用模式进行比较分析。【结果】 显示热带种、割手密和蔗茅的基因组都富含GC,平均GC含量为56.3%,且GC3>GC1>GC2, 倾向于使用以G/C结尾的密码子, 平均ENC值为48.45,偏好性较低。中性绘图、ENC-plot和PR2-plot分析表明它们的密码子偏好性受到自然选择、突变压力等多种因素的共同影响,其中自然选择占主导作用。相关性分析表明密码子偏好性参数与基因实际的转录表达水平存在显著相关性,但相关性不强。根据RSCU和∆RSCU值,确定了13个最优密码子,均以C或G结尾,密码子使用特性在全基因组和染色体组水平上无差异。通过比较发现,甘蔗的核苷酸组成及密码子偏好性与玉米、高粱和水稻较为相似,而与拟南芥、烟草、大肠杆菌和酵母具有显著差异。【结论】 甘蔗热带种、割手密和蔗茅的密码子偏好性高度相似,其形成受自然选择和突变因素的影响。此外,对甘蔗优异基因功能异源验证时可优先选择玉米、水稻和高粱作为异源表达系统。

关键词: 甘蔗属, 蔗茅, 基因组, 密码子使用偏好性, 最优密码子

Abstract:

【Objective】 To clarify the codon usage characteristics and improve expression efficiency of exogenous genes in sugarcane. 【Method】 Codon usage bias were analyzed by Python and CodonW1.4.2 tools based on their genomes data of three species including Saccharum officinarum (LA-purple), S. sponaneum (NP-X and AP85-441), and Erianthus fulvus (Yunnan2009-3). Neutrality-plot, ENC-plot, and Parity Rule 2(PR2)-plot analysis were also used to explore the possible factors that affect the formation of codon usage bias. Simultaneously, correlation analysis between gene expressions and codon usage bias parameters was carryd out based on transcriptome data. Finally, a comparison analysis of codon usage patterns were performed between sugarcane and the other seven main model organisms, namely, Zea mays, Sorghum bicolor, Oryza sativa, Arabidopsis thaliana, Nicotiana tabacum, Escherichia coli, and Saccharomyces cerevisiae based on the average RSCU value of three species. 【Result】 S. officinarum, S. sponaneum, and E. fulvus were rich in GC base pairs, the average GC content was 56.3%, and GC3 was the highest, followed by GC1 and GC2, indicating that sugarcane preferred to use synonymous codons ending with a C/G base. The average ENC value was 48.45, suggesting that sugarcane had a lower codon usage bias. According to the results of neutral-plot, ENC-plot, and PR2-plot analysis, the codon usage bias of four genomes was affected by natural selection, mutation pressure, and so on, and the natural selection dominated in them. Correlation analysis showed that codon preference parameters were significantly correlated with the actual transcription expressions of genes, but the correlation was not strong. Based on the RSCU and ∆RSCU values, thirteen identical optimal codons were confirmed, and all of them were ended with C/G base pairs. The codon usage characteristics were not different at genome-wide and chromosome level. In addition, the codon usage bias of sugarcane was similar to that of Z. mays, S. bicolor, and O. sativa, while it had a significant difference with A. thaliana, N. tabacum, E. coli, and S. cerevisiae. 【Conclusion】 The codon usage bias among S. officinarum, S.sponaneum and E. fulvus was highly similar, and its formation was affected by natural selection and mutation pressure factors. Meanwhile, Z. mays, O. sativa and S. bicolor could be used for elite gene function research on sugarcane as a exogenous expression system on account of their codon usage bias patterns.

Key words: Saccharum, Erianthus fulvus, genome, codon usage bias, optimal codons