生物技术通报 ›› 2019, Vol. 35 ›› Issue (10): 130-136.doi: 10.13560/j.cnki.biotech.bull.1985.2019-0266

• 研究报告 • 上一篇    下一篇

环状芽孢杆菌泛基因组分析及次级代谢通路挖掘

姚彩苗1, 赵雯雅2, 3, 汪步青2, 3, 郑利艳2, 3, 张丽萍2, 3, 刘洪伟2, 3   

  1. 1. 中国人民解放军联勤保障部队第九八〇医院检验实验科,石家庄 050000;
    2. 河北省科学院生物研究所,石家庄 050081;
    3. 河北省主要农作物病害微生物控制工程技术研究中心,石家庄 050081
  • 收稿日期:2019-04-02 出版日期:2019-10-26 发布日期:2019-09-30
  • 作者简介:姚彩苗,女,研究方向:微生物代谢产物的分离纯化;E-mail:yaocaimiao@163.com
  • 基金资助:
    河北省高层次人才资助项目(B2018003019),河北省科学院科技计划项目(19304,2018G01)

Pan-Genome Analysis and Secondary Metabolic Pathway Mining of Bacillus circulans

YAO Cai-miao1, ZHAO Wen-ya2,3, WANG Bu-qing2,3, ZHENG Li-yan2,3, ZHANG Li-ping2,3, LIU Hong-wei2,3   

  1. 1. Department of Laboratory Medicine,980 Hospital of PLA Joint Logistics Support Force,Shijiazhuang 050000;
    2. Institute of Biology,Hebei Academy of Science,Shijiazhuang 050081;
    3. Main Crops Disease of Microbial Control Engineering Technology Research Center in Hebei Province,Shijiazhuang 050081
  • Received:2019-04-02 Published:2019-10-26 Online:2019-09-30

摘要: 旨为对环状芽孢杆菌基因组进行更深入的了解,并探索其次级代谢通路。从NCBI数据库下载了9个环状芽孢杆菌的基因组,利用系统发育分析软件、泛基因组分析软件和次级代谢产物挖掘软件对其进行了分析。9株菌的基因组大小在5.01-9.63 Mb之间,在进化树上被归为了两个分支。通过泛基因组和核心基因组分析,发现其泛基因组含有9 572个基因家族,核心基因组由3 622个基因家族组成;共鉴定出4 593个特有基因,其中菌株NCTC2610的特有基因最多(3 030个),而菌株NBRC 13626的特有基因最少(39个)。通过次级代谢产物合成基因簇分析,9个环状芽孢杆菌基因组中共发现6类、32个次级代谢基因簇,重复出现最多的代谢通路是羊毛硫肽、套索肽和萜烯类化合物合成通路。通过本研究,明确了环状芽孢杆菌的泛基因组和核心基因组大小,预测了其次级代谢通路,有助于我们全面了解环状芽孢杆菌,为进一步更好地利用该菌株提供线索。

关键词: 环状芽孢杆菌, 泛基因组, 次级代谢, 基因组挖掘

Abstract: This study aimed to deeply understand the genomes of Bacillus circulans and to mine these secondary metabolic pathways. The genomes of 9 B. circulans were downloaded from NCBI database and analyzed by phylogenetic analysis software,pan-genome analysis software and secondary metabolite mining software. The genome size of 9 strains was between 5.01-9.63 Mb and was divided into two branches in the evolutionary tree. Through the analysis of pan-genome and core genome,it was found that the pan-genome contained 9 572 cluster genes,the core genome was composed of 3 622 cluster genes,and a total of 4 593 specific cluster genes were identified. Among them,strain NCTC2610 had the most specific cluster genes(3 030)and strain NBRC 13626 had the least specific cluster genes(39). After the analysis of secondary metabolite synthesis gene clusters,6 types and 32 secondary metabolic gene clusters were found in 9 B. circulansgenomes,and the most repeated metabolic pathways were lanthipeptide,lassopeptide and terpene compounds synthesis pathways. In sum,through this study the pan-genome and core genome of 9 B. circulans were clarified,and their secondary metabolic pathways were predicted. These results will help us to fully understand B. circulans,and will provide us some clues to better use those strains.

Key words: Bacillus circulans, pan-genome, secondary metabolic, genome mining