生物技术通报 ›› 2019, Vol. 35 ›› Issue (7): 90-99.doi: 10.13560/j.cnki.biotech.bull.1985.2019-0360

• 研究报告 • 上一篇    下一篇

莠去津降解菌泛基因组测序及比较基因组分析

王娅丽, 朱姗姗, 杨峰山, 马玉堃, 付海燕, 刘春光   

  1. 1. 黑龙江大学农业微生物技术教育部工程研究中心,哈尔滨 150500;
    2. 黑龙江大学生命科学学院 黑龙江省寒地生态修复与资源利用重点实验室,哈尔滨 150080;
    3. 黑龙江大学生命科学学院 黑龙江省普通高校微生物重点实验室,哈尔滨 150080
  • 收稿日期:2019-04-26 出版日期:2019-07-26 发布日期:2019-07-29
  • 作者简介:王娅丽,女,研究方向:生物技术;E-mail:941515464@qq.com
  • 基金资助:
    黑龙江省自然科学基金面上项目(C2018051),黑龙江省高校基本科研业务费黑龙江大学专项资金项目(HDRCCX-201614)

Pan-genome Sequencing and Comparative Genomic Analysis of Atrazine-degrading Bacteria

WANG Ya-li, ZHU Shan-shan, YANG Feng-shan, MA Yu-kun, FU Hai-yan, LIU Chun-guang   

  1. 1. Engineering Research Center of Agricultural Microbiology Technology,Ministry of Education,Heilongjiang University,Harbin 150500;
    2. Heilongjiang Provincial Key Laboratory of Ecological Restoration and Resource Utilization for Cold Region,School of Life Sciences,Heilongjiang University,Harbin 150080;
    3. Laboratory of Microbiology,College of Heilongjiang Province,School of Life Sciences,Heilongjiang University,Harbin 150080
  • Received:2019-04-26 Published:2019-07-26 Online:2019-07-29

摘要: Arthrobacter aurescens TC1和Pseudomonas sp. ADP是目前莠去津降解菌的模式菌株,筛选出Microbacterium sp. HBT4,旨在挖掘这3株不同种属细菌基因组间生物学信息的异同,并预测重要基因。通过Illumina Hiseq 4000测序平台采用DNA小文库制备和测序技术,进行了泛基因组测序,使用相关软件进行基因组组分分析、基因功能注释、基因间变异检测和比较基因组学分析,将分离得到的微杆菌HBT4与模式菌株进行核苷酸组成、共线性及菌株间变异差异分析。得到该菌株基因组大小约为3.53 Mb,预测到菌株HBT4编码基因3 397个、重复序列含量为1.33%、非编码RNA 63个,通用数据库基因功能注释共3 324个,专用数据库基因功能注释共1 149个,通过菌株间差异变异分析发现SNP、Small InDel和水平转移基因,未发现结构变异基因,获得该菌株特有基因中GO注释到的基因在细胞组分、分子功能和生物学进程中的数量和比例,从KEGG 代谢通路富集图中发现特有基因编码的二氢硫基赖氨酸残基琥珀酰转移酶位于三羧酸循环中α-酮戊二酸和琥珀酰辅酶A的代谢通路之间。获得3个菌株核心基因组与非必需基因组比例分布、系统进化树和共线性关系,发现三者之间共有基因家族986个、菌株HBT4特有基因家族1 171个。得到的菌株HBT4与两株模式菌株相比,其基因家族之间既有相同之处,又有较大差异。

关键词: 莠去津, 微杆菌属, 泛基因组, 比较基因组分析, 生物信息学

Abstract: Arthrobacter aurescens TC1 and Pseudomonas sp. ADP are currently the model strains of atrazine-degrading bacteria. Microbacterium sp. HBT4 was screened in this experiment for discovering the similarities and differences of biological information among the 3 bacterial genomes and predicting important genes. In this study,pan-genome sequencing was carried out by using small DNA library preparation and sequencing technology on Illumina Hiseq 4000 sequencing platform. Genome composition analysis,gene function annotation,gene mutation detection and comparative genomics analysis were carried out by using related software. The nucleotide composition,collinearity and variation differences between the isolated Microbacterium HBT4 and model strains were analyzed. The genome size of the strain was about 3.53 Mb. It was predicted that there were 3 397 coding genes,1.33% repetitive sequences and 63 non-coding RNA in the strain HBT4. There were 3 324 annotations of gene function in general database and 1 149 annotations of gene function in special database. SNP,Small InDel and horizontal transfer genes were found through analyzing variance among strains and no structural variation genes were found. The number and proportion of GO-annotated genes in the specific genes of the strain in cell components,molecular functions and biological processes were obtained. From KEGG metabolic pathway enrichment map,it was found that the dihydrothiosyllysine residue succinyltransferase encoded by the specific gene was located between the metabolic pathway of α-ketoglutaric acid and succinyl coenzyme A in the tricarboxylic acid cycle. The proportional distribution,phylogenetic tree and collinearity of the core genome and non-essential genome of the three strains were obtained. It was found that there were 986 common gene families among the three strains,and 1171 specific gene families strain HBT4. Compared with the two model strains,the strain HBT4 obtained in this study had both similarities and differences in gene family.

Key words: atrazine, Microbacterium sp. HBT4, pan-genome, comparative genome analysis, bioinformatics