生物技术通报 ›› 2014, Vol. 0 ›› Issue (7): 119-124.

• 研究报告 • 上一篇    下一篇

基于高通量测序的辽东栎转录组学研究

刘玉林1, 李伟1, 张志翔2   

  1. 1.北京林业大学生物科学与技术学院, 北京 100083;
    2.北京林业大学自然保护区学院, 北京 100083
  • 收稿日期:2013-12-10 出版日期:2014-07-15 发布日期:2014-07-16
  • 作者简介:刘玉林, 男, 博士, 研究方向:能源树种的转录组学;E-mail:lyl12504001@126.com
  • 基金资助:
    “十二五”国家科技支撑计划课题(2011BAD22B08), 国家林业公益性行业科研专项项目(201004001)

Transcriptome Analysis for Quercus liaotungensis Koidz. Based on High-throughput Sequencing Technology

Liu Yulin1, Li Wei1, Zhang Zhixiang2   

  1. 1. College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing 100083;
    2. College of Nature Conservation, Beijing Forestry University, Beijing 100083
  • Received:2013-12-10 Published:2014-07-15 Online:2014-07-16

摘要: 应用Illumina Solexa Hiseq 2000高通量测序技术对辽东栎的芽、花、叶及果实的混合样品进行转录组测序, 结果共获得3.8 Gb的有效数据。应用Trinity软件对有效序列从头拼接去重复后, 共获得95 800条unigene, 总长度为73.57 Mb, 最大长度、平均长度和N50分别为11 284 bp、768 bp和1 373 bp。利用Blastx与公共数据库Nr和Swiss-Prot的同源性比较(E值<1×10-5)发现, 38 163条unigene未发现与公共数据库中的序列具有同源性。通过KEGG数据库中参与淀粉合成与代谢的pathway分析, 共发掘出67条参与淀粉合成的unigene, 编码9个关键酶。此外, 在13 380 条unigene 中共搜索到15 901个SSR 位点, 其中二核苷酸和三核苷酸的重复类型占所有SSR位点的98.16%。

关键词: 辽东栎, Illumina Solexa, 转录组, 淀粉, SSR

Abstract: In this study, Illumina Solexa Hiseq 2000 high-throughput sequencing technology was used to get the comprehensive transcriptome from mixed samples of buds, flowers, leaves and fruits of Quercus liaotungensis. As a result, 3.8 Gb effective data was obtained. After de novo assembly by the software of Trinity, a total of 95 800 unigenes were generated, corresponding to a total of 73.57 Mb with a maximum length, average length and N50 of 11 284 bp, 768 bp and 1 373 bp respectively. Using Blastx against the public databases of Nr and Swiss-Prot with an E-value cut-off of 10-5, 38 163 unigene were not found in any databases with a high homology. According to the KEGG pathway assignment, 67 unigene encoding nine key enzymes which may involve in starch synthesis were identified. In addition, 15 901 potential SSR loci were detected from 13 380 unigene. Of them, dinucleotide repeat and trinucleotide repeat accounted for 98.16% of all.

Key words: Quercus liaotungensis, Illumina Solexa, Transcriptome, Starch, SSR