Biotechnology Bulletin, 2024, Vol. 40, Issue (2): 313-324. doi: 10.13560/j.cnki.biotech.bull.1985.2023-0748

• Research Report •

Trait-regulated-genes Ontology Model Construction and Application by Integrating Cross-species Scientific Data

ZHANG Dan-dan1, ZHAO Rui-xue1,2, XIAN Guo-jian1,3, XIONG He1

  1. Agricultural Information Institute of Chinese Academy of Agricultural Sciences, Beijing 100081
  2. Key Laboratory of Agricultural Integration Publishing Knowledge Mining and Knowledge Service, National Press and Publication Administration, Beijing 100081
  3. Key Laboratory of Agricultural Big Data, Ministry of Agriculture and Rural Affairs, Beijing 100081
  • Received: 2023-08-05 Published: 2024-02-26 Online: 2024-03-13
  • Corresponding author: ZHAO Rui-xue, female, Ph.D., research professor and doctoral supervisor; research interests: applied agricultural information technology, knowledge organization and knowledge services; E-mail: zhaoruixue@caas.cn
  • About the author: ZHANG Dan-dan, female, Ph.D. candidate; research interests: functional genomics, knowledge graphs, and subject knowledge services; E-mail: zhangdandan01@caas.cn
  • Funding:
    Science and Technology Innovation 2030 "New Generation Artificial Intelligence" Major Project (2021ZD0113705)

Abstract:

【Objective】 New technologies have sharply increased the volume of breeding data, and computational breeding places new demands on knowledge services. This study addresses two problems in crop breeding knowledge services: inefficient acquisition of cross-species subject knowledge and the difficulty of discovering elite pleiotropic genes. 【Method】 We constructed a trait-regulated-genes ontology model framework and defined the entity hierarchy and entity attributes of the model. Taking the staple crops rice, maize, and wheat and the model plant Arabidopsis thaliana as data collection objects, we built a knowledge graph that uses the trait-regulated-genes ontology model as its schema layer and ran experiments on it. 【Result】 The resulting trait-regulated-genes ontology model covers 13 entity types, 16 data attributes, and 14 object attributes. The knowledge graph built with this model as its ontology layer supports cross-species associative retrieval of subject knowledge, mining of elite pleiotropic genes, and cross-species gene function prediction. 【Conclusion】 The proposed construction method enables the associative discovery of trait regulatory genes across species, can improve the efficiency of cross-species subject knowledge acquisition, and can back functional gene discovery results with evidence-seeking analysis over multi-dimensional scientific data. The study offers a feasible path for mining pleiotropic genes and predicting gene function, and provides effective data support for crop breeding research.

Key words: ontology model, knowledge discovery, knowledge graph, cross-species, regulatory gene
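
To make the applications named in the abstract concrete, the following is a minimal sketch of pleiotropic gene mining and cross-species function prediction over a toy triple store. It is not the authors' implementation. The predicate names (`belongs_to`, `regulates`, `homologous_to`), the gene and trait identifiers, and the rule "a gene may share the traits of its homologs" are all assumptions introduced for illustration; the abstract does not list the model's actual 13 entity types or its attributes.

```python
from collections import defaultdict

# Hypothetical toy triples (subject, predicate, object).
# Every identifier and predicate name here is invented for illustration.
TRIPLES = [
    ("gene:OsG1", "belongs_to", "species:rice"),
    ("gene:ZmG1", "belongs_to", "species:maize"),
    ("gene:AtG1", "belongs_to", "species:arabidopsis"),
    ("gene:OsG1", "regulates", "trait:grain_size"),
    ("gene:OsG1", "regulates", "trait:plant_height"),
    ("gene:AtG1", "regulates", "trait:flowering_time"),
    ("gene:OsG1", "homologous_to", "gene:ZmG1"),
    ("gene:AtG1", "homologous_to", "gene:ZmG1"),
]


def build_index(triples):
    """Index triples as predicate -> subject -> set of objects."""
    index = defaultdict(lambda: defaultdict(set))
    for subject, predicate, obj in triples:
        index[predicate][subject].add(obj)
    return index


def pleiotropic_genes(index, min_traits=2):
    """Return genes linked to at least `min_traits` distinct traits."""
    return {
        gene: set(traits)
        for gene, traits in index["regulates"].items()
        if len(traits) >= min_traits
    }


def homologs(index, gene):
    """Collect homologs of `gene`, treating the relation as symmetric."""
    found = set(index["homologous_to"].get(gene, set()))
    for other, partners in index["homologous_to"].items():
        if gene in partners:
            found.add(other)
    found.discard(gene)
    return found


def predict_traits(index, gene):
    """Suggest candidate traits for `gene` from its homologs' annotations.

    Returns trait -> set of homologous genes supporting the suggestion.
    Traits already annotated for `gene` are skipped.
    """
    known = index["regulates"].get(gene, set())
    candidates = defaultdict(set)
    for homolog in homologs(index, gene):
        for trait in index["regulates"].get(homolog, set()):
            if trait not in known:
                candidates[trait].add(homolog)
    return dict(candidates)


if __name__ == "__main__":
    idx = build_index(TRIPLES)
    print(pleiotropic_genes(idx))
    print(predict_traits(idx, "gene:ZmG1"))
```

In this toy data, the unannotated maize gene inherits candidate traits from its rice and Arabidopsis homologs, and each suggestion keeps the supporting genes so it can be traced back to evidence. A real knowledge graph would more likely live in a graph database or RDF store and weigh homology evidence rather than accept it outright.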