异构数据源集成工具包及其在生物医学领域的应用 贾存鑫
系统架构 Ontology
Semantic Search Server 工具包 Semantic Matcher Ontology Generator Semantic Search Server
Semantic Matcher Schema Matcher Entity linkage finder Ontology matcher RDB to ontology matcher Entity linkage finder
Ontology Generator Identify class and property Generate URIs and labels Express n-ary relation(n > 2) by binary properties Extract class hierarchy
Semantic Search Server Parse semantic search query Boolean expression Constrained keyword (e.g. C:xxx P:xxx) Convert SPARQL to SQL Provide RESTful service
Bio2RDF Dataset #Triples #Entities #Types #Properties DrugBank Largest network of Linked Data for the Life Sciences ~11 billion triples across 35 datasets Selected datasets (triple store: Virtuoso 7.1.0) Dataset #Triples #Entities #Types #Properties DrugBank 3,649,750 316,555 91 105 NDC 6,199,488 488,146 15 37 KEGG 50,197,150 6,533,307 141 63
时间安排 2014-12-01 ~ 2015-03-01 工具包及系统开发 2015-03-01 ~ 2015-04-01 论文写作(初稿) 2015-04-01 ~ 2015-04-15 论文修改及外审 2015-05 ISWC2015 in-use track投稿
Thanks!