AskDBP中的实体连接方法
EntityLinking 输入:word序列 输出:(mention,uri序列)二元组序列 the man convicted of killing Martin Luther King (Martin Luther King,<http://dbpedia.org/resource/Martin_Luther_King,_Jr.>)
EntityLinking具体方法 1、Entity search 2、pagelink扩展实体集合 3、确定mention和uri 通过Lucene根据abstract域搜索相关entity,取top20 2、pagelink扩展实体集合 仅考虑正向链接到的实体 3、确定mention和uri 如果uri的localname或其重定向uri的localname是输入字符串的一个严格子 串,则认为这个子串是一个mention,这个uri为对应linking。
EntityLinking具体方法 the man convicted of killing Martin Luther King Top20: <http://dbpedia.org/resource/Martin_Luther_King,_Jr.> <http://dbpedia.org/resource/Gerald_Posner> <http://dbpedia.org/resource/Clayborne_Carson> <http://dbpedia.org/resource/Clarence_Benjamin_Jones> <http://dbpedia.org/resource/Dexter_Scott_King> <http://dbpedia.org/resource/Engine_Company_25_(Washington,_D.C.)> <http://dbpedia.org/resource/Alpha_Man:_The_Brotherhood_of_MLK> <http://dbpedia.org/resource/Louise_Martin> ...... pagelink扩展: <http://dbpedia.org/resource/Martin_Luther_King,_Jr.> ...... 重定向: Martin Luther King
EntityLinking效果 49个句子级输入,59(?)个实体,找到47个mention,44个确认正确 building owned by the Bank of America was featured in the TV series MegaStructures TV series <http://dbpedia.org/resource/Television_program> birds are protected under the National Parks and Wildlife Act National Parks <http://dbpedia.org/resource/National_park> Golden Globe awards the daughter of Henry Fonda win Golden Globe <http://dbpedia.org/resource/Golden_Globe_Award>(mention)
ClassLinking 输入:word序列 输出:(mention,uri序列)二元组序列 the architects of the tallest building in Japan [building] http://dbpedia.org/ontology/Building [architects] http://dbpedia.org/ontology/Architect
ClassLinking具体方法 1、Entity search 通过Lucene根据abstract域搜索相关entity,取top25 2、pagelink扩展实体集合 仅考虑正向链接到的实体 3、获取上述所得实体的class信息 4、确定mention和uri 如果uri的localname是输入字符串的一个严格子串,则认为这个子串是 一个mention,这个uri为对应linking。(通过WordNet取词干略微放松 判断的标准,解决复数class的问题)
ClassLinking具体方法 the president who approved the use of atomic weapons against Japan during World War II Top25: <http://dbpedia.org/resource/The_Devil's_Voyage> <http://dbpedia.org/resource/My_Trial_as_a_War_Criminal> <http://dbpedia.org/resource/Nuclear_weapons_debate> <http://dbpedia.org/resource/Harry_S._Truman> <http://dbpedia.org/resource/Japanese_nuclear_weapon_program> <http://dbpedia.org/resource/List_of_common_World_War_II_infantry_weapons> class: http://dbpedia.org/ontology/President http://dbpedia.org/ontology/Weapon pagelink: ...... <http://dbpedia.org/resource/Harry_S._Truman> mention: [president] http://dbpedia.org/ontology/President [weapons] http://dbpedia.org/ontology/Weapon
ClassLinking效果 49个句子级输入,50(?)个class,找到41个,确认正确38个。 In city the sports commentator know as Memory Man live [sports] http://dbpedia.org/ontology/Sport the actor who plays Dan White in Milk [plays] http://dbpedia.org/ontology/Play German mathematicians were members of the von Braun rocket group [group] http://dbpedia.org/ontology/Group [rocket] http://dbpedia.org/ontology/Rocket
Thank you~