A Survey of PIM Yukun Li
Outline: Background Knowledge; About PIM Workshop; Related Works; Position and Some Ideas; Conclusion and Outlook
Background Knowledge: Development History of PIM; Memex: A Concept from Vannevar Bush; Today’s PIM
Development History: PIM Is Not New
PIM, broadly defined, includes the management of information going into our own memories as well as the management of external information. As such, an interest in PIM-related matters is evidenced in the study of mnemonic techniques going back to ancient times.
Memex: Vannevar Bush’s Ideal PIM
The modern dialog on PIM is generally thought to have begun with Vannevar Bush’s highly inspirational article “As We May Think”, published as World War II was finally nearing its end. “The investigator is staggered by the findings and conclusions of thousands of other workers – conclusions which he cannot find time to grasp, much less to remember, as they appear”. Bush expressed a hope that technology might be used to extend our collective ability to handle information and to break down barriers impeding the productive exchange of information.
What Is Memex?
“a device in which an individual stores all his books, records, and communications, and which is mechanized so that it may be consulted with exceeding speed and flexibility.”
As Bush envisioned it, the memex would use a small head-mounted camera to record experiences and microfilm to store them, but no computer.
Today’s PIM: Great Opportunity and Great Challenge
• Development of storage devices and the Internet
• Computing devices and I/O technologies
• Information resources are becoming richer and richer
• Development of IR, DBMS, CHI, etc.
• New problems and exacerbated old problems: information islands (the same information in multiple versions); some theoretical and application questions remain unsolved
About PIM Workshop: Origin; About the 2005 PIM Workshop; A Few Elementary Concepts of PIM; About the 2006 PIM Workshop
Origin of the PIM Workshop
William P. Jones, Harry Bruce, and other professors initiated and organized the first PIM Workshop in 2005. It has now been held twice. The first workshop took place January 27-29, 2005, at the Watertown Hotel in Seattle, Washington. About 30 specialists attended, from Microsoft, IBM, the University of Washington, the University of California, MIT, NSF, and other universities and companies. Every participant was asked to submit a poster on his or her own creative idea.
About PIM Workshop About the founders
The Result of PIM Workshop 2005
A Report on the NSF-Sponsored Workshop on Personal Information Management, Seattle, WA, 2005.
The report discusses the elementary concepts of the PIM domain and puts forward key issues that deserve more research. New concepts mentioned:
• Information, information item, information form
• Personal Space of Information (PSI), Personal Information Management (PIM), Personal Information Environment (PIE)
• PIM activities: keeping, finding/re-finding, “M-level activities”
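Information, Information Item, Information Form
Information: In PIM research we focus on the capacity of information to affect a person, for example whether it influences the actions we take or the choices we make. When we choose a hotel, which one we pick depends on what we have collected about each hotel's location, price, and so on.
Information item: An information item is a unit of information. Concrete examples are an email message, a web page, a file of any type, or a document.
Information form: The type of an information item, or equivalently the concrete form in which an item is represented. Its content and variations depend on the applications and tools that are used to name, move, modify, copy, and organize information items and that may also assign attributes to them. Word documents, PDF documents, email messages, and JPG image files can each be regarded as an information form.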
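Personal Information (PI)
In PIM research, personal information has three senses:
(1) Information the person keeps for his or her own use (nature).
(2) Information about the person that is controlled by other parties; for example, a health insurer holds our health information (access).
(3) Information the person has experienced but does not control, such as the web pages we have visited (a memory problem).
My view: these three kinds are direct information. There is also another kind that can be called implicit information: information hidden beneath the direct information, which we are not aware of and which can only be discovered through mining and analysis. Examples are web pages related to the person that the person does not yet know about, and information hidden in the PI that the person has not yet noticed. (This is my own view; how to delimit this kind of information needs further study.)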
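Personal Space of Information (PSI)
A PSI is the set of all data items that a person controls, or nominally controls (control does not mean exclusive ownership of the data; an email system is an example). A PSI typically includes a person's books, paper documents, email account information, email messages, and other files related to the person that are stored on various computers. It also includes web links, applications, tools, and structures that support acquiring, storing, accessing, and using the information in the PSI.
My view:
(1) A PSI is the set of all information items associated with the person.
(2) A PSI has two parts: a known (controllable) part and an unknown part.
(3) A PSI is a function of the person's needs and of time.
(4) A PSI changes dynamically: changes in the person's needs and behavior change the PSI; items appear on their own; items disappear on their own; item attributes change; items move from unknown to known; and so on.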
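Notes on the PSI from PIM Workshop 2005
• Although in some sense we control the information items in a PSI, complete control is hard to achieve in practice. For example, we delete an email and later find that we need its content (everyone has probably had this experience).
• A PSI does not include pages we have visited that merely sit in a cache, but it can include the bookmarks we made for those pages.
• Large gray areas are unavoidable. Many files sit on network shares that we do not control, and a PSI may also contain icons, bookmarks, and folders left on our machine (desktop) by old applications, all of them created automatically.
• By definition, each entity has exactly one PSI.
• A PSI differs from a PIE. In the literature a PIE is usually tied to supporting tools and is a subset of the PSI. The physical space of an office, with its piled and sorted files, stapler, and bookcase, can be seen as a PIE; a laptop is a PIE. One person can have several PIEs.
• A PSI keeps growing, and digital information especially so. It is a potential data source that we can exploit in many ways: it can be used to personalize the person's web experience; its contents can be mined to extract information models; effective reuse of its contents can raise our productivity. Its continued growth also raises trust and security issues.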
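Introduction to the PIM Concept
On the one hand PIM seems simple to define: we deal with it and use it every day. On the other hand it is hard to define, so much so that defining it has remained a challenge.
• PIM is our everyday handling, categorizing, and accessing of information. (Lansdale, 1988)
• A system created for an individual for use in a work environment, comprising the rules and methods by which the person acquires information, the mechanisms for organizing and storing it, the rules and procedures for keeping the system running, and the mechanisms for accessing and processing information and producing output. (Barreau, 1995)
• Many definitions of PIM take the traditional information-management view: store information so that it can be accessed later. (Boardman, 2004)
From the data-storage point of view, PIM is an effort to establish, use, and maintain a mapping between information and need within a PSI. Classifying PIM activities by input, storage, and output gives a conceptual framework (Report of the PIM 2005 Workshop):
• Keeping activities affect the input of information into a PSI (from information to need).
• Finding/re-finding activities affect the output of information from a PSI (from need to information).
• “M-level activities” (e.g., “m” for “mapping” or for “maintenance and organization”) affect the storage of information within the PSI.
Open question: should PIM ultimately be system software or application software?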
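The input-storage-output framework on this slide can be illustrated with a small Python sketch. It is only a toy model under stated assumptions: the class and method names (`InformationItem`, `PersonalSpaceOfInformation`, `keep`, `organize`, `find`) and the sample items are hypothetical and do not come from the workshop report.

```python
from dataclasses import dataclass, field


@dataclass
class InformationItem:
    """One information item, e.g. an email or a file (hypothetical fields)."""
    name: str
    form: str                                  # information form, e.g. "email", "pdf"
    labels: set = field(default_factory=set)   # labels that map the item to needs


class PersonalSpaceOfInformation:
    """Toy PSI: a store of items plus a label-based mapping to needs."""

    def __init__(self):
        self.items = {}

    def keep(self, item):
        """Keeping: input of information into the PSI."""
        self.items[item.name] = item

    def organize(self, name, label):
        """M-level activity: maintain the mapping between items and needs."""
        self.items[name].labels.add(label)

    def find(self, need):
        """Finding/re-finding: output of information from the PSI."""
        return sorted(
            item.name
            for item in self.items.values()
            if need in item.labels or need in item.name
        )


psi = PersonalSpaceOfInformation()
psi.keep(InformationItem("hotel-booking.eml", "email"))
psi.keep(InformationItem("pim-report-2005.pdf", "pdf"))
psi.organize("hotel-booking.eml", "travel")
print(psi.find("travel"))  # ['hotel-booking.eml']
print(psi.find("pim"))     # ['pim-report-2005.pdf']
```

Each method touches exactly one stage of the framework, which is why the mapping (labels) is kept separate from the stored items.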
About the PIM Workshop 2006
More than 30 position papers were presented at the workshop, each accompanied by a poster. The presenters were divided into groups to discuss particular topics:
• Panel: Finding, re-finding & search
• Short panel: Privacy, PIM & group information management
• Panel: Keeping, organizing & information enhancement
• Short panel: PIM overall, PIM odds & ends
PIM 2006 paper: Exploiting Personal Search History to Improve Search Accuracy. A user-centered method for improving information search that uses history data to raise search accuracy.
PIM 2006 paper: TaskTracer: Enhancing Personal Information Management Through Machine Learning
PIM 2006 paper: PIM for Mobility
Related Branches and Works: A Summary of Works on PIM; Related Works on PDS\PIS\PDSM; Related Works on CHI (HCI); Other Works on PIM
PIM-Related Papers Published Abroad, 2005-2006 (A Summary of Works on PIM)
Journal/conference | Papers | Subtopic | Quantity
ACM SIGMOD 2005 | MyLifeBits: a Memex-Inspired Personal Store; Another TP Database (conference); From Databases to Dataspaces: A New Abstraction for Information Management (SIGMOD Record); Lifestreams: A Storage Model for Personal Data (SIGMOD Record) | PSI | 3
ACM SIGMOD 2006 | User-Defined Aggregate Functions: Bridging Theory and Practice (conference); MauveDB: Supporting Model-based User Views in Database Systems (conference) | HCI/CHI | 2
VLDB 2005 | iMeMex: Escapes from the Personal Information Jungle (demo paper) | PSI/APP | 1
VLDB 2006 | iDM: A Unified and Versatile Data Model for Personal Dataspace Management | |
PIM Workshop 2005 | A Report on the NSF-Sponsored Workshop on Personal Information Management; posters (30) | PSI/PIM/Finding/Keeping, etc. |
PIM Workshop 2006 | Position papers (32) | | 32
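Recent PIM-Related Research Papers in China (A Summary of Works on PIM)
Chinese Journal of Computers (计算机学报). Topics: HCI, Keeping.
• A goal-driven, process-reuse-based customization model for Web services. 2005, No. 4.
• A fuzzy grouping method for data distribution management. 2005, No. 7.
Journal of Software (软件学报). Topics: Survey, PSI.
• Trends in database technology (data integration, user interface technology). Meng Xiaofeng, Zhou Longxiang, Wang Shan. 2004, Vol. 15, No. 12.
• A survey of personalization service technology. Zeng Chun, Xing Chunxiao, Zhou Lizhu. 2004, Vol. 13, No. 10.
• The impact of active users on the availability of P2P file-sharing systems. Liu Hanyu, Xiao Mingzhong, Dai Yafei, Li Xiaoming. 2006.10.
Journal of Computer Research and Development (计算机研究与发展). Topic: Searching.
• An improved adaptive text information filtering model. Ma Liang, Chen Qunxiu, Cai Lianhong. 2005, No. 1, pp. 79-84.
• A survey of Web community discovery techniques. Yang Nan, Gong Danzhi, Li Xian, Meng Xiaofeng. 2005, No. 3, pp. 439-447.
• A method for Web navigation based on user behavior. Yang Jie, Wu Guoqing. 2005, No. 5, pp. 765-770.
• A Web information extraction method based on HTML pattern algebra. Li Shijun, Yu Junqing, Ou Weijie. 2006, No. 9, pp. 1644-1650.
• Fuzzy clustering of Web documents based on hierarchical neural networks. Lei Jingsheng, Ma Jun, Jin Ting. 2006, No. 10, pp. 1695-1699.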
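Comparison and Analysis (A Summary of Works on PIM)
• Research abroad is fairly systematic and attaches importance to basic theory (for example the PSI); there are many original results.
• Compared with work abroad, research on PIM and the PSI in China is still scarce and original work is limited. It concentrates on information acquisition, mainly on studying or improving particular algorithms.
• In terms of institutions, there are teams working on both PIM and the PSI, and most of them are backed by national projects.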
Important Works on PDS\PIS\PDSM
In my opinion, four articles are important:
(1) The PIM Workshop 2005 report
(2) iMeMex: Escapes from the Personal Information Jungle (demo paper)
(3) A SIGIR 2006 PIM Workshop position paper: iMeMex: A Platform for Personal Dataspace Management. Dittrich, Jens, M. Salles, S. Karakashian, et al.
(4) A VLDB 2006 regular paper: iDM: A Unified and Versatile Data Model for Personal Dataspace Management. Jens-Peter Dittrich, Marcos Antonio Vaz Salles.
Main contributions: the iMeMex data model; resource views and resource view classes.
Works on PDS\PIS\PDSM The paper’s team
Works on PDS\PIS\PDSM: Personal Data Space Management
Works on PDS\PIS\PDSM: Papers Produced by the Team
Other Related Works on PDS\PIS\PDSM
• Privacy Advisors for Personal Information Management (PIM 2006, security)
• Finding to Keep and Organize: Personal Information Collections as Context (PIM 2006)
• iMeMex: A Platform for Personal Dataspace Management (PIM 2006)
• Collecting and Organizing Web Content (PIM 2006)
• Internet-Scale Data Distribution: Some Research Problems (WISE 2006 Keynote Paper 1)
• Building a Domain Independent Platform for Collecting Domain Specific Data from the Web (WISE 2006 Keynote Paper 3)
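CHI (HCI): About CHI 2006
Human-computer interaction is in essence a cognitive process, and HCI theory rests on cognitive science. An HCI system is a closed-loop system, and HCI research takes systems science as its methodological framework. HCI also relies on information technology as the technical basis of the user interface: information system modeling, formal description, integrated algorithms, evaluation methods, and software frameworks are what finally realize and apply HCI theory.
HCI studies the techniques of interaction between people and computers. "Human-computer harmony" and interaction based on natural modalities are the current focus of attention. Studying theories and methods for natural, harmonious interaction, developing a new generation of HCI technology, and building new interface software for mainstream applications are major fundamental problems for the information field in the 21st century.
The Second China Conference on Human-Computer Interaction (CHI-CHINA 2006) was held from October 31 to November 2, 2006, in Hangzhou, Zhejiang Province. The conference invites well-known scholars from China and abroad to give keynote talks on the latest trends and hot topics in multimedia. To ensure paper quality, the program committee reviews all submissions, and accepted papers are formally published in a book titled 《中国人机交互新进展》 (New Advances in Human-Computer Interaction in China).
Conference topics:
• Interaction techniques: human-machine environments, affective computing, new interaction tools and devices, and other user-interface techniques and related issues.
• Interactive systems: architectures, interfaces, and evaluation of new interactive systems; multimodal and tangible interfaces; adaptive interaction; 3D, virtual-reality, and augmented-reality systems and interfaces; pen-based interaction and related issues.
• Methods and tools: new methods, processes, techniques, and tools for designing and developing interactive systems.
• Theory and models: description and evaluation of formal methods for HCI theories and models.
• Reflective analysis: discussion of factors related to HCI, such as age-related design, human error, and cultural influence.
• Usability research and evaluation.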
CHI (HCI): Related Papers
• Fast, Flexible Filtering with Phlat - Personal Search and Organization Made Easy (Microsoft Research, CHI 2006)
• Personal Information Enhancement (part of the Sidewalk Project) (PIM 2006)
• Web Driving: An Image-Based Opportunistic Web Browser That Visualizes a Peripheral Information Space (WISE 2006)
• User Models: A Contribution to Pragmatics of Web Information System Design (WISE 2006)
• Improving Accessibility of the Web with a Computer Game (CHI 2006; Phetch, a game-style program that collects descriptions of images)
• Verbosity: A Game for Collecting Common-Sense Facts (CHI 2006; a game as a strategy for collecting common-sense facts)
At CHI 2004 there were 10 full papers (out of 93), 5 short papers, and 4 posters focused on PIM-related topics. At CHI 2005 there were 9 full papers (out of 93), 5 short papers or posters, and 1 doctoral consortium presentation focused on PIM-related topics (from the PIM Workshop 2005 report).
Other Works on PIM: Applications of PIM
Some websites now claim to provide PIM services, but most of them offer only simple functions. There is still a long way to go before a real PIM system is realized.
Some websites related to PIM (most of them are simple)
Other Works on PIM: Related Branches
• Personal web data integration
• Personal data mining
• Cognitive psychology
• Others (mobile data management, etc.)
Position and Ideas: About Data Space; About PIM; A Few Research Topics; About Research Method
About Data Space
• The whole digital data world is taken as one big data space (the ODS), and it is expanding rapidly.
• An entity data space is a part of the ODS; entity data spaces are logically distinct from one another.
• Entities interact and sometimes include one another.
• A PDS is a sub-dataspace of the ODS.
• The relations between them are uncertain and based on semantics.
• A PDS is the data representation of a PSI.
A Common Data Space Model
• Every entity has a dataspace; the PDS is one of them.
• Important characteristics of a PDS: open system; pay-as-you-go; evolution.
• A network data model suits it better.
• Data block (logical and physical)
• Data element (logical page)
Position about PIM: Application is the nature of PIM, and this is a great motivation for research.
Position about PIM: An Application Model-Based PIM
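Topics and Ideas: Data Management in the Data Space (Storage, Indexing, and Query Optimization)
Personal information management means managing many data sources. The data are stored in different places and in different database systems, so access efficiency differs from source to source. The first topic is the storage strategy for data in the PSI. Given the huge volume of personal data, storing everything on one machine is not possible, and it is also undesirable for reasons of security and accessibility. A storage model for the dataspace is therefore a foundational problem. The goal is this: although the PI is spread over many servers, when I need some data I can find it quickly and accurately. This raises the problem of indexing the data.
Because web data is volatile and some of it can vanish at any time, we need to study data safety strategies and assess data safety in order to decide how the data should be handled.
Data indexing and query optimization: as PIM is used, the volume of data in the PSI keeps growing, which raises the cost of access, so improving access efficiency becomes a major problem. Storage here differs from storage in a traditional database, so disk I/O is no longer the key factor in access efficiency. What indexing strategy should then be used, and what does an index mean for a web database?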
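The indexing problem on this slide (items spread over many sources, yet found quickly) can be sketched as an inverted index whose postings record both the source and the item. This is a minimal illustration under assumptions, not a proposed design: the function names `build_index` and `lookup`, the source names, and the sample texts are all hypothetical.

```python
from collections import defaultdict


def build_index(sources):
    """Map each lowercase word to the set of (source, item) pairs containing it.

    sources maps a source name to a dict of {item name: item text}.
    """
    index = defaultdict(set)
    for source, items in sources.items():
        for item, text in items.items():
            for word in text.lower().split():
                index[word].add((source, item))
    return index


def lookup(index, query):
    """Return the sorted locations that contain every word of the query."""
    words = query.lower().split()
    if not words:
        return []
    hits = set.intersection(*(index.get(w, set()) for w in words))
    return sorted(hits)


# Hypothetical personal data spread over two sources.
sources = {
    "mail-server": {"msg-17": "PIM workshop call for papers"},
    "laptop": {
        "notes.txt": "ideas for PIM index design",
        "todo.txt": "buy milk",
    },
}
index = build_index(sources)
print(lookup(index, "PIM"))        # [('laptop', 'notes.txt'), ('mail-server', 'msg-17')]
print(lookup(index, "pim index"))  # [('laptop', 'notes.txt')]
```

Keeping the source name in each posting lets a single lookup answer "where is it" across machines; how such an index should be stored and kept fresh when web data changes is exactly the open question the slide raises.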
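Topics and Ideas: Modeling User Behavior in PIM
PIM in fact consists of three parts: the software, the PSI, and the user. The user's behavioral characteristics bear directly on how well PIM works, so studying them is an important PIM topic. The main content is algorithms for extracting behavioral features, representations of those features, and behavior models. User information generally includes the following parts:
• Static info: the user's basic profile data.
• Operation log (dynamic behavior): a record of the user's actions.
• User profile (characterization): a description of the user's traits obtained by inference.
By recording the user's operation log and analyzing behavioral characteristics, the system can use those characteristics as one of the parameters of the result when the user searches, and thereby return more accurate results.
The result returned for a query can be written as a function of four arguments, r = f(q, t, uf, ud), where
• q: the query submitted by the user
• t: a synonym table (e.g. IR == information retrieval)
• uf: the UserProfile file describing the user's characteristics
• ud: the user dataspace, i.e. the information content in the user's data space
There are two measures of r: the richness of the returned content (whether as much relevant information as possible was returned accurately) and the time efficiency of returning it.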
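The formula r = f(q, t, uf, ud) can be made concrete with a small Python sketch. The scoring rule here (count of matching terms plus the weights of profile interests that appear in the item) is an assumption chosen for illustration; the slides do not specify how f is computed. The function name `search` and the sample data are likewise hypothetical.

```python
def search(q, t, uf, ud):
    """Return item names from ud, ranked for query q.

    q  : query string
    t  : synonym table, mapping a term to a set of equivalent terms
    uf : user profile, mapping an interest term to a weight
    ud : user dataspace, mapping an item name to its text
    """
    # Expand the query with its synonyms (crude substring matching below).
    terms = {q.lower()} | {s.lower() for s in t.get(q, set())}
    scored = []
    for name, text in ud.items():
        text_lower = text.lower()
        hits = sum(text_lower.count(term) for term in terms)
        if hits == 0:
            continue  # item does not match the query at all
        # Boost items that mention interests recorded in the user profile.
        boost = sum(w for interest, w in uf.items()
                    if interest.lower() in text_lower)
        scored.append((hits + boost, name))
    # Highest score first; ties broken by name for a stable order.
    return [name for score, name in sorted(scored, key=lambda p: (-p[0], p[1]))]


t = {"IR": {"information retrieval"}}
uf = {"database": 2.0}
ud = {
    "a.txt": "IR spectroscopy of polymers",
    "b.txt": "Information retrieval for database users",
    "c.txt": "Cooking notes",
}
print(search("IR", t, uf, ud))  # ['b.txt', 'a.txt']
```

In this example the synonym table lets "IR" match the information-retrieval document, and the profile pushes it above the spectroscopy document, which is the kind of disambiguation a user profile is meant to provide.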
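Demo: A PSI/PIM for Researchers
Scenario 1: An engineer working on PIM finds that PIM is closely related to IR, CHI, and other fields, and decides to read some papers on the latest CHI results. He types "CHI" into Google and finds the CHI 2006 conference pages, where some papers are listed as best-paper award winners, but only the abstracts are available. He spends two hours searching every corner of the CHI 2006 website and never finds the papers.
Scenario 2: A computer engineer wants to find papers on IR on the web. He browses the Journal of Software (软件学报) site and similar sites but wants more, so he types "IR" into Google. Most of the results have nothing to do with what he wants. Only the eighth result, the Harbin Institute of Technology Information Retrieval Lab, is somewhat related, and even that site may not have papers on the topic. Typing "IR, 论文" ("IR, papers") returns articles on physics and chemistry.
This raises a question: how can a technical professional get the information he needs from the web quickly and accurately? And once relevant content is found, how can it be classified, saved, and presented in the form the user finds easiest to take in? We therefore plan to build an R-PIM system whose users are technical staff in research: a PSI and PIM prototype that serves as an experimental platform for our research and development. Its functions include daily information management and the integration, classification, storage, and querying of web information.
The ultimate goal: when we are working on a research topic, the system presents the web's information on that topic as efficiently and intuitively as possible and, based on keywords, classifies and analyzes the papers and their references in tree or network form. Researchers can then spend their valuable time on more creative work.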
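The keyword-based tree classification mentioned as the ultimate goal can be sketched with nested dictionaries. This is only an illustration under assumptions: the function name `build_keyword_tree`, the `"_papers"` key, and the keyword paths assigned to the paper titles are hypothetical, chosen for the example rather than taken from the R-PIM design.

```python
def build_keyword_tree(papers):
    """Nest paper titles under their keyword paths.

    papers maps a title to a keyword path such as ("PIM", "Finding").
    Titles are stored in a list under the "_papers" key of the last node.
    """
    tree = {}
    for title, path in papers.items():
        node = tree
        for keyword in path:
            node = node.setdefault(keyword, {})
        node.setdefault("_papers", []).append(title)
    return tree


# Keyword paths below are illustrative assumptions, not an official taxonomy.
papers = {
    "Exploiting Personal Search History to Improve Search Accuracy": ("PIM", "Finding"),
    "PIM for Mobility": ("PIM", "Mobility"),
    "Role of Memory in PIM": ("PIM", "Finding"),
}
tree = build_keyword_tree(papers)
print(sorted(tree["PIM"]))                    # ['Finding', 'Mobility']
print(len(tree["PIM"]["Finding"]["_papers"]))  # 2
```

A network (graph) form would additionally link papers to the references they cite; the tree above covers only the keyword hierarchy.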
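About Research Method
• Follow research in related fields, including new theoretical results, techniques, and applications.
• Settle step by step on one application environment (researchers, office clerks, etc.), propose new ideas and topics, and study them in depth. Keep abstracting and generalizing the problems and do some theoretical exploration, especially on the data space data model and query optimization.
• Pay attention to application development: collecting raw data, running experiments, and evaluating results all depend on software support, and a demo or prototype system is itself a research result.
• Decompose the project into sub-topics so that each piece of work is reasonably independent yet an organic part of the whole.
• We hope more people will join the research on this very vital topic.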
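Conclusion and Outlook
• PIM is closely related to our current research. PIM is in effect the next step after the information integration we are doing now, or a practical application area for it. One can also say that, to some extent, web information integration is one link in PIM.
• PIM is gradually becoming a research hot spot, and its results have great social value.
• PIM research will develop on both the theoretical and the applied side, and more papers on specific application topics will appear. As new results keep coming, theoretical problems will gradually be posed and solved. The move from PIM to GIM will let web information resources be used truly efficiently, and it brings database technology a new challenge and a new opportunity.
• We should give PIM, the PSI, and GIM enough attention now and begin related research. If we start now, obtaining some theoretical and applied results is feasible.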
References
[1] Bade, Korinna & A. Nurnberger. Personalized Structuring of Retrieved Items (IR). PIM 2006 Workshop.
[2] Barreau, Deborah, Jane Greenberg, Abe Crystal, & Anuj Sharma. Personal Information Management in a Learning Context. PIM 2006 Workshop.
[3] Bellotti, Victoria & Jim Thornton. Managing Activities with TV-ACTA: TaskVista and Activity-Centered Task Assistant. Personal Information Management - A SIGIR 2006 Workshop.
[4] Bradshaw, Shannon, Marc Light, & David Eichmann. (Bee)Dancing on the Boundary between PIM and GIM. Personal Information Management - A SIGIR 2006 Workshop.
[5] Capra, Robert G. III & Manuel Perez-Quinones. Factors and Evaluation of Refinding Behaviors. Personal Information Management - A SIGIR 2006 Workshop.
[6] Catarci, Tiziana, Benjamin Habegger, & Antonella Poggi. Intelligent User Task Oriented Systems. Personal Information Management - A SIGIR 2006 Workshop.
[7] Chaytor, Rhonda, Edward Brown, & Todd Wareham. Privacy Advisors for Personal Information Management. Personal Information Management - A SIGIR 2006 Workshop.
[8] Chirita, Paul, Julien Gaugaz, Stefania Costache, & Wolfgang Nejdl. Context Detection on the Desktop Combining Multiple Evidences. Personal Information Management - A SIGIR 2006 Workshop.
[9] Cutrell, Edward, Susan Dumais, & Raman Sarin. New directions in personal search UI. Personal Information Management - A SIGIR 2006 Workshop.
[10] Czerwinski, Mary. User Interface Support for Today's Crazed Information Worker: From Scatterbrained to Focused. Personal Information Management - A SIGIR 2006 Workshop.
[11] Dittrich, Jens, M. Salles, S. Karakashian, et al. iMeMex: A Platform for Personal Dataspace Management. Personal Information Management - A SIGIR 2006 Workshop.
[12] Dontcheva, Mira, Steven Drucker, Geraldine Wade, David Salesin, & Michael Cohen. Collecting and Organizing Web Content. Personal Information Management - A SIGIR 2006 Workshop.
[13] Dumais, Susan. PSearch: An interface for combining personal and general results.
[14] Elsweiler, David, Ian Ruthven, & Linxiao Ma. Role of Memory in PIM.
[15] Fisher, Danyel, AJ Brush, & Marc Smith. Social Information Matters!
[16] Groth, Kristina & Kerstin Severinson Eklundh. Combining Personal and Organisational Information.
[17] Gwizdka, Jacek. Finding to Keep and Organize: Personal Information Collections as Context.
[18] Hawkey, Kirstie & Kori M. Inkpen. Incidental Information Privacy and PIM.
[19] Jones, William & Harry Bruce. Personal Information for a World as We Want It to Be.
[20] Kirsh, David. Personal information objects & Burden of multiple personal spaces.
[21] Lepouras, George, Alan Dix, & Akrivi Katifori. OntoPIM: From Personal Information Management to Task Information Management.
[22] Li, Yingang, Javed Mostafa, & Xiaofeng Wang. A Privacy Enhancing Infomediary for Retrieving Personalized Health Information from Web.
[23] Maier et al. Personal Information Enhancement (part of Sidewalk Project).
[24] Murthy, Sudarshan, Uma Murthy, & Ed Fox. Using Superimposed and Context Information to Find and Re-find Sub-documents.
[25] Murthy, Uma, Ingrid Burbey, Guyhyun Kwon, Nicholas Polys, Prince Vincent, & M. A. Perez-Quinones. Re-finding from a Human Information Processing Perspective: Designing a Personal Memex for Different Populations.
[26] Shen, Xuehua, ChengXiang Zhai, & Bin Tan. Exploiting Personal Search History to Improve Search Accuracy.
[27] Singh, Gurminder. PIM for Mobility.
[28] Spurgin, Kristina. A Sense-Making Approach to Personal Information Management.
[29] Stumpf, Simone, Margaret Burnett, Tom Dietterich, & Jon Herlocker. TaskTracer - Enhancing Personal Information Management Through Machine Learning.
[30] Tungare, Manas, Pardha S. Pyla, Miten Sampat, & Manuel Perez-Quinones. Defragmenting Information using the Syncables Framework.
[31] Yu, Xiaoyan, Mohammad Alkandari, Pengbo Liu, & Manuel A. Perez-Quinones. Visualizing a Personal Social Network of Email Archives for Re-Finding.
[32] Zhang, Yi. Bayesian Graphical Models for Adaptive Filtering (Position and Relevant Work for PIM).
[33] Edward Cutrell, Daniel C. Robbins, Susan T. Dumais, Raman Sarin. Fast, Flexible Filtering with Phlat - Personal Search and Organization Made Easy. CHI 2006.
[34] Luis von Ahn, Shiry Ginosar, Mihir Kedia, Ruoran Liu, and Manuel Blum. Improving Accessibility of the Web with a Computer Game. CHI 2006. (Phetch, a game-style program that collects descriptions of images.)
[35] Luis von Ahn, Mihir Kedia, and Manuel Blum. Verbosity: A Game for Collecting Common-Sense Facts. CHI 2006. (A game as a strategy for collecting common-sense facts.)
[36] Mile Nakaoka, Taro Tezuka, and Katsumi Tanaka. Web Driving: An Image-Based Opportunistic Web Browser That Visualizes a Peripheral Information Space. WISE 2006.
[37] Klaus-Dieter Schewe and Bernhard Thalheim. User Models: A Contribution to Pragmatics of Web Information System Design. WISE 2006.
WISE 2006: Web Document Analysis
[38] Xiaoyuan Li, Lidan Shou, Gang Chen, Lujiang Ou. A Latent Image Semantic Indexing Scheme for Image Retrieval.
[39] Yu Li, Xiaofeng Meng, Qing Li, Liping Wang. Hybrid Method for Automated News Content Extraction.
[40] Yan Liu, Qiang Wang, Qingxian Wang. A Heuristic Approach for Topical Information Extraction from News Pages.
WISE 2006: Security and Trust
[41] M. Tamer Özsu. Internet-Scale Data Distribution: Some Research Problems. WISE 2006 Keynote Paper 1.
[42] Lizhu Zhou. Building a Domain Independent Platform for Collecting Domain Specific Data from the Web. WISE 2006 Keynote Paper 3.
[43] S. Cohen (Technion, Israel Institute of Technology). User-Defined Aggregate Functions: Bridging Theory and Practice. ACM SIGMOD 2006. (Studies the translation between user-defined aggregate functions and system interfaces and proposes a framework; PIM has a similar bridging problem.)
[44] Amol Deshpande, Samuel Madden. MauveDB: Supporting Model-based User Views in Database Systems. ACM SIGMOD 2006. (Of some reference value for HCI design in PIM.)
[45] G. Bell. Keynote: MyLifeBits: a Memex-Inspired Personal Store; Another TP Database. In ACM SIGMOD, 2005.
[46] J.-P. Dittrich, M. A. V. Salles, D. Kossmann, and L. Blunschi. iMeMex: Escapes from the Personal Information Jungle (Demo Paper). In VLDB, 2005.
[47] X. Dong and A. Halevy. A Platform for Personal Information Management and Integration. In CIDR, 2005.
[48] X. Dong, A. Halevy, J. Madhavan, and E. Nemes. Reference Reconciliation in Complex Information Spaces. In ACM SIGMOD, 2005.
[49] M. Franklin, A. Halevy, and D. Maier. From Databases to Dataspaces: A New Abstraction for Information Management. SIGMOD Record, 34(4):27–33, 2005.
[50] M. J. Franklin and S. B. Zdonik. "Data In Your Face": Push Technology in Perspective. In ACM SIGMOD, 1998.
[51] E. Freeman and D. Gelernter. Lifestreams: A Storage Model for Personal Data. SIGMOD Record, 25(1):80–86, 1996.
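Two notes:
• Time was short and some of the work is still in progress, and I may not yet have seen some of the material, so some views may be incomplete. I hope you will point this out and discuss it.
• The references are mainly papers from database conferences and the PIM workshops. The Chinese references were listed in detail earlier, so they are not repeated in the reference list.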
PIM in my opinion: Listening to Music in the Information Sea.
I sincerely look forward to more contributions and creative ideas. Thanks!