Presentation is loading. Please wait.

Presentation is loading. Please wait.

Research Libraries: Digital Intermediaries & Digital Archives -- Stanford’s plan, practice, & application Michael A. Keller University Librarian Director.

Similar presentations


Presentation on theme: "Research Libraries: Digital Intermediaries & Digital Archives -- Stanford’s plan, practice, & application Michael A. Keller University Librarian Director."— Presentation transcript:

1 Research Libraries: Digital Intermediaries & Digital Archives -- Stanford’s plan, practice, & application Michael A. Keller University Librarian Director of Academic Information Resources Founder/Publisher of HighWire Press Publisher of Stanford University Press --- CALIS Conference, Chengdu, PRC 15 May 2007

2 研究型图书馆: 数字化媒介与数字化存档 -- 斯坦福大学的规划、实践与应用
研究型图书馆: 数字化媒介与数字化存档 -- 斯坦福大学的规划、实践与应用 斯坦福大学图书馆总馆长 学术信息资源总监 HighWire出版社创办人与社长 斯坦福大学出版社社长 迈克尔 · 凯勒 中华人民共和国四川省成都市 2007年5月15日

3 The Cycle Readers Author (Professors (Professors Teachers Teachers)
Students) Author (Professors Teachers) Publishers Libraries Hosts ISPs (HighWire Press) Reviewers (Professors) Distributors Booksellers

4 循环圈 读者 (教授 老师 作者 学生) (教授 老师) 出版者 图书馆 管理者 网络服务提供商 (HighWire 出版社) 审读者
(教授) 发行人 销售人

5 The Cycle – Intermediaries
Author (Professors Teachers) Readers (Professors Teachers Students) Publishers Libraries Hosts ISPs (HighWire Press) Reviewers (Professors) Distributors Booksellers

6 循环圈– 媒介 读者 (教授 作者 老师 (教授 学生) 老师) 出版者 图书馆 管理者 网络服务提供商 (HighWire 出版社)
审读者 (教授) 发行人 销售人

7 The Cycle -- Digitization
Author (Professors Teachers) Readers (Professors Teachers Students Publishers Libraries Hosts ISPs (HighWire Press) Reviewers (Professors) Digitization Distributors Booksellers

8 循环圈-数字化 读者 (教授 老师 学生) 作者 (教授 老师) 出版者 图书馆 管理者 网络服务提供商 (HighWire 出版社)
审读者 (教授) 数字化 发行人 销售人

9 Stanford Strategic Positions
HighWire Press, a unit of Stanford University Libraries – serves scholarly publishers Intersection of professors as authors, publishers, editors and reviewers with librarians as information managers with information technologists as service Merging Libraries with Academic Computing Undertaking digital archiving (LOCKSS/CLOCKSS, Stanford Digital Repository) Including Stanford University Press

10 斯坦福的战略规划 HighWire 出版社是斯坦福大学图书馆的一个分支机构, 专门为学术期刊提供服务 图书馆与学术计算机构相结合
教授兼为作者、出版者、编辑及审读者 图书馆员兼信息管理者 信息技术人员提供服务 图书馆与学术计算机构相结合 从事数字存档(LOCKSS/CLOCKSS, 斯坦福数字库) 包括斯坦福大学出版社

11

12 斯坦福大学图书馆网页

13 HighWire Press Receives digital “manuscripts” of articles including data supplements not found in print editions Processes, adding features Publishing before print edition Several image resolutions Hyperlinking citations to cited references Alerting services PDF & HTML versions Citation mapping Corresponding with authors World-wide instantaneous delivery enabling many researchers to read simultaneously; no waiting for the print edition Some publishers abandoning print; more to follow

14 HighWire 出版社 接收文章的数字“文本”, 包括未出现在印刷本中的资料附录 加工处理, 增加功能
发表先于印刷版本的电子版 多种分辨度图像 链接引文与被引用的参考资料 多种预告服务 PDF 及 HTML 文本 引文示意图表 与作者联络 通过全球即时发行使众多研究者能够同时读到电子版文章,无需等待印刷版的发表 一些出版商已停止发行印刷版期刊, 更多的出版商也将这样做 网页:

15

16 《细胞生物杂志》文章样本

17

18 《细胞生物杂志》文章样本

19

20 《细胞生物杂志》文章样本

21

22 《细胞生物杂志》文章样本

23

24 《细胞生物杂志》文章样本

25

26 《细胞生物杂志》文章样本

27

28 《细胞生物杂志》文章样本

29 “Thumbnail image”

30 “微型图形”

31 Medium size image

32 中等尺寸图形

33 Large size image

34 大幅尺寸图形

35 Toll Free Linking

36 免费链接

37 Citation linking: destination

38 引述链接: 终端

39 “Prospective citing”

40 “预期引述”

41 Citation Map

42 引述示意图

43

44 Highwire Press 出版社网站(1)

45

46 Highwire Press 出版社网站(2)

47

48 Highwire Press 出版社网站(3)

49 Stanford’s HighWire Press summary of strategic importance
Enables web versions of many high impact, highly cited scholarly journals published Network distribution makes instant distribution possible, making all readers equal Provides numerous services making research faster, better, more penetrating Links readers and authors Embodies collaboration among publishers, librarians, information technologists Not for profit, a Stanford enterprise

50 斯坦福大学 HighWire Press 重要战略之总结
以电子版形式出版影响大及引用率高的学术期刊 通过网络实现即时发行,使所有读者能同时读到新发表的文章 提供多种服务,以加快检索速度,提高检索质量及精确度 加强读者与作者的沟通 实现出版者、图书馆员及信息技术人员之间的合作 非盈利性质的斯坦福大学附属机构

51 "Science, Scholarship, and Internet Publishing: The HighWire Story" Syllabus Magazine, October 1998
EXCERPT: "Scientists, scientific editors and publishers, scholarly society officers, and an enterprise unit of the Stanford University Libraries named HighWire Press have worked together over the past three and a half years to publish Internet editions of 70 influential scientific journals. Three significant accomplishments have resulted. First, there has evolved a mode of scholarly communication which serves readers, and facilitates research as much as it supports the clarity and validity of scientific discourse; this model has become a standard in Internet scholarly publishing. Second, an active community of scholarly editors and publishers has intensified the benefits of online scholarly publishing to the scientific, medical and technical communities at large. Third, the products of life sciences research in the advanced economies of Europe and North America are now more widely available than ever before, stimulating scientific and other cultural developments in other parts of the world."

52 “科学、学术及网络发行:HighWire 历史” 《摘要杂志》1998年10月号

53

54 斯坦福大学网站 各机构网页

55 The Challenge of Digital Preservation
Bit rot Obsolescence Format Technology Distribution and dissipation Migrations and transitions People (2 – 20 years) Software (5 – 10 years) Hardware (3 – 5 years) Benign neglect doesn’t work for digital objects. Preservation requires active, managed care.

56 数字资源保存的挑战 字节损蚀 过时 发行与分散 迁移与过渡 无为而治的做法不适于数字资源的保存,它需要积极的态度和妥善的管理 格式 技术
人员 (2 – 20 年) 软件 (5 – 10 年) 硬件 (3 – 5 年) 无为而治的做法不适于数字资源的保存,它需要积极的态度和妥善的管理

57 Three Major Areas of Preservation Needs
Google Books (’000s of TB) Parker Manuscripts (75 TB) MJF Media (50 TB) NGDA (10 TB) ~30 other digi projects (15 TB) Purchased collections (25 TB) Digital Library SULAIR collections & resources Digitization artifacts Institutional Repository Research data, Publications, dissertations, Learning objects, university assets “External” Depositors Online preservation and access Dark archive Research data: supplemental data for published works to meet granting agency requirements archiving shelved research projects web citations HighWirePress (32 TB ) Stanford Univ Press (10 TB) Other Academic Publishers

58 数字资源保存的三个主要领域 Google Books (’000s of TB) Parker Manuscripts (75 TB)
数字化图书馆 斯坦福图书馆藏书与资源 数字化文物 公共机构库存 研究资料, 出版物,论文, 学习目标,大学资产 “外部“ 库存 网上保存 密存档案 Google Books (’000s of TB) Parker Manuscripts (75 TB) MJF Media (50 TB) NGDA (10 TB) ~30 other digi projects (15TB) Purchased collections (25 TB) HighWirePress (32 TB ) Stanford Univ Press (10 TB) Other Academic Publishers

59 Design Objectives & Assumptions
Preservation-focused archive Replicated content multiple copies, geographically distributed Secure Auditable Modular Tiered storage environment online, nearline, offline Version rather than delete Content-agnostic content audits component audits security audit process & procedural (aka PWC-style) audits

60 设计目标 及 设想 以保存为主的档案 经复制的内容 多份拷贝,不同地点储存 安全 便于检查 模式化 分层存储环境 在线,近线,无线
制成不同版本而非删除 内容的不可知

61 Core Repository Functionality
Preserving access to digital information over time …through generations of technology obsolescence and change. Maintaining integrity of that information over time …through generations of migration and reformatting. Repository Services Functionality All (or almost all) user-facing services Enhanced access & delivery through applications Data mining, dry research, new indexing, e-science, etc. Federation

62 核心仓储的功能 仓储服务功能 保持不同时期数字化资料的获取 …不因技术的更新换代而受影响 保持不同时期的资料的完整性
…不因时代的迁移与过渡而受影响 仓储服务功能 全部(或基本全部)直接为用户服务 通过运作增进信息的获取及传递 进行资料挖掘,无预期结果的研究,索引更新,网络科学研究,等等 结成联盟

63 SDR: Core Repository vs. Repository Services

64 SDR: 核心存储之与存储服务

65 SDR Serves As Common Preservation Infrastructure
while specialty archives and applications provide focused digital content collection, access and value-added services National Geospatial Digital Archive (NGDA) Geospatial data SUL Digital Bookshelves (Google Books, internally digitized, vendors' e-books) Digital Library Applications (images, mss, media, Special Collections showcases) Institutional Repository (faculty- and student submitted papers, data, websites, etc.) Stanford Digital Repository (SDR): content agnostic, preservation repository

66 SDR作用于公共保存设施 同时其专业档案馆及专业技术的应用还提供具有针对性数字化内容的收集、获取和增值服务
National Geospatial Digital Archive (NGDA) Geospatial data SUL Digital Bookshelves (Google Books, internally digitized, vendors' e-books) Digital Library Applications (images, mss, media, Special Collections showcases) Institutional Repository (faculty- and student submitted papers, data, websites, etc.) Stanford Digital Repository (SDR): content agnostic, preservation repository

67 SDR Workflow SDR Book Reader Geospatial Data Luna DEWI (?) Digital
Collections External Collections SDR Conversion Access Layer Storage Layer Ingest Virus Check Ingest

68 SDR 流程图 SDR Book Reader Geospatial Data Luna DEWI (?) Digital
Collections External Collections SDR Conversion Access Layer Storage Layer Ingest Virus Check Ingest

69 SDR High-Level Architecture

70 SDR 高层结构图

71 SDR Component Diagram

72 SDR结构图示

73 SDR Physical Topology Module(s) Hardware Conversion, Gatekeeper
March 2006 Module(s) Hardware Conversion, Gatekeeper Sun Fire X4100 Server 4 TB Nexsan SATA Disk Ingest, Storage code, Storage Request Processor Sun Fire X4100 Server 4 TB Nexsan SATA Disk Online storage 32 TB Sun Honeycomb Storage System Tape Copies Sun StorEdge L700 Tape Library, with LTO2 drives IBM Tivoli Storage Manager Iron Mountain data protection plan Access Service, Access Cache 8 TB of Nexsan SATA Disk

74 SDR实体结构 组件 硬件 Conversion, Gatekeeper Sun Fire X4100 Server
4 TB Nexsan SATA Disk Ingest, Storage code, Storage Request Processor Sun Fire X4100 Server 4 TB Nexsan SATA Disk Online storage 32 TB Sun Honeycomb Storage System Tape Copies Sun StorEdge L700 Tape Library, with LTO2 drives IBM Tivoli Storage Manager Iron Mountain data protection plan Access Service, Access Cache 8 TB of Nexsan SATA Disk

75 Stanford Digital Repository
Managed care for digital objects of all genres & formats Serves several strategic needs Digital Library Institutional Repository Enterprise Repository A strategic development for research, teaching & learning Will provide a distinctive, competitive edge

76 斯坦福数字化存储 妥善管理所有类型和版式的数字组件 为多种重要需求服务 对研究、教学和学习具有重大意义 将是一种独特和具有竞争力的优势
数字化图书馆 公共事业机构存储 企业存储 对研究、教学和学习具有重大意义 将是一种独特和具有竞争力的优势

77

78 SDR对机构存储的获取方针 ——————————————————————————————————————

79 What is LOCKSS? 163 LOCKSS Libraries in 18 countries
Lots Of Copies Keep Stuff Safe Digital Preservation Infrastructure Decentralized, Peer to Peer, Continuous Audit & Repair Internet computers chattering away among themselves Open Source 163 LOCKSS Libraries in 18 countries

80 什么是 LOCKSS? “Lots Of Copies Keep Stuff Safe” 数字化储存设施
分散而非集中,同行间交流,持续检查与修复 网络电脑之间相互对话 开放源码 163个LOCKSS 图书馆分布于18个国家 网址:

81 Collection Title 1 Title 2 Patron LOCKSS box LOCKSS box LOCKSS Boxes

82 藏书 收藏 刊名 1 刊名2 用户 LOCKSS 存储盒 2 LOCKSS 存储盒 1 LOCKSS 存储盒 1,2

83 Preservation Title 1 Title 2 Patron LOCKSS box LOCKSS box LOCKSS Boxes

84 保存 刊名 1 刊名 2 用户 LOCKSS 存储盒 2 LOCKSS 存储盒 1 LOCKSS 存储盒 1,2

85 Prevents the publisher from revoking access rights to back content
Title 1 Title 2 Patron LOCKSS box LOCKSS box Prevents the publisher from revoking access rights to back content

86 获取 刊名 1 刊名 2 用户 LOCKSS 存储盒 2 LOCKSS 存储盒 1 防止出版商撤销读者获取回溯内容的权利

87 CLOCKSS Controlled LOCKSS Limited network of library caches
LOCKSS technology underlies CLOCKSS Shared governance model

88 CLOCKSS 受控管的LOCKSS 图书馆缓存的有限网络 LOCKSS技术加强了CLOCKSS功能 共享的控管模式

89 The CLOCKSS Prototype Two year demonstrator, ending in 2007
Public reports of progress & outcome Demonstration that this solution is credible for long term Proof of scalability for publisher content & library deployment Funded first by participants with recent grant support from NDIIPP (Library of Congress)

90 CLOCKSS 模式计划 两年的示范期于2007年结束 计划参与者以国会支持款项作为首批投资
公众对计划和结果的报告 示范长期实行该计划的可信性 证明出版内容和图书馆规模的可提升性 计划参与者以国会支持款项作为首批投资 网址:

91 CLOCKSS Participants CLOCKSS acting on behalf of wider community of libraries & publishers 7 Libraries distributed across tectonic plates 12 publishers, commercial & scholarly societies Numbers & types sufficient to cover the bases Commitment based on stewardship of libraries & responsibility of publishers

92 CLOCKSS 参与者 CLOCKSS代表众多图书馆和出版社 7个图书馆 12个出版社、企业及学术团体
参与者的数量和不同的形式足以涵盖全部的需要 图书馆管理和出版社的责任是实现承诺的基础

93 Libraries University of Edinburgh New York Public Library
Indiana University Rice University University of Virginia OCLC Stanford University NB: more to be added on more tectonic plates Libraries

94 爱丁堡大学 纽约公共图书馆 印第安那大学 莱斯大学 维吉尼亚大学 OCLC 斯坦福大学 注意: 将有更多的图书馆加入 参与的 图书馆

95 Publishers Blackwell Publishing Elsevier Nature Publishing Group
Oxford University Press SAGE Publications Springer Taylor and Francis John Wiley & Sons American Chemical Association American Medical Association American Physiological Society Institute of Physics NB: aim to add all the rest Publishers

96 参与的出版商 Blackwell Publishing Elsevier Nature Publishing Group
Oxford University Press SAGE Publications Springer Taylor and Francis John Wiley & Sons American Chemical Association American Medical Association American Physiological Society Institute of Physics NB: aim to add all the rest 参与的出版商

97 Equal Partners Librarians, with Publishers agreeing, retain stewardship role as society’s memory institutions Publishers have decided to trust & engage Libraries, committing to prospect of preservation for continuing access Both are exploring social & technical model in a 2 year test, working to build a full scale production system Costs are equally shared, with add’l funding from NDIIPP for audit & reporting

98 平等合作者 图书馆员承担社会记忆机构管理员的职责,出版商也同意此说法 出版商信任图书馆并与之合作,共同为不断的存取而致力于数字储存的工作
在两年的试行中,双方都在探索社会与技术模式,以建立全面的生产系统 费用是双方分担,另外还有NDIIPP提供的审计和报告资金

99 CLOCKSS Mission “CLOCKSS is a not-for-profit community partnership between publishers and libraries that is developing a distributed, validated, comprehensive archive that preserves and ensures continuing access to electronic scholarly content.”

100 CLOCKSS 的使命 “CLOCKSS是出版商和图书馆的非盈利合作伙伴,致力于建立可分发、正确及全面的数字化档案,以保证用户不断获取电子版学术期刊内容。”

101 CLOCKSS Governance Jointly governed by founding library & publisher partners Each partner represents an organization, but collectively sectors are represented University libraries & Public libraries Commercial publishers & scholarly societies No single point of failure or institutional interest will hinder long term governance Consensus driven, united for support of scholarly communication over the long term CLOCKSS seen as complimentary to national arrangements for legal deposit

102 CLOCKSS 管理 由提供资金的图书馆与出版商共同管理 每一个合作伙伴都代表各自的机构,但是又以行业合作为代表: 大学图书馆与公共图书馆
商业出版商与学术团体 长期管理计划不会因偶尔失误和某机构的利益而受到阻碍 以共识为动力,为支持长期的学术交流而联合 CLOCKSS被视为全国性部署下的合法储存之互补

103 LOCKSS/CLOCKSS Distributed preservation function
Caches authorized e-content for local caching Empowers libraries Inexpensive, easily implemented Flexible, open source application Expanding community of users Expanding community of uses

104 LOCKSS/CLOCKSS 被分布的资源保存之功能 为本机缓存获取授权的电子内容 获得授权的图书馆 廉价,易于实施 使用灵活,开放源码应用
用户不断增长 使用不断增加

105

106 介绍数字资源存储提倡者的国会图书馆网页

107 Other SULAIR Strategic Programs
Google Book Search & other digitization Development of “Bookless” Libraries CourseWork Sakai (Open Source) Course Management System Media Preservation Expanding the East Asia Library Expanding the Middle Eastern Collection

108 斯坦福大学图书馆其他重要计划 谷歌图书检索和其他数字化计划 建立“没有图书”的图书馆 课程安排 传媒资源的保存 东亚图书馆的扩展
Sakai (开放资源)课程管理系统 传媒资源的保存 东亚图书馆的扩展 中东收藏部的扩展

109 Download this presentation at
Thank you Download this presentation at

110 谢谢! 我的电邮信箱: Michael.Keller@Stanford.edu 本次讲演的内容可从以下网站下载


Download ppt "Research Libraries: Digital Intermediaries & Digital Archives -- Stanford’s plan, practice, & application Michael A. Keller University Librarian Director."

Similar presentations


Ads by Google