A Data Mining Algorithm for Generalized Web Prefetching

Slides:

Advertisements

Similar presentations

第10讲中共领导的民主革命与国共关系中国共产党领导的民主革命斗争，就是中共领导的新民主主义革命的历程。1921年到1949年，中国共产党领导全国人民，把马克思主义普遍真理同中国革命的具体实践及国情相结合，制定民主革命纲领，建立革命统一战线，走农村包围城市的道路。经过工农武装割据、抗日战争和人民解放战争，推翻了帝国主义、封建主义和官僚资本主义的反动统治，取得了新民主主义革命的伟大胜利。复习时注意中共在各个时期重大会议及国共关系的复习。

Advertisements

温故而知新：我国的国家性质是什么？人民民主专政的国体国家的一切权利属于人民决定我国政府是人民的政府.

2011年10月31日是一个令人警醒的日子,世界在10月31日迎来第70亿人口。当日凌晨,成为象征性的全球第70亿名成员之一的婴儿在菲律宾降生。？

初级会计实务第八章产品成本核算主讲人：杨菠.

第一课生活在人民当家作主的国家人民民主专政：本质是人民当家作主.

中考阅读复习备考交流西安铁一中分校向连吾.

自考英语二.

人工智能 Artificial Intelligence 第十一章

我国的宗教政策第七课第三框.

中央广播电视大学开放教育成本会计（补修）期末复习

第二单元动物生命活动的调节和免疫高等动物的内分泌系统与体液调节.

人教版义务教育课程标准实验教科书小学数学四年级上册第七单元《数学广角》合理安排时间 248.

-Artificial Neural Network- Hopfield Neural Network(HNN) 朝陽科技大學資訊管理系李麗華教授.

中考语文积累永宁县教研室步正军 2015．9.

小学数学知识讲座应用题.

Chapter 8 Liner Regression and Correlation 第八章直线回归和相关

倒装句之其他句式.

复习：诚实内涵诚实二个表现诚实意义 1、对自己要诚实2、对他人诚恳实在.

Leftmost Longest Regular Expression Matching in Reconfigurable Logic

Semantic-Synaptic Web Mining: A Novel Model for Improving the Web Mining 報告者：陳宜樺報告日期：2015/9/25.

IEEE TRANSACTIONS ON MAGNETICS, VOL. 49, NO. 3, MARCH 2013

THE PRINCIPLE OF ACCOUNTING

Some Effective Techniques for Naive Bayes Text Classification

Rate and Distortion Optimization for Reversible Data Hiding Using Multiple Histogram Shifting Source: IEEE Transactions On Cybernetics, Vol. 47, No. 2,February.

指導教授：許子衡教授報告學生：翁偉傑 Qiangyuan Yu , Geert Heijenk

Population proportion and sample proportion

浙江大学本科生《数据挖掘导论》课件第7课数据挖掘的高级主题徐从富，副教授浙江大学人工智能研究所.

第 22 课孙中山的民主追求 1 ．近代变法救国主张的失败教训： “师夷之长技以制夷”“中体西用”、兴办洋务、变法维新等的失败，使孙中山

計算方法設計與分析 Design and Analysis of Algorithms 唐傳義

Source: IEEE Access, vol. 5, pp , October 2017

Special Topics in Social Media Services 社會媒體服務專題

第8章關聯分析王海.

5.3 USE OF PREVIOUS RESEARCH

Data Mining 資料探勘 Introduction to Data Mining Min-Yuh Day 戴敏育

The Concept of Fuzzy Theory

Integrated decision support systems: A data warehousing perspective

ZEEV ZEITIN Delft University of Technology, Netherlands

基于类关联规则的分类 Classification Based on Class-Association Rules

研究經驗與趨勢分享黃悅民 Department of Engineering Science,

学术答辩课程题目姓名 | 班级 | 学号 | 专业 |.

A high payload data hiding scheme based on modified AMBTC technique

研究技巧與論文撰寫方法中央大學資管系陳彥良.

常見的巨量資料分析與應用楊立偉教授台大工管系暨商研所 2018.

Maintaining Frequent Itemsets over High-Speed Data Streams

Controllable and Trustworthy Blockchain-based Cloud Data Management

Learn Question Focus and Dependency Relations from Web Search Results for Question Classification 各位老師大家好,這是我今天要報告的論文題目,…… 那在題目上的括號是因為,前陣子我們有投airs的paper,那有reviewer對model的名稱產生意見.

Source: Journal of Network and Computer Applications, Vol. 125, No

主講人：陳鴻文副教授銘傳大學資訊傳播工程系所日期：3/13/2010

A Data Mining Algorithm for Generalized Web Prefetching

DeepPath 周天烁

題目：衛星遙測於水質監測之應用講者：中華大學土木工程學系陳莉教授時間：民國101年12月26日遙測緣起與發展

Distance Vector vs Link State

An Efficient MSB Prediction-based Method for High-capacity Reversible Data Hiding in Encrypted Images 基于有效MSB预测的加密图像大容量可逆数据隐藏方法。本文目的：做到既有较高的藏量（1bpp),

BiCuts: A fast packet classification algorithm using bit-level cutting

Efficient Query Relaxation for Complex Relationship Search on Graph Data 李舒馨

(二)盲信号分离.

钱炘祺一种面向实体浏览中属性融合的人机交互的设计与实现 Designing Human-Computer Interaction of Property Consolidation for Entity Browsing 钱炘祺

Speaker : YI-CHENG HUNG

Distance Vector vs Link State Routing Protocols

何正斌博士國立屏東科技大學工業管理研究所教授

Fast Image Dehazing Algorithm using Morphological Reconstruction

MGT 213 System Management Server的昨天，今天和明天

畢氏定理(百牛大祭)的故事張美玲製作資料來源：探索數學的故事（凡異出版社）.

簡單迴歸分析與相關分析莊文忠副教授世新大學行政管理學系計量分析一(莊文忠副教授) 2019/8/3.

WiFi is a powerful sensing medium

Gaussian Process Ruohua Shi Meeting

分類樹(Classification Tree)探討Baseball Data

102年人事預算編列說明邁向頂尖大學辦公室製作.

Hybrid fractal zerotree wavelet image coding

Presentation transcript:

A Data Mining Algorithm for Generalized Web Prefetching author: Alexandros Nanopoulos、 Dimitrios Katsaros、 Yannis Manolopoulos source: IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 15, NO. 5 year: OCTOBER. 2003 presented by C. W. Hsu

Outline Introduction BACKGROUND GENERALIZED ALGORITHM Experiment Result Conclusion

Introduction The Web has become the primary means for information dissemination. It is being used for commercial, entertainment, or educational purposes, and, thus, its popularity resulted in heavy traffic in the Internet

Web Data Traffic

The cache of web document

Cache problems

Transparent Informed Prefetching

Proposed architecture of a prediction-enabled Web server

Motivation Predictive Web prefetching Server - Cache Client - Cache 一般伺服器會將文件快取到主記憶中，但是目前也有很多人將快取資料直接傳送到客戶端。

Method Transparent Informed Prefetching DG：Dependency Graph PPM：Prediction by Partial Match WM：Web log Mining WMo：Web log Mining - Ordering

Dependency Graph First Markov Model ABCACBD and CCABCBCA

Prediction by Partial Match All-m-Order Markov model(Second Markov Model) ABCACBD and CCABCBCA

Pruning Criteria support-pruning confidence-pruning error-pruning

WMo The use of Web log mining methods for the discovery of association rules among user accesses. Association rule discovery algorithms represent candidates as sets of documents, which do not consider this ordering.

WMo C1 = {A, B} T = {B,C,A,D} S1 = {A}, S2={B} WMo={A,B}、{B、A} S1 = {A,B,C} S2 = {A,B,D} WMo = {A,B,C,D}、{A,B,D,C} 當C1 = {A,B}時，T={B,C,A,D}將無法有效的找到

Candidate generation procedure

PERFORMANCE RESULTS Usefulness Accuracy Network traffic

The Parameters for the Generator Mean value of the noise = 噪音的平均值 Variance of the noise = 噪音的變化 corProb = 最後一個節點時的移動到另一個節點的可能性 Noise = 分成兩種一種是開始移動的節點與隨機插入一個節點至交易中。

Experimental Evaluation Mean value of the noise = 噪音的平均值 Variance of the noise = 噪音的變化

Experimental Evaluation Mean value of the noise = 噪音的平均值 Variance of the noise = 噪音的變化

Experimental Evaluation Mean value of the noise = 噪音的平均值 Variance of the noise = 噪音的變化

Cache hits as a function of the cache size Mean value of the noise = 噪音的平均值 Variance of the noise = 噪音的變化

PERFORMANCE RESULTS http://ita.ee.lbl.gov/html/traces.html T = Traffic A = Accuracy U = Usefulness W = 5 => 統計的交易最高長度(代表一次鎖定的長度)

PERFORMANCE RESULTS WMo/wp = 沒有使用任何Pruning的技術

Discussion WMo prefetches more documents correctly than PPM and DG. Algorithm WM presents the worst performance because it does not consider

Discussion It is respect to the two factor: Order -> DG Noise -> PPM 實驗上指出PPM容易被Noise影響，反之DG和WM不容易造成影響。 PPM會因為Noise增加而影響正確性的部份。 DG容易因為order的因素而造成影嚮，但是PPM和WMo則不會。

END