VII. Data Compression (A)

Slides:

Advertisements

Similar presentations

对本书、视频等任何 MATLAB 问题，作者做到有问必答！你买的不仅仅是书，更是一种 “ 有问必答 ” 的服务！

Advertisements

663 Chapter 14 Integral Transform Method Integral transform 可以表示成如下的積分式的 transform  kernel Laplace transform is one of the integral transform 本章討論的 integral.

1 第一章：绪论什么是信源编码？为什么要信源编码 / 数据压缩？为什么可以信源编码 / 数据压缩？怎样进行信源编码？

Final Review Chapter 1 Discrete-time signal and system 1. 模拟信号数字化过程的原理框图使用 ADC 变换器对连续信号进行采样的过程使用 ADC 变换器对连续信号进行采样的过程 x(t) Analog.

第三章图像处理技术第三章多媒体图像处理技术.

北京大学数字视频编解码技术国家工程实验室 AVS标准工作组，AVS产业技术创新战略联盟

第二章多媒体数据压缩编码技术.

6.1 概述 6.2 信源编码与压缩技术 6.3 信道编码与调制技术

第十章　图像的频域变换.

Outline Image Compression Image Understanding

多媒体通信技术主讲教师：黄玉兰　　　　　　　　　　　　　　　　学时：16.

第八章多媒体技术基础.

陆哲明博士、教授哈尔滨工业大学自动化测试与控制研究所哈尔滨工业大学信息对抗技术研究所

XI. Hilbert Huang Transform (HHT)

Blind dual watermarking for color images’ authentication and copyright protection Source : IEEE Transactions on Circuits and Systems for Video Technology.

3-3 Modeling with Systems of DEs

Time Frequency Analysis and Wavelet Transforms Oral Presentation

Applications of Digital Signal Processing

Rate and Distortion Optimization for Reversible Data Hiding Using Multiple Histogram Shifting Source: IEEE Transactions On Cybernetics, Vol. 47, No. 2,February.

XV. Applications of Wavelet Transforms

V. Homomorphic Signal Processing

XVI. Applications of Wavelet Transforms

JPEG图像编码标准 §3.4 内容提要本节主要介绍JPEG图像压缩编码算法(DCT变换算法)、图像数据文件格式 (JFIF，JPEG File Interchange Format)。最后，对JPEG 2000进行一个简单的介绍。 JPEG.

Differential Equations (DE)

第十一章影像與視訊壓縮.

第九章影像壓縮.

視訊串流\Streaming Video Part-2-3 Compression Digital image/video

數位典藏之數位影像處理技術探討雲端上的寶藏~ 國立新港藝術高中蘇淵源.

電腦數位音樂介紹 11組電機三陳俊傑吳岳庭.

Mpeg Family 簡介第六組 B 呂孟庭 B 廖彥鈞.

視訊串流\Streaming Video Part-1 Multimedia on Computer Digital

淺談視訊壓縮技術陳宏昇楊凱超.

影像篡改之偵測與定位（運用密碼與編碼技術）

II. Short-time Fourier Transform

數位影像壓縮技術簡介第四組陳孝賢.

聲轉電信號.

Randomized Algorithms

VI. Brief Introduction for Acoustics

第十章轉換編碼視轉換為座標軸之旋轉視轉換為基底函數之分解影像轉換轉換編碼之方法 JPEG DCT 演算法 JPEG DCT 之結果

Source: IEEE Transactions on Image Processing, Vol. 25, pp ,

一般論文的格式註：這裡指的是一般 journal papers 和 conference papers 的格式。

第8章 DCT与JPEG编码 JPEG（Joint Photographic Experts Group联合图象专家组）是（ITU的前身）国际电话与电报咨询委员会CCITT与ISO于1986年联合成立的一个小组，负责制定静态图像的编码标准 1992年9月JPEG推出了ISO/IEC 10918标准(CCITT.

信息隐藏主讲教师：余艳玮 /2/5 数字媒体包括了图像、文字以及音频、视频等各种形式，以及传播形式和传播内容中采用数字化，即信息的采集、存取、加工和分发的数字化过程。数字媒体已经成为继语言、文字和电子技术之后的最新的信息载体。

A high payload data hiding scheme based on modified AMBTC technique

Advanced Digital Signal Processing 高等數位訊號處理

信息隐藏技术与应用第八章数字水印的评价理论和测试基准

第三章付里叶分析离散付氏级数的数学解释(The Mathematical Explanation of DFS)

CH6 Pairs Selection in Equity Markets

VIDEO COMPRESSION & MPEG

數位浮水印技術及其應用.

XIV. Orthogonal Transform and Multiplexing

图像DCT变换《信息隐藏实验教程》教学幻灯片五.

图像压缩标准JPEG.

数字水印技术算法研究曹锋付晨陈阳 cs.nju.

An Efficient MSB Prediction-based Method for High-capacity Reversible Data Hiding in Encrypted Images 基于有效MSB预测的加密图像大容量可逆数据隐藏方法。本文目的：做到既有较高的藏量（1bpp),

信号与图像处理基础 Image Compression 中国科技大学自动化系曹洋.

96學年度第二學期電機系教學助理課後輔導進度表（三）(查堂重點)

(二)盲信号分离.

第3章数字编码 3.1 信源编码 3.2 信道容量 3.3 差错控制编码 3.4 几种差错控制编码简介 3.5 数字压缩编码

More About Auto-encoder

醫工所碩士二年級 R 葉昱甫電子所碩士一年級 R 謝博鈞電信所碩士一年級 R 王欣平

Reversible Data Hiding in Color Image with Grayscale Invariance

II. Short-time Fourier Transform

語音特徵擷取之資料相關線性特徵轉換研究生：張志豪多酌墨在數學式的物理意義及精神。老師、各位口試委員、各位同學大家好。

第一章 JPEG介紹.

Gaussian Process Ruohua Shi Meeting

Hybrid fractal zerotree wavelet image coding

Presentation transcript:

VII. Data Compression (A) 壓縮的通則：利用資料的一致性資料越一致的資料，越能夠進行壓縮 [References] I. Bocharova, Compression for Multimedia, Cambridge, UK, Cambridge University Press, 2010. 酒井善則，吉田俊之原著，原島博監修，白執善編譯，“影像壓縮術” ，全華印行, 2004. 戴顯權，“資料壓縮 Data Compression,” 旗標出版社, 2007. D. Salomon, Introduction to Data Compression, Springer, 3rd ed., New York , 2004.

 7-A 壓縮的哲學： 244 (1) 利用資料的一致性，規則性，與可預測性 (exploit redundancies and predictability, find the compact or sparse representation) (2) 通常而言，若可以用比較精簡的自然語言來描述一個東西，那麼也就越能夠對這個東西作壓縮 Q: 最古老的壓縮技術是什麼？ (3) 資料越一致，代表統計特性越集中包括 Fourier transform domain, histogram, eigenvalue ……….. 等方面的集中度

Compression technique 245 Data type Compression technique Compression rate Audio Image Video

246 思考：如何對以下的資料作壓縮 Article: Song: Voice: Cartoon: Compression: Original signal Compact representation + residual information

 7-B Compression for Images  影像的「一致性」： Space domain: 每一點的值，會和相鄰的點的值非常接近 F[m, n+1]  F[m, n], F[m+1, n]  F[m, n] Frequency domain: 大多集中在低頻的地方。

Lena Image 在 space domain 上的一致性 248 Lena Image 在 space domain 上的一致性 (horizontal difference) (vertical difference)

249 Histogram: 一個 vector 或一個 matrix 當中，有多少點會等於某一個值例如：x[n] = [1 2 3 4 4 5 5 3 5 5 4] 則 x[n] 的 histogram 為 h[1] = 1, h[2] = 1, h[3] = 2, h[4] = 3, h[5] = 4

250 Lena Image 頻譜 (frequency domain) 的一致性 L[m, n] |fft2(L[m, n])| (用亮度來代表 amplitude) p q

251 影像的「頻率」：frequency in the space domain 從 m = 0 至 m = M-1 之間有 p 個週期 p = 5 larger p : more variation in the space domain

Process of JPEG Image Compression 252  7.C JPEG Standard Process of JPEG Image Compression Image 88 DCT AC係數 Zigzag Scan Huffman Coding JPEG file 4:2:2 or 4:2:0 量子化 8 × 8 (切成blocks) DC係數差分編碼 Huffman Coding 量子化表檔頭主要用到四個技術：(1) 4:2:2 or 4:2:0 (和 space domain 的一致性相關) (2) 8  8 DCT (和 frequency domain 的一致性相關) (3) 差分編碼 (和 space domain 的一致性相關) (4) Huffman coding (和 lossless 編碼技術相關)

253 JPEG：影像編碼的國際標準全名： Joint Photographic Experts Group JPEG 官方網站： http://www.jpeg.org/ 參考論文：G. K. Wallace, “The JPEG still picture compression standard,” IEEE Transactions on Consumer Electronics, vol. 38, issue 1, pp. 18-34, 1992. JPEG 的 FAQ 網站： http://www.faqs.org/faqs/jpeg-faq/ JPEG 的免費 C 語言程式碼： http://opensource.apple.com/source/WebCore/WebCore-1C25/platform/image-decoders/jpeg/ 一般的彩色影像，可以壓縮 12~20 偣。簡單的影像甚至可以壓縮超過 20 倍。

254  壓縮的技術分成兩種 lossy compression techniques 無法完全重建原來的資料 Examples: DFT, DCT, KLT (with quantization and truncation), 4:2:2 or 4:2:0, polynomial approximation 壓縮率較高 lossless compression techniques 可以完全重建原來的資料 Examples: binary coding, Huffman coding, arithmetic coding, Golomb coding 壓縮率較低

 7-D 4:2:2 and 4:2:0 255 R: red, G: green, B: blue Y: 亮度, Cb: 0.565(BY), Cr: 0.713(RY), 4 : 4 : 4 4 : 2 : 2 4 : 2 : 0 N N Y Y M M Y N/2 N N M/2 Cb Cb M/2 Cb M N/2 N N M/2 Cr M/2 Cr M Cr

256 24 bits/pixel  16 bits/pixel  12 bits/pixel 同樣使資料量省一半的(b)(d)圖，(d)圖和原來差不多，然而(b)圖邊緣會有失真現象。還原時，用 interpolation 的方式

257 原圖直接在縱軸取一半的pixels 再還原 (a) (b)

258 4 : 2: 2 4 : 2: 0 (c) (d)

 7-E Lossy Compression Techniques -- KLT 259  7-E Lossy Compression Techniques -- KLT 複習：DFT 的優缺點 Karhunen-Loeve Transform (KLT) (similar to Principal component analysis (PCA)) 經過轉換後，能夠將影像的能量分佈變得最為集中 It is optimal, but dependent on the input  1-D Case K[u, n] = en[u] (K = [e0, e1, e2, ….., eN1] en 為 covariance matrix C 的 eigenvector mean C[m, n] = corr(x[m], x[n]) = Note: corr代表correlation

260 KLT 的理論基礎：經過 KLT 之後，當 u1  u2 時，X[u1] 和 X[u2] 之間的 correlation 必需近於零 (即 decorrelation) 即 corr(X[u1], X[u2]) 所以 Since if for all u The above equation can be simplified as:

261 Note that is the (u1, u2)th entry of E{XXT} where Since where K is the KLT matrix where C is the covariance matrix and corr(x[m], x[n]) = To make when u1  u2 should be a diagonal matrix Therefore, the KLT transform matrix K should diagonalize C. That is, the columns of K are the eigenvectors of C.

262  2-D Case KLT 缺點: dependent on image (不實際，需要一併記錄 transform matrix) Reference W. D. Ray and R. M. Driver, “Further decomposition of the Karhunen-Loeve series representation of a stationary random process,” IEEE Trans. Inf. Theory, vol. 16, no. 6, pp. 663-668, Nov. 1970.

 7-F Lossy Compression Techniques -- DCT 263  7-F Lossy Compression Techniques -- DCT Suboptimal, but independent of the input  DCT: Discrete Cosine Transform C[0] = , C[u] = 1 for u  0 IDCT: inverse discrete cosine transform 對於大部分的影像而言， DCT 能夠近似 KLT (near optimal) 尤其是當 corr{f[m, n], f[m+, n+]} =  ||  || ,   1 時有 fast algorithm Advantage: (1) independent of the input (2) near optimal (3) real output

264 DFT for Lena image DCT for Lena image Comparing with the DFT: (1) 能量更為集中 (2) Real output (3) 一樣都有 fast algorithm

左圖：將 DFT，DCT 各點能量(開根號)由大到小排序右圖：累積能量 265 左圖：將 DFT，DCT 各點能量(開根號)由大到小排序右圖：累積能量 DCT output k k Energy concentration at low frequencies: KLT > DCT > DFT

266 通常，我們將影像切成 8  8 的方格作DCT Why: N image 8 x 8 方格 M

267 References [1] N. Ahmed, T. Natarajan, and K. R. Rao, “Discrete cosine transform,” IEEE Trans. Comput., vol. C-23, pp. 90-93, Jan 1974. [2] K. R. Rao and P. Yip, Discrete Cosine Transform, Algorithms, Advantage, Applications, New York: Academic, 1990.

附錄八：量測方法的精確度常用的指標 268 方法判斷為真事實上為真 TN FN TP FP TP (true positive): 事實上為真，而且被我們的方法判斷為真的情形 FN (false negative): 事實上為真，卻未我們的方法被判斷為真的情形 FP (false positive): 事實上不為真，卻被我們的方法誤判為真的情形 TN (true negative): 事實上不為真，而且被我們的方法判斷成不為真的情形

269 (positive prediction rate) 以抓犯人為例，TP 是有罪而且被抓到的情形，FP是無罪但被誤抓的情形，FN 是有罪但沒被抓到的情形，TN 是無罪且未被誤逮的情形寧可錯抓一百，也不可放過一個 recall 高，但 precision 低寧可錯放一百，也不可冤枉一個 precision 高，但 recall 低

270 Accuracy Detection error rate F-score General form of the F-score