M.Tech Digital Communications
MTDC 15
DATA COMPRESSION TECHNIQUES
Complete Study Material for Semester Examination
Beginner-Friendly | Theory + Formulas + Diagrams | Quick Revision
University Exam 100 Marks
Sessional 50 Marks
Duration 3 Hours
Instruction 3 Periods per week
UNIT I: Entropy Coding & Source Models
1.1 Why Data Compression?
Data compression removes REDUNDANCY from data to reduce its size. Example: The text
"AAAAAABBB" has redundancy — we can represent it as "6A3B", which is much shorter! Two types: • Lossless
compression: No data lost; perfectly reconstruct original (ZIP, PNG, FLAC) • Lossy compression: Some
data lost; can't perfectly reconstruct (JPEG, MP3, H.264)
1.2 Information Theory Basics
Information content of symbol s: I(s) = log2(1/p(s)) bits The RARER a symbol, the MORE
information it carries! Entropy H(X): Average information per symbol: H(X) = -Σ p(x) · log2(p(x))
bits/symbol Entropy = MINIMUM average code length achievable (Shannon's source coding theorem)
Example: Fair coin: H = -0.5·log2(0.5) - 0.5·log2(0.5) = 1 bit Biased coin (p=0.9 heads): H =
-0.9·log2(0.9) - 0.1·log2(0.1) = 0.469 bits
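The coin examples above can be verified with a short Python snippet (standard library only):

```python
import math

def entropy(probs):
    """Shannon entropy in bits/symbol: H = -sum p(x) * log2(p(x))."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

print(entropy([0.5, 0.5]))            # fair coin -> 1.0 bit
print(round(entropy([0.9, 0.1]), 3))  # biased coin -> 0.469 bits
```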
1.3 Huffman Coding
Huffman coding assigns SHORT codes to FREQUENT symbols and LONG codes to RARE symbols.
Construction algorithm: 1. Sort symbols by probability (low to high) 2. Combine two LOWEST
probability symbols into a node (sum their probabilities) 3. Repeat until only one node left (the root) 4.
Assign 0/1 to left/right branches 5. Code for each symbol = path from root to leaf Example: p(A)=0.5,
p(B)=0.25, p(C)=0.125, p(D)=0.125 A→0, B→10, C→110, D→111 Average length = 0.5×1 + 0.25×2 +
0.125×3 + 0.125×3 = 1.75 bits (Entropy = 1.75 bits ✓)
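The five construction steps can be sketched with Python's heapq module (a toy implementation; ties can give different, but equally optimal, code assignments):

```python
import heapq

def huffman_codes(probs):
    """Build Huffman codes from a {symbol: probability} dict."""
    # Each heap entry: (probability, tiebreak index, {symbol: code-so-far})
    heap = [(p, i, {s: ""}) for i, (s, p) in enumerate(probs.items())]
    heapq.heapify(heap)
    while len(heap) > 1:
        p1, _, c1 = heapq.heappop(heap)   # two LOWEST-probability nodes
        p2, i, c2 = heapq.heappop(heap)
        # merging prepends a 0 bit to one subtree and a 1 bit to the other
        merged = {s: "0" + c for s, c in c1.items()}
        merged.update({s: "1" + c for s, c in c2.items()})
        heapq.heappush(heap, (p1 + p2, i, merged))
    return heap[0][2]

probs = {"A": 0.5, "B": 0.25, "C": 0.125, "D": 0.125}
codes = huffman_codes(probs)
avg_len = sum(probs[s] * len(codes[s]) for s in probs)
print(codes, avg_len)  # code lengths 1/2/3/3 -> average 1.75 bits = entropy
```

Repeatedly merging the two lowest-probability nodes and prepending a bit to each side is exactly steps 2–5 of the construction above.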
1.4 Arithmetic Coding
Arithmetic coding encodes an ENTIRE message as a single number in [0,1). Process: 1. Start with interval
[0, 1) 2. For each symbol, subdivide current interval according to symbol probabilities 3. Output a number
within the final interval Advantage: Can get arbitrarily close to entropy (better than Huffman for short
messages or skewed probabilities) Used in: JPEG2000, H.264, HEVC video codecs.
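The interval-subdivision process can be sketched in a few lines (floating point is used here for clarity; practical coders use integer arithmetic with renormalization to avoid precision loss):

```python
def arithmetic_encode(message, probs):
    """Shrink [low, high) once per symbol according to cumulative probs."""
    cum, running = {}, 0.0          # cumulative lower bound per symbol
    for s, p in probs.items():
        cum[s] = running
        running += p
    low, high = 0.0, 1.0
    for s in message:
        width = high - low
        high = low + width * (cum[s] + probs[s])
        low = low + width * cum[s]
    return low, high                # any number in [low, high) decodes back

lo, hi = arithmetic_encode("AAB", {"A": 0.8, "B": 0.2})
print(lo, hi)  # interval width 0.8*0.8*0.2 = 0.128
```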
1.5 Run-Length Encoding (RLE)
RLE replaces RUNS of the same symbol with a (count, symbol) pair. Example: AAABBCCCC → (3,A)(2,B)(4,C)
Best for: Binary images (fax), simple graphics with large uniform areas. CCITT Group 3 fax uses modified
Huffman coding of run lengths.
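The run-length example above in Python, using itertools.groupby to find the runs:

```python
from itertools import groupby

def rle_encode(s):
    """Replace each run of identical symbols with a (count, symbol) pair."""
    return [(len(list(g)), ch) for ch, g in groupby(s)]

print(rle_encode("AAABBCCCC"))  # [(3, 'A'), (2, 'B'), (4, 'C')]
```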
1.6 Ziv-Lempel (LZ) Coding
LZ coding is a DICTIONARY-based method that finds repeated patterns. LZ77: Uses a sliding window as
dictionary. Encodes as (offset, length, next char). LZ78: Builds explicit dictionary. LZW (used in GIF,
TIFF): Modified LZ78 — starts with single characters in dictionary, adds phrases as we encode. GZIP uses
LZ77 + Huffman. It's one of the most widely used compression formats!
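A compact LZW encoder for byte-oriented input (a toy version; GIF/TIFF additionally manage variable code widths and dictionary resets):

```python
def lzw_encode(data):
    """LZW: start with single characters in the dictionary, add phrases."""
    dictionary = {chr(i): i for i in range(256)}   # all single bytes
    phrase, out = "", []
    for ch in data:
        if phrase + ch in dictionary:
            phrase += ch                            # extend current phrase
        else:
            out.append(dictionary[phrase])          # emit longest match
            dictionary[phrase + ch] = len(dictionary)  # learn new phrase
            phrase = ch
    if phrase:
        out.append(dictionary[phrase])
    return out

print(lzw_encode("ABABABA"))  # [65, 66, 256, 258] -> repeats hit the dictionary
```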
1.7 Waveform Characterization
Source models describe statistical properties: Stationary source: Statistics don't change with time
Ergodic source: Time average = ensemble average For quantization: Optimal quantizer
(Lloyd-Max): Decision boundaries and reconstruction levels that minimize MSE. Quantization SNR
for a uniform quantizer with R bits (full-scale sinusoid input): SNR ≈ 6.02R + 1.76 dB (increases ~6 dB per
bit of resolution — important formula!)
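The 6 dB/bit rule can be checked numerically by quantizing a full-scale sine wave (a rough empirical sketch):

```python
import math

def quantize_snr(R, n=100_000):
    """Measured SNR (dB) of a full-scale sine quantized with R bits."""
    step = 2.0 / (2 ** R)                # uniform levels spanning [-1, 1]
    sig_pow = err_pow = 0.0
    for i in range(n):
        x = math.sin(2 * math.pi * i / n)
        q = round(x / step) * step       # mid-tread uniform quantizer
        sig_pow += x * x
        err_pow += (x - q) ** 2
    return 10 * math.log10(sig_pow / err_pow)

for R in (8, 10):
    print(R, round(quantize_snr(R), 2), "theory:", round(6.02 * R + 1.76, 2))
```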
UNIT II: Predictive Coding
2.1 DPCM (Differential Pulse Code Modulation)
Instead of encoding the signal x(n) directly, encode the PREDICTION ERROR e(n) = x(n) - x_hat(n).
Since consecutive samples are correlated, the prediction error has SMALLER variance than the signal
→ needs fewer bits! Block diagram: x(n) minus prediction x_hat(n) → e(n) → [Quantizer] → e_hat(n) →
output; the quantized error is fed back through the predictor, so the encoder tracks exactly what the
decoder can reconstruct. At decoder (simple 1st-order prediction): x_hat(n) = x_hat(n-1) + e_hat(n)
DPCM gain: G_DPCM = σ²_x / σ²_e (ratio of signal variance to prediction error variance)
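A minimal DPCM loop with the 1st-order predictor described above (uniform quantizer; the step size is illustrative):

```python
def dpcm(signal, step=1.0):
    """DPCM with first-order prediction: x_hat(n) = previous reconstruction."""
    recon, prev = [], 0.0
    for x in signal:
        e = x - prev                      # prediction error
        e_hat = round(e / step) * step    # quantized error (uniform step)
        prev = prev + e_hat               # decoder-side reconstruction
        recon.append(prev)
    return recon

sig = [10.0, 10.5, 11.2, 11.0, 12.3]
print(dpcm(sig, step=0.5))  # reconstruction tracks the input within step/2
```

Because the predictor runs on the *reconstructed* samples, quantization error never accumulates: each output stays within half a step of the input.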
2.2 ADPCM (Adaptive DPCM)
The predictor and/or quantizer step size ADAPTS to the signal statistics. Adaptive quantizer: Step size
∆(n) adjusts based on recent error magnitudes. Large errors → increase ∆ (coarser quantization for fast
changes) Small errors → decrease ∆ (finer quantization for slow changes) ADPCM is used in: • G.726 (ITU
standard): 32 kb/s voice coding (telephone quality) • simple audio compression that brings near-CD-quality audio down to lower bit rates
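A toy illustration of step-size adaptation (the multiplier rules below are invented for the sketch and are NOT the actual G.726 algorithm):

```python
def adaptive_step_dpcm(signal, step=0.5):
    """DPCM whose quantizer step grows on large errors, shrinks on small."""
    recon, prev = [], 0.0
    for x in signal:
        e = x - prev
        code = max(-4, min(4, round(e / step)))   # 9-level quantizer index
        prev += code * step                       # reconstruction
        recon.append(prev)
        # adapt: big codes -> coarser step, small codes -> finer step
        if abs(code) >= 3:
            step *= 2.0
        elif abs(code) <= 1:
            step *= 0.9
        step = max(step, 1e-3)                    # keep step positive
    return recon

print(adaptive_step_dpcm([1.0, 1.0, 1.0, 3.0, 3.0]))
```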
2.3 Motion Compensated Prediction for Video
Video frames are HIGHLY correlated — consecutive frames are very similar! Motion Estimation: Find
where each block in current frame came from in previous frame. Block matching: Move a block around in
reference frame to find best match. Motion Vector: (dx, dy) displacement of the block Types of frames: •
I-frame (Intra): Independent, no prediction (like JPEG image) • P-frame (Predicted): Coded as difference
from previous frame using motion vectors • B-frame (Bidirectional): Coded from both previous AND future
frames GOP (Group of Pictures): a repeating pattern such as I-B-B-P-B-B-P-... in MPEG
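Full-search block matching with a SAD (sum of absolute differences) cost, as a sketch (real encoders use fast search patterns such as diamond or hexagon search):

```python
def best_motion_vector(ref, cur, bx, by, bsize, radius):
    """Full search: minimize SAD over a +/-radius displacement window."""
    def sad(dx, dy):
        return sum(abs(cur[by + i][bx + j] - ref[by + dy + i][bx + dx + j])
                   for i in range(bsize) for j in range(bsize))
    best = None
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            # skip candidates that fall outside the reference frame
            if (0 <= by + dy and by + dy + bsize <= len(ref)
                    and 0 <= bx + dx and bx + dx + bsize <= len(ref[0])):
                cost = sad(dx, dy)
                if best is None or cost < best[0]:
                    best = (cost, dx, dy)
    return best[1], best[2]   # motion vector (dx, dy)

# frame content shifted down-right by one pixel -> MV points back by (-1, -1)
ref = [[i * 8 + j for j in range(8)] for i in range(8)]
cur = [[ref[i - 1][j - 1] if i > 0 and j > 0 else 0 for j in range(8)]
       for i in range(8)]
print(best_motion_vector(ref, cur, 2, 2, 2, 2))  # -> (-1, -1)
```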
UNIT III: Transform Coding
3.1 Transform Coding Concept
Transform coding converts signal to a TRANSFORM DOMAIN where energy is CONCENTRATED in few
coefficients. Steps: 1. Divide signal into blocks (typically 8×8 for images) 2. Apply transform (DCT, DFT,
Wavelet) 3. Most transform coefficients are near zero → quantize coarsely (or set to zero) 4. Only
transmit/store NON-ZERO coefficients Key: Transform DECORRELATES the data. Karhunen-Loeve
Transform (KLT) is theoretically optimal but requires knowing signal statistics. DCT ≈ KLT for natural
images (that's why JPEG uses DCT!)
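A tiny numeric demonstration of decorrelation and energy compaction, using a 2-point average/difference transform (the simplest orthonormal transform) on a smooth signal:

```python
import math

# A slowly varying signal: neighboring samples are highly correlated.
signal = [math.sin(0.1 * n) for n in range(1000)]

# 2-point orthonormal transform: average (lowpass) + difference (highpass).
avg  = [(signal[2*i] + signal[2*i + 1]) / math.sqrt(2) for i in range(500)]
diff = [(signal[2*i] - signal[2*i + 1]) / math.sqrt(2) for i in range(500)]

def energy(v):
    return sum(x * x for x in v)

# Nearly all of the energy lands in the average channel; the difference
# channel is almost empty and can be quantized coarsely or zeroed.
print(energy(avg) / energy(signal), energy(diff) / energy(signal))
```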
3.2 DCT (Discrete Cosine Transform)
DCT-II (the most common form, 'the DCT'): X(k) = (2/N)^0.5 · c(k) · Σ x(n) · cos(π·k·(2n+1)/(2N)), n = 0..N-1, for k=0,1,...,N-1, where c(0) = 1/√2 and c(k) = 1 for k ≥ 1.
Properties: • Real-valued (unlike DFT which is complex) • Energy compaction: Most energy in first few
coefficients • Used in JPEG, MPEG, MP3, H.264, H.265 JPEG compression process: 1. Convert
RGB → YCbCr (chroma subsampling, typically 4:2:0 or 4:2:2) 2. Divide into 8×8 blocks 3. Apply DCT to each
block 4. Quantize coefficients (larger step size → more compression → lower quality) 5. Zigzag scan
→ Run-length encode → Huffman code
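The 1D DCT-II implemented directly from its formula (unoptimized O(N²); real codecs use fast factorizations):

```python
import math

def dct2(x):
    """DCT-II: X(k) = sqrt(2/N) * c(k) * sum x(n) cos(pi k (2n+1) / 2N)."""
    N = len(x)
    out = []
    for k in range(N):
        c = 1 / math.sqrt(2) if k == 0 else 1.0
        s = sum(x[n] * math.cos(math.pi * k * (2 * n + 1) / (2 * N))
                for n in range(N))
        out.append(math.sqrt(2 / N) * c * s)
    return out

# A smooth 8-sample block: energy piles into the lowest coefficients.
block = [100, 102, 104, 105, 107, 108, 110, 111]
print([round(v, 1) for v in dct2(block)])  # DC term dominates
```

This normalization is orthonormal, so the transform preserves total energy (Parseval), which is why discarding small coefficients causes only small reconstruction error.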
3.3 Wavelet-Based Compression
Wavelets give MULTI-RESOLUTION decomposition — great for images with edges. 1D DWT: Split into
approximation (lowpass) and detail (highpass) bands using filter banks. 2D for images: Apply to rows then
columns → 4 subbands (LL, LH, HL, HH). JPEG 2000 uses wavelets (Daubechies 9/7 for lossy, 5/3 for lossless). Advantages over
JPEG: No blocking artifacts, progressive transmission, better quality at same bitrate.
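One level of the 2D subband split can be sketched with the Haar wavelet, the simplest filter bank (JPEG 2000's 9/7 and 5/3 filters follow the same rows-then-columns pattern; subband naming conventions vary):

```python
import math

def haar_1d(x):
    """One Haar analysis step: lowpass average + highpass difference."""
    h = math.sqrt(2)
    return ([(x[2*i] + x[2*i + 1]) / h for i in range(len(x) // 2)],
            [(x[2*i] - x[2*i + 1]) / h for i in range(len(x) // 2)])

def haar_2d(img):
    """Filter rows, then columns -> four subbands LL, LH, HL, HH."""
    lo_rows, hi_rows = zip(*[haar_1d(row) for row in img])
    def split_cols(rows):
        cols = list(zip(*rows))                    # transpose
        lo, hi = zip(*[haar_1d(list(c)) for c in cols])
        return [list(r) for r in zip(*lo)], [list(r) for r in zip(*hi)]
    LL, LH = split_cols(lo_rows)
    HL, HH = split_cols(hi_rows)
    return LL, LH, HL, HH

# A flat image: all information ends up in LL, the detail bands are zero.
LL, LH, HL, HH = haar_2d([[1.0] * 4 for _ in range(4)])
print(LL, LH)
```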
UNIT IV: Digital Broadcasting Standards
4.1 Vector Quantization (VQ)
Instead of quantizing one sample at a time (scalar quantization), VQ quantizes VECTORS (groups of
samples) together. Codebook: Set of representative vectors (codewords) {c1, c2, ..., cN} Encoding:
For each input vector x, find the NEAREST codeword ci. Output: Just the INDEX of the nearest
codeword (log2(N) bits) LBG Algorithm (Linde-Buzo-Gray): Design optimal codebook by k-means
clustering. Advantage: Better compression than scalar quantization (can exploit correlations between
samples)
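Nearest-codeword encoding with a small hypothetical 2D codebook (the LBG codebook-design step is omitted; only the encoding search is shown):

```python
def vq_encode(vectors, codebook):
    """Map each input vector to the index of its nearest codeword."""
    def dist2(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))  # squared distance
    return [min(range(len(codebook)), key=lambda i: dist2(v, codebook[i]))
            for v in vectors]

codebook = [(0, 0), (10, 10), (0, 10)]   # N=3 codewords -> log2(3) bits each
data = [(1, 2), (9, 11), (-1, 8)]
print(vq_encode(data, codebook))  # -> [0, 1, 2]
```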
4.2 Fractal Image Compression
Fractal compression exploits SELF-SIMILARITY in images — parts of image look like scaled, transformed
versions of other parts. Process: 1. Divide image into small 'range' blocks (e.g., 4×4) 2. Search for larger
'domain' blocks that resemble each range block (after scaling) 3. Store the affine transform (scale, rotation,
offset) instead of pixels Advantage: Very high compression ratios, resolution-independent Disadvantage:
Very slow encoding time
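A 1D sketch of the range/domain search with a least-squares fit of the affine map (real coders work on 2D blocks, try rotations/flips, and clamp |s| < 1 for contractivity):

```python
def best_fractal_map(signal, r_start, r_size):
    """For one 'range' block, find the domain block and affine map (s, o)
    minimizing error; domains are twice the range size, then contracted."""
    rng = signal[r_start:r_start + r_size]
    best = None
    for d_start in range(0, len(signal) - 2 * r_size + 1):
        # contract the domain block by pairwise averaging
        dom = [(signal[d_start + 2*i] + signal[d_start + 2*i + 1]) / 2
               for i in range(r_size)]
        # least-squares fit: rng ~= s * dom + o
        n = r_size
        md, mr = sum(dom) / n, sum(rng) / n
        var = sum((d - md) ** 2 for d in dom)
        s = (sum((d - md) * (r - mr) for d, r in zip(dom, rng)) / var
             if var > 1e-12 else 0.0)
        o = mr - s * md
        err = sum((s * d + o - r) ** 2 for d, r in zip(dom, rng))
        if best is None or err < best[0]:
            best = (err, d_start, s, o)
    return best   # store (d_start, s, o) instead of the pixel values

print(best_fractal_map([0, 2, 4, 6, 1, 5], 4, 2))  # exact match: err = 0
```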
4.3 Digital Broadcasting Standards
MPEG (Moving Pictures Experts Group): • MPEG-1: VCD quality (~1.5 Mb/s), MP3 audio • MPEG-2:
DVD, digital TV (DVB, ATSC), up to 15 Mb/s • MPEG-4: Streaming, mobile video; includes H.264 (AVC) •
H.265/HEVC: 2x better compression than H.264, used in 4K streaming • H.266/VVC: Latest standard, 2x
better than HEVC Audio: • MP3 (MPEG-1 Audio Layer III): 128 kb/s for near-CD quality • AAC: Better than
MP3 at same bitrate (iTunes, YouTube, mobile) • Dolby AC-3: 5.1 surround sound for cinema/DVD Key
standards for digital broadcasting: DVB (Europe), ATSC (USA), ISDB (Japan)
QUICK REVISION TABLE
Topic | Key Points | Formula/Keyword
Entropy | Min avg code length | H(X) = -Σ p(x)·log2(p(x))
Huffman | Short code = frequent symbol | Optimal prefix-free code
Arithmetic coding | Encode whole msg as a number | Closer to entropy than Huffman
LZ77/LZW | Dictionary-based; sliding window | Used in GZIP, GIF
DPCM | Code prediction error; smaller variance | e(n) = x(n) - x_hat(n)
Motion compensation | I/P/B frames; motion vectors | Used in MPEG/H.264/H.265
DCT | Energy compaction; real-valued | Used in JPEG, MP3, H.264
Quantization SNR | ~6 dB gain per bit | SNR ≈ 6.02R + 1.76 dB
JPEG 2000 | Wavelet-based; no blocking | Daubechies 9/7 wavelet
MPEG-4/H.264 | Video streaming standard | AVC = Advanced Video Coding