IT News - How video compression works 1/5

BDTI explains how video codecs like MPEG-4 and H.264 work, and how they differ from one another. It also explains the demands codecs make on processors.

By BDTI
[For a closer look at H.264, see H.264: the codec to watch]

Digital video compression and decompression algorithms (codecs) are at the heart of many modern video products, from DVD players to multimedia jukeboxes to video-capable cell phones. Understanding the operation of video compression algorithms is essential for developers of the systems, processors, and tools that target video applications. In this article, we explain the operation and characteristics of video codecs and the demands codecs make on processors. We also explain how codecs differ from one another and the significance of these differences.

Starting with stills

Because video clips are made up of sequences of individual images, or "frames," video compression algorithms share many concepts and techniques with still-image compression algorithms. Therefore, we begin our exploration of video compression by discussing the inner workings of transform-based still image compression algorithms such as JPEG, which are illustrated in Figure 1.

The image compression techniques used in JPEG and in most video compression algorithms are "lossy." That is, the original uncompressed image can't be perfectly reconstructed from the compressed data, so some information from the original image is lost. The goal of using lossy compression is to minimize the number of bits that are consumed by the image while making sure that the differences between the original (uncompressed) image and the reconstructed image are not perceptible—or at least not objectionable—to the human eye.

Switching to frequency
The first step in JPEG and similar image compression algorithms is to divide the image into small blocks and transform each block into a frequency-domain representation. Typically, this step uses a discrete cosine transform (DCT) on blocks that are eight pixels wide by eight pixels high. Thus, the DCT operates on 64 input pixels and yields 64 frequency-domain coefficients, as shown in Figure 2. (Transforms other than DCT and block sizes other than eight by eight pixels are used in some algorithms. For simplicity, we discuss only the 8x8 DCT in this article.)

The DCT itself is not lossy; that is, an inverse DCT (IDCT) could be used to perfectly reconstruct the original 64 pixels from the DCT coefficients. The transform is used to facilitate frequency-based compression techniques. The human eye is more sensitive to the information contained in low frequencies (corresponding to large features in the image) than to the information contained in high frequencies (corresponding to small features). Therefore, the DCT helps separate the more perceptually significant information from less perceptually significant information. After the DCT, the compression algorithm encodes the low-frequency DCT coefficients with high precision, but uses fewer bits to encode the high-frequency coefficients. In the decoding algorithm, an IDCT transforms the imperfectly coded coefficients back into an 8x8 block of pixels.

A single two-dimensional eight-by-eight DCT or IDCT requires a few hundred instruction cycles on a typical DSP such as the Texas Instruments TMS320C55x. Video compression algorithms often perform a vast number of DCTs and/or IDCTs per second. For example, an MPEG-4 video decoder operating at VGA (640x480) resolution and a frame rate of 30 frames per second (fps) would require roughly 216,000 8x8 IDCTs per second, depending on the video content. In older video codecs these IDCT computations could consume as many as 30% of the processor cycles. In newer, more demanding codec algorithms such as H.264, however, the inverse transform (which is often a different transform than the IDCT) takes only a few percent of the decoder cycles.

Because the DCT and other transforms operate on small image blocks, the memory requirements of these functions are typically negligible compared to the size of frame buffers and other data in image and video compression applications.

No.	Subject	Author	Date	Views
67	7 Best Music Players for Mac You Should Try	엉뚱도마뱀	2017.12.07	601
66	CISCO Router 설정 팁 - QOS 1	digipine	2017.11.03	1817
65	CM6206 High Integrated USB Audio I/O Controller Windows Driver and PDF	digipine	2022.02.02	5656
64	Cubism Live2D Demo Windows D3D9	lizard2019	2023.01.11	258
63	DaVinci 에 대한 소개글	digipine	2017.11.03	112
62	Facebook 게시물을 검색하여 자살 암시를 발견하는 AI 서비스 시작 1	엉뚱도마뱀	2017.11.29	512
61	GA-P55A-UD3R rev 2.0 / GT 240 OSX 스노우 레파드 해킨가이드	digipine	2017.11.03	754
60	Galanz Glanz BCD-467WTDH Screen-Controlled eco-smart refrigerator	lizard2019	2020.05.19	55941
59	GDPR 유럽 일반 개인 정보보호법 시행	엉뚱도마뱀	2018.05.24	879
58	HIGH QUALITY MOBILE EXPERIENCE (HQME)	digipine	2017.11.03	339
57	High Quality Mobile Experience (HQME) IEEE P2200TM	lizard2019	2020.02.18	1444
»	How video compression works 1/5	digipine	2017.11.02	1596
55	How video compression works 2/5 - Quantization, coding, and prediction	digipine	2017.11.02	5991
54	How video compression works 3/5 - Color and motion	digipine	2017.11.02	1595
53	How video compression works 4/5 - Encoder secrets and motion compensation	digipine	2017.11.02	340
52	How video compression works 5/5 - Deblocking, deringing, and color space conversion	digipine	2017.11.02	23718
51	iOS 14.3 업데이트 릴리즈	digipine	2020.12.16	266
50	IoT 악성 코드 "Mirai"변종이 급증 주의	엉뚱도마뱀	2017.11.28	344
49	iPhone SE 2 2018년 상반기에 출시?	digipine	2017.12.11	540
48	iPhone X의 Face ID RAW 데이터 영상 분석 및 은행권 앱 동향	엉뚱도마뱀	2017.11.27	669

How video compression works 1/5

Shortcut

Shortcut