Unit 6 Lab 2: Data Representation and Compression, Page 8

EK 3.3.1C There are trade-offs in using lossy and lossless compression techniques for storing and transmitting data.
EK 3.3.1D Lossless data compression reduces the number of bits stored or transmitted but allows complete reconstruction of the original data.
EK 3.3.1E Lossy data compression can significantly reduce the number of bits stored or transmitted at the cost of being able to reconstruct only an approximation of the original data.

This picture of the BJC logo (shown right) is 158 pixels wide and 186 pixels tall, for a total of 29,388 pixels. The BMP (bitmap) format includes each of those pixels in the picture file, at four bytes per pixel, so the file size is about 120 kB.
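The arithmetic behind that estimate can be checked with a few lines of Python. This is only a sketch: the variable names are made up for illustration, and it ignores the small header that a real BMP file also stores.

```python
# Dimensions and bytes per pixel are taken from the paragraph above.
width, height = 158, 186
bytes_per_pixel = 4

pixel_count = width * height
raw_bytes = pixel_count * bytes_per_pixel

print(pixel_count)  # 29388
print(raw_bytes)    # 117552 bytes, which rounds to about 120 kB
```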

Lossless Compression

Storing every pixel individually is an inefficient way to store the information. Think about the 158 pixels in the top row. The first 60 or so are white. Then come five pixels of yellowish orange (the top slice of the "b"). And the rest of that row is white.

Instead of storing all 158 pixels individually, we could compress them with run-length encoding and just store six values (three numbers and three colors):

pixel count	color code
60	FFFFFF
5	E5A84A
93	FFFFFF

These days, the size of one picture isn't so significant, but think about every frame of a movie, and think about the time required to send the information over the Internet. Compression makes it easier to stream that movie to you.

Use the Color Mixer on the "RGB colors and hexadecimal notation" page to check what color the hex code E5A84A produces.

Run-length encoding is a lossless compression format; it doesn't lose any information. The original picture can be reconstructed with every pixel exactly correct. But run-length encoding doesn't do well if the picture is a photograph where every pixel may be (at least slightly) different in color from its neighbors. If the length of each color run is just one pixel, storing both a run length and a color for every pixel can take up to twice as much space as just storing the color of each pixel. Another lossless image format you may have heard of is PNG (Portable Network Graphics, pronounced "ping").
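Here is a minimal sketch of run-length encoding in Python, applied to the top row of the logo described above. The function names rle_encode and rle_decode are invented for this example, and real image formats pack the runs into bytes rather than Python lists.

```python
def rle_encode(pixels):
    """Collapse consecutive equal colors into [count, color] runs."""
    runs = []
    for color in pixels:
        if runs and runs[-1][1] == color:
            runs[-1][0] += 1          # same color as the previous pixel: extend the run
        else:
            runs.append([1, color])   # new color: start a new run
    return runs


def rle_decode(runs):
    """Rebuild the original list of pixels from the runs."""
    pixels = []
    for count, color in runs:
        pixels.extend([color] * count)
    return pixels


# The top row of the logo: 60 white, 5 orange, 93 white pixels.
top_row = ["FFFFFF"] * 60 + ["E5A84A"] * 5 + ["FFFFFF"] * 93
encoded = rle_encode(top_row)
print(encoded)  # [[60, 'FFFFFF'], [5, 'E5A84A'], [93, 'FFFFFF']]

# Lossless: decoding gives back every pixel exactly.
print(rle_decode(encoded) == top_row)  # True

# A photo-like row where every pixel differs gets one run per pixel,
# so 4 pixels turn into 8 stored values.
noisy_row = ["000000", "000001", "000002", "000003"]
noisy_runs = rle_encode(noisy_row)
print(len(noisy_runs))  # 4
```

The last part shows the weakness mentioned above: when no two neighbors match, the encoded form stores more values than the original row did.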

Lossy Compression

Lossy compression algorithms let file sizes be even smaller, but the original picture can't be perfectly reconstructed; information is lost. This would be terrible if these algorithms were used to compress a computer program or a novel, but people's perception of images does not require extreme precision. Similarly, sounds and movies can survive lossy compression without most people noticing.

The most commonly used lossy compression algorithm for pictures is called JPEG (or JPG; both are pronounced "jay peg"). The name stands for "Joint Photographic Experts Group," the committee that invented it. Lossy algorithms usually let you control the degree of precision.
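To get a feel for trading precision for size, here is a deliberately simple sketch. It is not how JPEG works (JPEG is far more sophisticated); it just throws away the low bits of each color channel, which is one easy way to keep only an approximation. The function name quantize and the choice of bit depths are assumptions made for this example.

```python
def quantize(value, bits):
    """Keep only the top `bits` bits of an 8-bit channel value (0-255)."""
    shift = 8 - bits
    return (value >> shift) << shift


# The orange from the logo, E5A84A, as (red, green, blue).
original = (0xE5, 0xA8, 0x4A)  # (229, 168, 74)

for bits in (8, 6, 4, 2):
    approx = tuple(quantize(channel, bits) for channel in original)
    print(bits, approx)
# 8 (229, 168, 74)
# 6 (228, 168, 72)
# 4 (224, 160, 64)
# 2 (192, 128, 64)
```

With fewer bits per channel there is less to store, but the exact original values can no longer be recovered. For example, at 4 bits every red value from 224 to 239 becomes 224.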

Below are an original, uncompressed BMP and a highly compressed JPG of a picture measuring 256×192 pixels. Can you tell which is which?
[Two images of pond pebbles]

format	size
BMP encoding every pixel individually (shown above)	148 kB
PNG	106 kB
JPEG with least compression	94 kB
JPEG with most compression (shown above)	5 kB
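The sizes in the table can be compared directly. The short sketch below computes each file's size as a fraction of the BMP. The last lines also estimate the raw pixel data assuming three bytes per pixel; that figure is an assumption (the logo example earlier used four), chosen because it comes close to the 148 kB in the table.

```python
# File sizes in kB, copied from the table above.
sizes_kb = {
    "BMP": 148,
    "PNG": 106,
    "JPEG (least compression)": 94,
    "JPEG (most compression)": 5,
}

bmp_size = sizes_kb["BMP"]
ratios = {name: size / bmp_size for name, size in sizes_kb.items()}

for name, ratio in ratios.items():
    print(f"{name}: {ratio:.0%} of the BMP size")

# Assumption: 3 bytes per pixel for this 256x192 picture.
raw_bytes = 256 * 192 * 3
print(raw_bytes)  # 147456
```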

Lossy vs. Lossless

Lossless compression (like PNG) is reversible (no loss in quality); you can reconstruct the original data. It works by removing redundant data.

Lossy compression (like JPG) is not fully reversible; you can only reconstruct an approximation of the original data. It works by removing details that people aren't likely to notice.

These questions are similar to those you will see on the AP CSP exam.

A film student is recording a movie on his smartphone. When the recording is done, he decides to save a copy on his computer. The student then notices that the saved copy is of much lower image quality than the original. Which of the following could NOT be a possible explanation for the lower image quality?

The movie was saved using fewer bits per second than the original movie.

This is likely what happened.

The copy of the movie file was somehow corrupted in the process of saving.

This is possible; however, if the file is corrupted, it is unlikely to have a consistent negative impact on image quality.

The movie was saved using a lossy compression technique.

This is very likely.

Whenever a file is saved from one place on a computer to another, some information is always lost.

Correct. It is possible to make exact duplicates of digital information without any loss.

A visual artist is processing a digital image and overwriting the original. Which of the following describe lossless transformations of the digital image from which the original image can be recovered? Choose two answers.

Creating the negative of an image, where colors are reversed and dark areas appear light.

Correct. This transformation is reversible and is an example of a lossless transformation.

Blurring the edges of an image.

The blurring blends colors at the edges of the image, and once colors have blended, it is impossible to retrieve the original RGB values of the pixels involved.

Creating a grayscale copy of an image.

The grayscale version of an image replaces each pixel's RGB values with their average, and once colors have been averaged, it is impossible to retrieve the original RGB values of the pixels.

Creating a vertically flipped copy of the image.

Correct. This transformation is reversible and is an example of a lossless transformation.
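The difference between the reversible and irreversible transformations in this question can be tried out on a tiny made-up image. This is a sketch under simple assumptions: the image is a list of rows of (red, green, blue) tuples, the function names are invented for the example, and grayscale uses the plain average described in the feedback above.

```python
def negative(image):
    """Replace each channel value c with 255 - c."""
    return [[tuple(255 - c for c in pixel) for pixel in row] for row in image]


def flip_vertical(image):
    """Reverse the order of the rows."""
    return image[::-1]


def grayscale(image):
    """Replace each pixel's channels with their (integer) average."""
    return [[(sum(pixel) // 3,) * 3 for pixel in row] for row in image]


# A made-up 2x2 image: red, green, blue, and the logo's orange.
image = [
    [(255, 0, 0), (0, 255, 0)],
    [(0, 0, 255), (229, 168, 74)],
]

# Applying the negative or the flip a second time restores the original.
print(negative(negative(image)) == image)            # True
print(flip_vertical(flip_vertical(image)) == image)  # True

# Grayscale sends red, green, and blue to the same gray,
# so there is no way to tell afterward which color a pixel had.
gray = grayscale(image)
print(gray[0][0], gray[0][1], gray[1][0])  # (85, 85, 85) (85, 85, 85) (85, 85, 85)
```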
