What is a character set?
A list of characters and the codes used to
represent each one
What does ASCII stand for? How many bits are in each character and how many characters total?
- Seven bits for each character, 128 unique character
How many bits was ASCII later evolved to?
- Represent lines, symbols, and letters with accents
How many are control characters?
How to find the code for uppercase/lowercase letters?
Uppercase letters start 65
• Code for J (10th letter) is 65 + (10 – 1) = 74
Lowercase letters start at 97
• Code for j (10th letter) is 97+ (10 – 1) = 106
What is the issue with the ASCII set?
What is Unicode character set?
- The first 128 characters in the Unicode character set correspond exactly to the ASCII character set
How many bits per character and characters are represented in the Unicode character set?
What is lossless compression?
What is data compression?
What is the compression ratio?
What are the two types of data compression?
What are the three types of loseless techniques?
What is keyword encoding?
What type of technique is keyword encoding usually used with?
-Huffman encoding
What is Run-length encoding?
What does Huffman Encoding use to represent each
character?
What is an advantage of Huffman Encoding
-saving lots of storage space by making shorter sequences (less bits) for the frequent letters and longer sequences (more bits) for the non-frequent letters
What is the least effective compression?
-Keyword Encoding
What is the most effective compression?
-Huffman Encoding
What are the applications of Huffman encoding?
How to find compression ratio for Huffman encoding?
number of characters*bits/compressed bit length