Types of text:
Text file formats
File formats are application specific
TXT - plain text document without much formatting options
DOC - created by document editing softwares like Microsoft word and is in binary format
RTF - Rich Text Format using Microsoft
PDF - Portable Document Format (binary format)
PS - Post Scripts (programming language the defines look of a printed page)
What is meant by text compression?
Reducing the size of a text document without changing the contents of the document while storing on a disk
Components of a compression technique:
Encoder
Storage or network
Decoder
What is the compression ratio?
The ratio of the total number of bits required before compression to the total number of bits required after compression
What is Huffman coding?
Instead of fixed sized code words variable length code words are derived such that the shortest code words are used for the words occurring frequently
What is the requirement of Huffman coding?
Used in applications where the text to be compressed has some known characteristics in terms of characters and their relative frequency of occurrences
Applications of Huffman coding
Fax machines, JPEG, MPEG
Image file formats
BMP - lossless, developed by Microsoft
TIFF - lossless file format, high quality, large size
JPEG - lossy, small size
GIF - lossless compression, larger size than JPEG, limited color range(256 colors)
PNG - lossless, support 16 million colors
Run–Length–Encoding (RLE)
Shannon-Fano Algorithm
Arithmetic Coding
Vector Quantization
Fractal Compression technique
Transform Coding
What is image processing?
It is the manipulation and analysis of visual information contained in an image.
Elements of image processing:
Acquisition - capturing visual information from real world and convert it into digital form
Storage - storage of acquired digital information of image for later retrieval and processing
Communication - transfer of image from one location or system to another
Display- rendering digital image data on a visual output device
Enhancement - improvement of the visual quality or image data to make it more suitable for analysis and interpretation