Signal Processing Flashcards

Question

Each cone cell can distinguish a large number of different frequencies. A. Correct B. Incorrect C. Depends

Answer 1

B. Incorrect

Answer 2

Higher level representations of images as a combination of mathematical descriptions of curves, shapes, gradients. - still need to be transformed into raster for displays and printers which can lead to a different interpretation by the rasteriser - type of source encoding - use for tables and graphs

Answer 3

Pros: * Stable * Fits most presentation media (e.g., displays) * Fits many capture media (e.g., CCD capture) * Can use lossy compression Cons: * Not Scalable * Difficult to animate * Bad for 3D (voxels are expensive) * Limited spatial frequency – Results in aliasing

Answer 4

Cons: * Render-dependent * Large variability in encoding (# of objects) * Might encode invisible information * Non-realistic * Lossy compression generally not possible Pros: * Scalable * Potentially efficient * Easy to animate * Good for 3D * More possible transformations

Answer 5

- Scalable Vector Graphics - Supported (in varying degrees) natively by most current browsers - source compression as it is XML

Answer 6

* Paths (curved or straight lines) * Basic Shapes * Text * Painting (filling, outlines, strokes, gradient, pattern) * Colour (rgb) * Gradients and Patterns * Clipping, Masking and Compositing * Filter Effects * Interactivity (events, focus) * Linking * Scripting * Animation * Fonts * Metadata

Answer 7

- reduce with higher bit depth

Answer 8

- reduce by cooling

Answer 9

- reduce by filtering

Answer 10

- reduce by using lossless or less lossy encoding

Answer 11

Discrete Signal - Defined only at specific time intervals - Can have any value (continuous amplitude) at those intervals - Example: Temperature readings every hour Digital Signal - Defined at specific time intervals and - Has finite, quantized values (usually binary: 0s and 1s) - Example: Computer data transmission

Answer 12

Aliasing happens when converting a continuous signal (like sound or light) into a discrete digital signal through sampling, and the sampling rate is too low to accurately represent the original signal.

Answer 13

Sample at at least double the maximum frequency of your signal.

Answer 14

g) C and D

Answer 15

B. No C. It depends

Answer 16

* 0dB – barely audible sound (more later) * 20dB – Whisper * 40dB – Quiet office * 60dB – Normal conversation * 80dB – Hair dryer * 100dB – Heavy traffic, pneumatic drill * 120dB – Loud thunder, music concert * 140dB – Jet aircraft at take off

Answer 17

1. Outer Ear: Capturing Sound - The pinna (the visible part of the ear) collects sound waves from the environment. - These waves travel through the ear canal and strike the eardrum (tympanic membrane), causing it to vibrate.

Answer 18

2. Middle Ear: Amplifying Vibrations - The vibrations from the eardrum are transferred to three tiny bones called the ossicles: - Malleus (hammer) - Incus (anvil) - Stapes (stirrup) - These bones amplify the sound and transmit it to the oval window, a membrane leading to the inner ear.

Answer 19

3. Inner Ear: Converting to Electrical Signals - Vibrations enter the cochlea, a fluid-filled, spiral-shaped structure. - Inside the cochlea, vibrations create waves in the fluid, which move the basilar membrane. - Hair cells on this membrane bend in response, opening channels that generate electrical signals.

Answer 20

4. Auditory Nerve & Brain: Interpreting Sound - The auditory nerve carries these electrical signals to the brainstem and then to the auditory cortex in the temporal lobe. - The brain processes these signals, allowing us to recognize and understand sounds like speech, music, or environmental noise.

Answer 21

Physical Measures: * Intensity (amplitude) * Frequency * Spectrum (complexity)

Answer 22

Perceptual Sensations: * Loudness * Pitch * Timbre

Answer 23

The rise in the detection threshold of one tone (test tone) due to the presence of a second tone (masker tone)

Answer 24

Temporal Masking (Time-Based) - Definition: A sound becomes inaudible because another sound occurs immediately before or after it. - Types: - Forward masking: The masker comes before the target sound. - Backward masking: The masker comes after the target sound. - Example: A loud drumbeat can make a soft whisper just before or after it harder to hear. - Mechanism: Related to how the auditory system processes sounds over time—there’s a brief window where sounds can interfere with each other even if they don’t overlap.

Answer 25

Spectral Masking (Frequency-Based) - Definition: A sound becomes inaudible because another sound at a similar frequency occurs simultaneously. - Example: A loud tone at 1000 Hz can mask a quieter tone at 1050 Hz if played together. - Mechanism: Occurs due to the way the cochlea processes overlapping frequencies—sounds close in frequency activate similar regions, making it harder to distinguish them.

Answer 26

Phons are already corrected for the irregular human perception in different frequencies, so they will sound approximately equally loud. To compensate for the less sensitive perception at lower frequencies, the 300Hz signal has to be louder (aka larger amplitude).

Answer 27

Ratio between the largest and smallest differentiable signals.

Answer 28

1. diaphram moves due to sound wave 2. vibration becomes electrical signal 3. signal travels to gear 4. gear boosts signal 5. signal turned back into sound so we can hear it (speaker) -> type of transducer -> converts waves to signals

Answer 29

We filter them out! Otherwise they can create undesired frequency components.

Answer 30

1. Take the continuous value. 2. Match it to the nearest available digital value. 3. Store that digital value instead of the original. -> translates continous to digital -> allows for as fewer bits as possible

Answer 31

Compression + Expanding Used to avoid unequal levels of noise depending on the amplitude of the signal.

Answer 32

A-Law (in Europe) and µ-law (US+Japan)

Answer 33

- Filtering - Sampling - Quantisation

Answer 34

* The addition of noise * Quantization always adds some noise; however, the amplitude of the noise is smaller the larger the bits/sample we choose

Answer 35

- Pulse code modulation is a method used to digitally represent analog signals - Filter -> compressor -> sampler -> quantizer Standard PCM channel = 8bit/sample * 8000samples/second = 64Kbps

Answer 36

High bitrate

Answer 37

- Compression from companding - Compression to reduce bitrate of transmission or storage

Answer 38

- sample less often: 1. degrades quality 2. sound may loose its defining characteristics - sample with fewer bits/sample: 1. large quantisation noise 2. worse SNR (harder to perceive)

Answer 39

- differential pulse-code modulation - linear predictive coding - perceptual coding

Answer 40

1. DPCM 2. Predictive DPCM 3. Adaptive DPCM

Answer 41

- instead of coding the current value, of the signal every time, code the difference - we can then use fewer bits/sample to code the changes, at a comparable quality.

Answer 42

- same concept as DCPM - encodes difference between predicted sample and current sample - predicted sample is created several previous samples

Answer 43

- predictive (8 order) - coefficients change depending on signal - fewer bits to encode smaller differences

Answer 44

A speech/audio compression method that combines sub-band coding with Adaptive Differential Pulse Code Modulation (ADPCM). How it works: 1. Split the audio signal into frequency bands (sub-bands). 2. Apply ADPCM separately to each band. 3. Encode differences adaptively, reducing bit rate while keeping quality. Key idea: Human ears are more sensitive to some frequencies than others. By coding each band separately, compression is more efficient and sounds better. Uses: Broadcast audio, telephony, digital TV, and multimedia systems. Benefit: Lower data rate with good speech/audio quality.

Answer 45

Definition: A method to represent speech by predicting each sound sample from past samples. How it works: 1. Current sample ≈ weighted sum of previous samples. 2. Store predictor coefficients + pitch + loudness. 3. Recreate speech using these values. Key idea: Models the vocal tract as a filter and the voice as a source. Uses: Speech compression, synthesis, recognition.

Answer 46

- Moving Picture Experts Group - video and audio standards

Answer 47

* Encode change instead of absolute values * Embed assumptions about the source and characteristics of the signal * Split the signal in subbands, and codify subbands differently according to frequency and/or sensibility * Do not encode what you cannot hear (embed assumptions about human hearing in the encoding of the signal)

Answer 48

- variation of pressure - frequencies in Hz-kHz - speed ~330m/s

Answer 49

- variation of em field - frequencies in 10^14 - 10^15 Hz - speed ~ 300,000,00 m/s

Answer 50

- pupil - iris -> providing 20~ fold variation in area - cornea - lens - retina - optic nerve - fovea

Answer 51

Definition: Daytime vision that works in bright light. Cells used: Cone cells in the retina.

Answer 52

- fovea - 3 types ( S M L ) - responsible for colour vision and acuity - about 4.5 mil

Answer 53

Definition: Nighttime vision that works in very low light. Cells used: Rod cells in the retina.

Answer 54

- peripheral vision - very sensitive - night vision - roughly 90 mil

Answer 55

- High dynamic range imaging

Answer 56

- visible spectrum: 380 - 760 nm - cones S M and L wavelength sensitivity - HSV - Hue: dominant frequency - Saturation: purity of signal - Value: brightness intensity

Answer 57

- two different signals can produce the same colour sensation - means any colour sensation cana be generated

Answer 58

- intensity threshold is very small - flicker detection -> critical flicker frequency ~120 Hz - acuity up to 0.5 seconds of arc

Answer 59

- masking - blind spot

Answer 60

Visible light

Answer 61

Moves charge pixel-to-pixel, converts to voltage at output

Answer 62

Converts charge to voltage inside each pixel

Answer 63

RGB – screens add Red, Green, Blue light to make colours

Answer 64

CMYK – inks absorb light (Cyan, Magenta, Yellow, Key/Black)

Answer 65

The range of colours a device or ink can reproduce

Answer 66

Devices differ – profiles ensure consistent colour reproduction

Answer 67

8 bits per channel (R, G, B)

Answer 68

Number of pixels (e.g., VGA 640×480, 1080p 1920×1080, 4K UHD)

Answer 69

Use a palette of colours; pixels reference palette entries

Answer 70

Pixel grid, stable, good for photos, not scalable

Answer 71

Shapes/curves described mathematically, scalable, efficient, good for animation/3D

Answer 72

SVG, PDF, PostScript, DWG

Answer 73

Paths, shapes, text, colour, gradients, filters, animation

Answer 74

Source compression (assumes abstract graphics) + XML lossless compression

Answer 75

- Scalable (no loss of quality when zooming) - Efficient for some uses - Easy to animate - Good for 3D and transformations

Answer 76

- Render‑dependent - Encoding varies a lot - Can store invisible info - Less realistic - Lossy compression not possible

Answer 77

- Paths (lines, curves) - Shapes (circle, rectangle, etc.) - Text - Colour (RGB) - Gradients & patterns - Clipping, masking, compositing - Filter effects - Interactivity (events, focus) - Linking & scripting - Animation - Fonts & metadata

Answer 78

- Quantisation noise → reduce with higher bit depth - Thermal noise → reduce by cooling - Background signals → reduce with filtering - Lossy encoding noise → reduce with lossless or less lossy encoding

Answer 79

- Lossless: No data lost (e.g. PNG, GIF). - Lossy: Removes less important info (e.g. JPEG).

Answer 80

- Same problem as undersampling in sound. - Frequency & sampling are spatial instead of temporal. - Causes jagged edges or moiré patterns. - Fixed with anti‑aliasing filters.

Answer 81

What it is: The whole process of turning an image into the JPEG format. Steps included: 1. Convert RGB → YCbCr (color space conversion). 2. Apply DCT to split the image into frequency components. 3. Quantization (lossy step). 4. Entropy coding (lossless step, e.g., Huffman coding). Key idea: Encoding = the pipeline that prepares and stores the image as a JPEG file.

Answer 82

RGB → screens, cameras CMYK → printing HSV/HSL → perceptual, colour picking YCbCr → separates luminance (Y) from chromaticities (Cb, Cr); allows reduced resolution for colour

Answer 83

The whole process of turning an image into the PNG file format. Steps include: 1. Filtering: rearranges pixel values to highlight patterns. 2. Compression: applies Deflate (LZ77 + Huffman coding). 3. Packaging: stores pixels + metadata (like transparency, color info). Key idea: Encoding = the pipeline that produces a PNG file.

Answer 84

Entropy compression (lossless: RLE, Huffman, differential) Lossy perceptual compression (quantization, undersampling colour components)

Answer 85

A. Content complexity B. Settings of compression algorithm D. Implementation of the encoder

Answer 86

Yes – luminance macroblocks (16×16) carry more information than chromaticity (2×8×8).

Answer 87

Content complexity & movement, compression algorithm settings, and encoder implementation.

Answer 88

Peak finding, mean intensity (over time), and image segmentation.

Answer 89

Dividing an image into meaningful regions (e.g., organs, tumours) using edge detection, local features, or machine learning.

Answer 90

Length, width, aspect ratio (L/W), and histograms of these values.

Answer 91

Segmenting images by treating intensity as topography, identifying catchment basins and watershed lines.

Answer 92

Area too small/large, eccentricity <0.7, too long/short, or touching image border.

Answer 93

MRI (T1, T1c, T2, FLAIR), CT, and PET.

Answer 94

BraTS (Brain Tumour Segmentation Challenge) dataset.

Answer 95

Significant improvements in semantic segmentation, especially for multi-class tumour sub-regions.

Answer 96

Dice coefficient: = 2𝑇𝑃 --------------------------- 2𝑇𝑃+𝐹𝑃+𝐹𝑁.

Answer 97

1. Pathology is the science of diseases. It studies their causes and effects by looking at tissue samples. 2. Histopathology is the study of changes in tissues caused by disease.

Answer 98

Digital pathology uses scanners to turn tissue slides into digital images. These can be stored, shared, and analysed by computers.

Answer 99

Large models trained on millions of slides (e.g. UNI, Phikon, CTransPath). They learn general features and can be fine‑tuned for tasks.

Answer 100

A way to combine an image with a filter (small matrix) to change or highlight features.

Answer 101

To filter images, detect edges, blur, sharpen, or segment parts of an image.

Answer 102

A grid of numbers (pixels). Each number shows brightness or colour.

Answer 103

Applying a filter (matrix) to an image by multiplying and adding values.

Answer 104

1. Correlation: filter slides over image without flipping. 2. Convolution: filter is flipped before sliding.

Answer 105

To handle edges of an image when applying filters.

Answer 106

1. Zero padding: add zeros around edges. 2. Constant padding: add fixed values. 3. Replicate padding: copy edge values. 4. Reflect padding: mirror the image edges.

Answer 107

A way to automatically find the best threshold to separate foreground and background.

Answer 108

A method that treats the image like a landscape and finds “basins” to separate regions.

Answer 109

They help machines see patterns in images, like edges, shapes, or textures.

Answer 110

Convolutions are the building blocks of Convolutional Neural Networks (CNNs).

Answer 111

It means a computer learns from data instead of being given fixed rules.

Answer 112

It helps us find patterns, make predictions, and handle complex tasks that are hard to code by hand.

Answer 113

It takes input data, learns a pattern, and gives a prediction.

Answer 114

Supervised learning and unsupervised learning.

Answer 115

Learning from labelled data where we know the correct answers.

Answer 116

1. Predict numbers (regression) 2. Predict categories (classification)

Answer 117

Finding groups or patterns in data without labels.

Answer 118

To predict a number using a straight‑line relationship.

Answer 119

When a model learns the training data too well, but fails on new data.

Answer 120

When a model is too simple and misses important patterns.

Answer 121

Split data into training, validation, and test sets.

Answer 122

Predicting a class label, like “cancer” or “not cancer”.

Answer 123

A line or shape that separates classes in the data.

Answer 124

A model inspired by the brain, made of layers of neurons that learn patterns.

Answer 125

It turns numbers into a value between 0 and 1, useful for classification.

Answer 126

Train one model per class. Each model says: “This class vs. all others.” Example: Cat vs Not‑Cat, Dog vs Not‑Dog, Bird vs Not‑Bird.

Answer 127

Self‑driving cars Medical imaging Banking Online shopping Voice assistants

Answer 128

A piece of input data, like age, pixel value, or test result.

Answer 129

A decision boundary in many features, too many to draw.

Answer 130

Logistic regression SVM Neural networks Random forests

Answer 131

It groups data into clusters by moving centroids to the average position of points.

Answer 132

The human brain - neurons connected by input and output wires.

Answer 133

Idea: downsampling method - reduce the size of feature maps to make the network faster and more focused. What it does: Takes the largest value from a small block (e.g. 2×2) - then summarises that block into one single value Why: Keeps the strongest feature (like sharp edges). Effect: Shrinks data size, highlights important details.

Answer 134

Idea: downsampling method - reduce the size of feature maps to make the network faster and more focused. What it does: Takes the average value from a small block - then summarises that block into one single value Why: Smooths features, keeps overall trend. Effect: Shrinks data size, reduces noise, less sharp than max pooling.

Answer 135

It finds the best parameters by moving step by step to reduce error.

Answer 136

A table showing true positives, true negatives, false positives, false negatives.

Answer 137

Imaging bones and detecting fractures. But can also be used for detecting pneumonia and for mammography.

Answer 138

High‑frequency sound waves (1–15 MHz).

Answer 139

1. A‑mode (Amplitude mode) Shows echoes as spikes on a graph. Used for measuring distances or tissue thickness (e.g., eye exams). 2. B‑mode (Brightness mode) Produces a 2D grayscale image. Most common mode for viewing organs and tissues. 3. M‑mode (Motion mode) Tracks movement over time (one scan line). Often used for heart motion and valve studies. 4. Doppler mode Measures frequency shifts in echoes. Shows blood flow speed and direction (color Doppler adds visual maps).

Answer 140

Because it doesn't use ionising radiation.

Answer 141

Hydrogen nuclei (protons) in water.

Answer 142

TR: Time between successive RF pulse sequences. TE: Time between RF pulse and echo signal.

Answer 143

T1: Fat bright, CSF dark T2: Fluid bright, CSF bright

Answer 144

Makes CSF dark but keeps abnormalities bright → easier to spot pathology.

Answer 145

Measure: Metabolism (sugar uptake in tissues). Use: Detecting cancer spread (metastasis).

Answer 146

Using a camera inside the body to see organs directly.

Answer 147

They annihilate → two gamma rays (511 keV) emitted in opposite directions.

Answer 148

Rotating X‑ray source and detectors with a moving table. Planes: Coronal (front/back), Sagittal (side), Axial (top/bottom).

Answer 149

They can damage DNA through ionisation.

Answer 150

Back: Spread beam values across pixels, add them up to estimate the image. Filtered: Same idea, but uses filters to reduce blur.

Answer 151

A scale for CT values (air ≈ ‑1024, water ≈ 0, bone ≈ +1024).

Answer 152

1. Patients move, scans differ in angle or slice thickness. 2. Use markers (internal or external) that appear in all images.

Answer 153

Electronic, thermal, speckle (ultrasound), artefacts. Reduced: Filters, subtract dark current, average over time.

Answer 154

Regions of interest.

Answer 155

Manual boxes, image analysis, ML

Answer 156

- heart rate: HR < 40, > 130 bpm - oxygen: <90%

Answer 157

What: Turning video into a digital format that can be stored or sent. Why: Raw video is huge. Compression makes files smaller by removing repeated or less important data.

Answer 158

Less than about 20 per second. The brain holds each image briefly, hence why the fast updates feel continuous and look smooth.

Answer 159

It scans lines across the screen with light.

Answer 160

Splitting even and odd lines to reduce flicker.

Answer 161

Example: 1080p at 30 fps needs ~1.5 billion bits per second. -> need to reduce and remove redundancy

Answer 162

Store less colour detail than brightness detail (e.g., 4:2:0).

Answer 163

Keep low‑frequency detail, drop high‑frequency detail (like JPEG).

Answer 164

Frames are similar. Only send differences.

Answer 165

I‑frame: 10–20:1 P‑frame: 20–30:1 B‑frame: 30–50:1

Answer 166

To move from a question to clear results through steps like getting data, exploring it, modelling it, and reporting findings.

Answer 167

Split data, check for missing values, look at class balance, view samples, and calculate simple stats.

Answer 168

Acquire → clean → label → handcrafted features → background removal → edge detection → segmentation → rule‑based classification.

Answer 169

Acquire → clean → label → split data → extract features → train ML model → evaluate.

Answer 170

Acquire → clean → label → split data → build CNN → train → evaluate.

Answer 171

MAE, RMSE, and R².

Answer 172

TP: correct positive. TN: correct negative. FP: predicted positive but wrong. FN: predicted negative but wrong.

Answer 173

Precision, recall, F1 score, accuracy, and confusion matrices.

Answer 174

1. Measuring motion between video frames. 2. Calculates motion vectors for every pixel. 3. Calculates motion for selected points or regions.

Answer 175

Things that look different from reality. They can come from capture, encoding, or playback.

Answer 176

Random electrical noise in sensors → lowers signal quality.

Answer 177

Small errors when turning analogue signals into digital numbers.

Answer 178

More motion → harder to compress → lower quality at fixed bitrate.

Answer 179

Frames are duplicated or interpolated → may look odd.

Answer 180

Scaling up/down can cause blur or blockiness.

Answer 181

Converting stored colours to screen RGB. Needs careful scaling if >8 bits per channel.

Answer 182

- A dataset with 14M+ images, 1M+ bounding boxes, 20k+ categories. - Allowed for breakthroughs like AlexNet and modern deep learning. - Deep conv nets allowed for over 95% accuracy on classification tasks.

Answer 183

120+ open datasets (MRI, CT, PET, Ultrasound) for cancers.

Answer 184

A huge X‑ray dataset from Stanford.

Answer 185

1. Intra-frame: Spatial redundancy, compress each frame like JPEG (to remember: r comes first in intra and r is next to s in the alphabet hence spatial) 2. Inter-frame: Temporal redundancy, motion compensation, predicative coding. 3. (t come after s in the alphabet so after r is inter, and hence temporal)

Answer 186

1. set-up (lighting, etc) 2. audio (spectrograms, species specific ranges, etc) 3. video (CNNs for visual classification) 4. combine video and audio

Answer 187

X‑ray tube rotates around patient. Detectors measure how much X‑ray passes through. Computer combines many angles → cross‑section image. Key idea: Shows tissue density differences. Lossy step: X‑ray dose absorbed by body.

Answer 188

Strong magnet aligns hydrogen atoms. Radio waves knock them out of alignment. As they relax, they send signals. Gradients map position → image. Key idea: Great soft tissue contrast.

Answer 189

Inject radioactive tracer (e.g. FDG). Tracer emits positrons → annihilation → 2 gamma rays. Detectors in ring catch both rays. Computer maps tracer distribution. Key idea: Shows metabolism, not structure.

Answer 190

Probe sends sound waves into body. Echoes return from tissue boundaries. Time + strength of echoes → depth + brightness. Computer builds real‑time image. Key idea: Safe, portable, live imaging.

Signal Processing Flashcards

(217 cards)