ASD Flashcards

(127 cards)

1
Q

Algorithm Specification

A

A specification precisely expresses the task to be done and consists of:
Name: The name of the algorithm and its arguments.
Initial Condition (Input): Specifies what constitutes “correct” input data.
Final Condition (Output): Specifies the desired result or state after the algorithm finishes.

2
Q

Name

A

The name of the algorithm and its arguments.

3
Q

Initial Condition (Input)

A

Specifies what constitutes “correct” input data.

4
Q

Final Condition (Output)

A

Specifies the desired result or state after the algorithm finishes.

5
Q

Total Correctness

A

An algorithm is totally correct if, for any correct input data, it stops and returns the correct output.

6
Q

Partial Correctness

A

An algorithm is partially correct if, for correct input data, whenever it stops, the result is guaranteed to be correct. Partial correctness does not guarantee that the algorithm actually stops.

7
Q

Stop Property

A

The requirement that an algorithm will certainly finish after a finite number of iterations for any correct input data.

8
Q

Prove the Stop Property

A

Often shown by identifying a loop variable that moves toward a constant termination condition by a fixed increment/decrement.

9
Q

Prove Partial Correctness

A

Usually involves using a Loop Invariant.

10
Q

Loop Invariants Definition

A

A logical predicate that, if satisfied before an iteration of a loop, remains satisfied after that iteration.

Example: In a “find maximum” algorithm, a typical invariant is that the current candidate $x$ is the maximum of all array elements visited so far ($\forall_{0\le j<i}\, x \ge Arr[j]$).
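The invariant can be checked mechanically at run time; a minimal Python sketch (the `find_max` name is illustrative, not from the lecture):

```python
def find_max(arr):
    """Return the maximum of a non-empty array, maintaining the invariant
    that x is the maximum of all elements visited so far."""
    x = arr[0]
    for i in range(1, len(arr)):
        # Invariant: x >= arr[j] for all 0 <= j < i
        assert all(x >= arr[j] for j in range(i))
        if arr[i] > x:
            x = arr[i]
    # Termination (i == len(arr)) combined with the invariant
    # implies x is the maximum of the entire array.
    return x
```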

11
Q

Loop Invariants Application

A

To prove an algorithm’s goal, you combine the loop invariant with the loop’s termination condition.

Example: In a “find maximum” algorithm, combining the invariant $\forall_{0\le j<i}\, x \ge Arr[j]$ with the termination condition $i = len$ proves that $x$ is the maximum of the entire array.

12
Q

Measuring Algorithm Speed

A

Method: The speed of an algorithm is measured by counting its basic (elementary) operations rather than measuring execution time in seconds.

Independence: This approach ensures the measure is independent of specific programming languages, hardware, or software platforms.

Internal Property: It treats complexity as an internal property of the algorithm itself.

13
Q

Assessment Prerequisites

A

Determine the Dominating Operation Set: Identify the operations that represent the bulk of the work.

Determine the Data Size: Identify what in the input (usually denoted as $n$) influences the number of operations.

14
Q

Dominating Operations

A

Definition: Dominating operations are those performed most frequently and whose count is proportional to the total amount of work performed by the algorithm.

Requirement: These operations must be performable in constant time.

Examples: In a search algorithm, the comparison arr[i] == key or the index increment i++ are typical dominating operations.

15
Q

Time Complexity

A

The number of dominating operations executed by the algorithm, expressed as a function of the data size $n$.

16
Q

Space Complexity ($S(n)$)

A

The number of units of memory (e.g., machine words) used by the algorithm as a function of data size. Usually, memory used for input data is excluded.

17
Q

Pessimistic (Worst-case) Time Complexity ($W(n)$)

A

The maximum number of dominating operations performed for any input of size $n$; it is the “supremum” of work across all possible datasets of that size.

18
Q

Average Time Complexity ($A(n)$)

A

The expected value of the number of dominating operations, calculated by multiplying the number of operations for each case by its probability of occurrence.

19
Q

Asymptotic Notation

A

Purpose: It provides a way to express complexity while neglecting unimportant details such as additive or multiplicative constants (e.g., $2.1n - 1$ is simply seen as having a linear rank).

20
Q

“Big O” ($O$)

A

Defines an upper bound. $f(n) = O(g(n))$ if there exist constants $c > 0$ and $n_0$ such that $f(n) \le c \cdot g(n)$ for all $n \ge n_0$. Intuitively, it corresponds to “$\le$”.

21
Q

“Big Theta” ($\Theta$)

A

Defines the same rank. $f(n) = \Theta(g(n))$ if $f(n) = O(g(n))$ and $g(n) = O(f(n))$. Intuitively, it corresponds to “$=$”.

22
Q

“Big Omega” ($\Omega$)

A

Defines a lower bound. $f(n) = \Omega(g(n))$ means $g(n) = O(f(n))$, intuitively corresponding to “$\ge$”.

23
Q

Small “o” and Small “omega” ($\omega$)

A

These represent sharp inequalities, intuitively corresponding to “$<$” and “$>$” respectively.

24
Q

Common Ranks of Functions (from lowest to highest growth)

A

Constant: $\Theta(1)$
Logarithmic: $\Theta(\log n)$
Linear: $\Theta(n)$
Linear-logarithmic: $\Theta(n \log n)$
Square (Quadratic): $\Theta(n^2)$
Cubic: $\Theta(n^3)$
Exponential: $\Theta(2^n)$
Factorial: $\Theta(n!)$

25
General Searching and Sequential Algorithm
The Problem: Searching for a key in a sequence of length len.
Sequential Search: A standard approach where elements are checked one by one.
Analysis:
Dominating Operation: Comparison of elements.
Complexity: Linear time complexity, $W(len) = \Theta(len)$.
Limitation: For unordered sequences, this cannot be improved.
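The sequential approach can be sketched in a few lines; a minimal Python version (the `sequential_search` name and the `-1` not-found convention are illustrative):

```python
def sequential_search(arr, key):
    """Check elements one by one; the comparison arr[i] == key is the
    dominating operation, so W(len) = Theta(len)."""
    for i in range(len(arr)):
        if arr[i] == key:
            return i
    return -1  # key not present
```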
26
Searching in Sorted Sequences
Efficiency: If the input sequence is sorted (non-decreasingly or non-increasingly), the problem can be solved much faster.
Skipping Algorithm: Checks every $k$-th cell. If a value higher than the key is found, it performs a linear search only on the preceding $k-1$ elements.
Complexity: Asymptotically $k$ times faster than sequential search, but still of linear rank: $W(len) = \frac{1}{k} \cdot \Theta(len)$.
Optimal Jump: The optimal choice for $k$ is $\sqrt{len}$.
27
Binary Search Algorithm
Method: Uses the "Divide and Conquer" rule by repeatedly halving the search space.
Process: Check the middle element. If it equals the key, return the index. If the middle element is higher than the key, restrict the search to the left sub-sequence; otherwise, search the right.
Analysis:
Time Complexity: $W(len) = \Theta(\log_2(len))$ and $A(len) = \Theta(\log_2(len))$.
Space Complexity: $O(1)$.
Requirement: Requires random access to elements (data must be in RAM).
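The halving process above, as an iterative Python sketch (naming is illustrative; assumes a non-decreasingly sorted array):

```python
def binary_search(arr, key):
    """Iterative binary search: halves the search space each step,
    giving W(len) = Theta(log2(len)) and O(1) extra space."""
    lo, hi = 0, len(arr) - 1
    while lo <= hi:
        mid = (lo + hi) // 2
        if arr[mid] == key:
            return mid
        if arr[mid] > key:
            hi = mid - 1   # restrict to the left sub-sequence
        else:
            lo = mid + 1   # restrict to the right sub-sequence
    return -1              # key not present
```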
28
Positional Statistics
Definition: The $k$-th positional statistic is the $k$-th smallest (or largest) element in a sequence.
Tournament Algorithm: Used to find the second smallest element efficiently. Elements compete in pairs; winners (smaller elements) move to the next round. The second smallest must be among those who lost directly to the winner in the tournament history.
Complexity: Requires $len - 1$ comparisons to find the minimum, plus $O(\log_2(len))$ to find the second smallest.
29
Partition and Hoare’s Algorithm
Partition Procedure: Selects a "median" element $M$ and reorganizes the sequence so elements $\le M$ are on the left and elements $\ge M$ are on the right.
Complexity: Time $W(n) = n + O(1)$ and space $S(n) = O(1)$.
Hoare's Algorithm (Quickselect): Uses partition to find the $k$-th statistic. If the partition index equals $k$, the element is found; otherwise, it recurses on the appropriate sub-sequence.
Complexity: Linear time complexity on average.
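A quickselect sketch in Python. Note this uses the Lomuto partition scheme for brevity, not Hoare's original two-pointer scheme; the structure (partition, then recurse on one side) is the same:

```python
def partition(arr, lo, hi):
    """Lomuto-style partition around arr[hi] (a simpler variant of
    Hoare's scheme). Returns the pivot's final index."""
    pivot = arr[hi]
    i = lo
    for j in range(lo, hi):
        if arr[j] <= pivot:
            arr[i], arr[j] = arr[j], arr[i]
            i += 1
    arr[i], arr[hi] = arr[hi], arr[i]
    return i

def quickselect(arr, k):
    """Return the k-th smallest element (k counted from 1).
    Linear time on average; modifies arr in place."""
    lo, hi = 0, len(arr) - 1
    while True:
        p = partition(arr, lo, hi)
        if p == k - 1:
            return arr[p]
        if p > k - 1:
            hi = p - 1     # k-th statistic is in the left part
        else:
            lo = p + 1     # k-th statistic is in the right part
```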
30
The Sorting Problem
Input: A sequence $S$ of elements that can be ordered using a total-order relation.
Output: A non-decreasingly sorted sequence $S'$ containing the same elements as the input.
Importance: Sorting is a fundamental operation used to accelerate searching, visualize data, and compute statistical characteristics.
31
Selection Sort
Idea: Repeatedly identify the minimum element in the unsorted portion of the array and swap it into its correct position at the front.
Analysis:
Dominating Operation: Comparison of two elements.
Pessimistic and Average Complexity: $W(n) = A(n) = \Theta(n^2)$.
Behavior: It always performs the same number of comparisons regardless of the initial order of the data, even if the sequence is already sorted.
32
Insertion Sort
Idea: Iteratively take the "next" element and insert it into its correct position within the already sorted prefix of the sequence.
Analysis:
Pessimistic Case: Occurs when the data is sorted in reverse order, leading to $\Theta(n^2)$.
Average Case: $\Theta(n^2)$, though it is typically twice as fast as Selection Sort in practice.
Optimistic Case: Very fast for nearly sorted data; if already sorted, it requires only $n-1$ comparisons (linear complexity).
Variant: Using Binary Search to find the insertion point reduces the number of comparisons to $\Theta(n \log n)$, but the overall complexity remains $\Theta(n^2)$ due to the linear cost of shifting elements in an array.
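The insertion step above, as a minimal Python sketch (in-place; naming illustrative):

```python
def insertion_sort(arr):
    """Insert arr[i] into the sorted prefix arr[0..i-1].
    Worst case Theta(n^2); already-sorted input needs only n-1 comparisons."""
    for i in range(1, len(arr)):
        x = arr[i]
        j = i - 1
        while j >= 0 and arr[j] > x:
            arr[j + 1] = arr[j]   # shift larger elements one cell right
            j -= 1
        arr[j + 1] = x            # insertion point found
    return arr
```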
33
Merge Sort
Idea: A "Divide and Conquer" algorithm that splits the sequence into halves, sorts each half recursively, and then merges the sorted halves.
Analysis:
Time Complexity: $W(n) = A(n) = \Theta(n \log n)$.
Space Complexity: High for arrays ($\Theta(n)$) because it requires temporary memory for merging.
Performance: The difference between linear-logarithmic and square complexity is massive for large datasets, e.g., when sorting 100 million records.
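A compact Python sketch of the split-sort-merge scheme (returns a new list rather than sorting in place; naming illustrative):

```python
def merge_sort(arr):
    """Split into halves, sort each recursively, then merge:
    Theta(n log n) time, Theta(n) extra space for the merge buffer."""
    if len(arr) <= 1:
        return arr
    mid = len(arr) // 2
    left, right = merge_sort(arr[:mid]), merge_sort(arr[mid:])
    merged, i, j = [], 0, 0
    while i < len(left) and j < len(right):
        if left[i] <= right[j]:        # <= keeps the sort stable
            merged.append(left[i]); i += 1
        else:
            merged.append(right[j]); j += 1
    return merged + left[i:] + right[j:]  # append the leftover tail
```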
34
Linked Lists vs. Arrays
Linked Lists:
Structure: Consists of nodes containing data and a link to the next node.
Advantages: Very fast modification operations (inserting/deleting); can improve Merge Sort space complexity to $O(1)$ by avoiding element copying.
Disadvantages: Slower access (no random access; traversal must start from the head) and extra memory for links.
Arrays:
Advantages: Very fast random access and lower memory overhead.
Disadvantages: Inserting or deleting elements has linear time complexity.
35
Stability of Sorting Algorithms
Definition: A sorting algorithm is stable if it preserves the original relative order of elements with the same value (ties).
Importance: In practical applications like databases, records are often sorted by one attribute (a key). Stability ensures that if two records have the same key, their original order is maintained, which is crucial when sorting multi-attribute records in successive passes.
QuickSort: It is less naturally stable than algorithms like Merge Sort.
36
QuickSort
Idea: A "Divide and Conquer" algorithm that uses a pivot element ($M$) to reorganize a sequence so that no larger element is to its left and no smaller element is to its right. The process is then applied recursively to the two resulting subsequences.
Partition: This linear-time procedure ($W(n) = \Theta(n)$) places the pivot in its final position and returns its index.
Complexity:
Average Case: $\Theta(n \log n)$.
Pessimistic Case: $\Theta(n^2)$, occurring when the input is already sorted or sorted in reverse, leading to maximum recursion depth.
Space: Although it sorts "in place," recursion costs implicit memory, resulting in a pessimistic space complexity of $O(n)$.
37
The Lower Bound for Comparison-Based Sorting
The Limit: No comparison-based sorting algorithm can have an average or worst-case time complexity better than linear-logarithmic, $\Theta(n \log n)$.
Explanation: Sorting can be modeled as a binary decision tree in which each leaf is one of the $n!$ possible permutations of the input. The height of this tree (the number of comparisons) must be at least $\log_2(n!)$, which is approximately $n \log n$.
38
Non-Comparison Based Sorting
To "beat" the $\Theta(n \log n)$ limit, algorithms must use operations other than comparisons, often trading higher space complexity for lower time complexity. Examples: CountSort and RadixSort.
39
CountSort
Idea: Uses direct addressing to place elements. It requires the input to be non-negative integers.
Complexity: Time complexity is linear, $\Theta(n + m)$, where $n$ is the sequence length and $m$ is the maximum value. However, space complexity is also $\Theta(n + m)$ due to helper arrays.
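A minimal counting-sort sketch for non-negative integers (this simple form counts occurrences rather than keeping records, so it illustrates the direct-addressing idea but is not the stable variant RadixSort needs):

```python
def count_sort(arr):
    """Sort non-negative integers by direct addressing.
    Time and space are Theta(n + m), where m is the maximum value."""
    if not arr:
        return []
    m = max(arr)
    counts = [0] * (m + 1)        # helper array indexed by value
    for x in arr:
        counts[x] += 1
    result = []
    for value, c in enumerate(counts):
        result.extend([value] * c)  # emit each value as often as it occurred
    return result
```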
40
RadixSort
Idea: A scheme that sorts objects (like strings or multi-digit numbers) by applying a stable internal sorting algorithm to each digit or position, starting from the least significant and moving to the most significant.
Internal Algorithm: CountSort is a common choice when the alphabet of symbols (digits/characters) is limited.
41
Aspects of Recursion as an Algorithmic Tool
Definition: Recursion involves a function calling itself. In algorithms, it is used to reduce a problem instance to a smaller instance of the same problem, often referred to as "divide and conquer."
Positive Aspect: Recursion provides a very compact and elegant representation of an algorithm.
Negative Aspects: It implicitly costs additional memory because the machine must maintain a recursion stack. Deep recursion can lead to system failures (e.g., calling a recursive function for $n = 100{,}000$ might crash a machine). Whenever possible, recursion should be translated into an iterative version to save memory.
42
Fibonacci Numbers
Defined by the base cases $Fib(0)=0$, $Fib(1)=1$ and the step $Fib(n+1) = Fib(n) + Fib(n-1)$. The value grows exponentially; for example, $Fib(50)$ is over 12 billion.
43
Hanoi Towers
A riddle involving moving $n$ rings between three sticks. The number of moves is defined by $hanoi(n) = 2 \cdot hanoi(n-1) + 1$, which solves to $2^n - 1$ moves.
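The recurrence can be made concrete by generating the moves themselves; a small Python sketch (stick names "A"/"B"/"C" are illustrative):

```python
def hanoi(n, src="A", aux="B", dst="C"):
    """Return the list of moves transferring n rings from src to dst.
    The recurrence moves(n) = 2*moves(n-1) + 1 solves to 2^n - 1."""
    if n == 0:
        return []
    return (hanoi(n - 1, src, dst, aux)   # park n-1 rings on the aux stick
            + [(src, dst)]                # move the largest ring
            + hanoi(n - 1, aux, src, dst))  # bring the n-1 rings on top
```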
44
Solving Linear 2nd-Order Equations
To solve a recurrence of the form $s_n = as_{n-1} + bs_{n-2}$, use the characteristic equation $x^2 - ax - b = 0$.
Single Root ($r$): $s_n = c_1r^n + c_2nr^n$.
Two Roots ($r_1, r_2$): $s_n = c_1r_1^n + c_2r_2^n$.
Binet's Formula: An application of this method to the Fibonacci sequence, yielding $Fib(n) = \frac{1}{\sqrt{5}} \left( \left( \frac{1+\sqrt{5}}{2} \right)^n - \left( \frac{1-\sqrt{5}}{2} \right)^n \right)$.
45
Important 3 Cases of Recursive Equations
In algorithmics, three specific recursive forms are frequently encountered for time complexity $t(n)$:
Case 1 (Logarithmic): $t(n) = t(n/2) + c \Rightarrow \Theta(\log n)$ (e.g., Binary Search).
Case 2 (Linear): $t(n) = 2t(n/2) + c \Rightarrow \Theta(n)$ (e.g., finding the maximum in a sequence).
Case 3 (Linear-Logarithmic): $t(n) = 2t(n/2) + cn \Rightarrow \Theta(n \log n)$ (e.g., Merge Sort).
46
The Master Theorem
The Master Theorem provides a universal method for solving equations of the form $T(n) = aT(n/b) + f(n)$. It compares the rank of $f(n)$ with $n^{\log_b a}$:
If $f(n)$ is polynomially lower than $n^{\log_b a}$, then $T(n) = \Theta(n^{\log_b a})$.
If both are of the same rank, then $T(n) = \Theta(n^{\log_b a} \log n)$.
If $f(n)$ is polynomially higher (and satisfies a regularity condition), then $T(n) = \Theta(f(n))$.
47
Sequence Data Structures: Concrete Implementations
Sequences are the most common data structures and are primarily implemented using Arrays or Linked Lists.
48
Arrays
Absolute Access: Extremely fast, constant time $O(1)$.
Relative Access: Slow, linear time $\Theta(n)$.
Limitations: Bounded size; any "insert" operation has a pessimistic linear time cost.
49
Linked Lists
Singly Linked (SList): Nodes contain an element and a pointer to the next node.
Doubly Linked (DList): Nodes contain pointers to both the next and previous nodes, allowing bidirectional navigation.
Cyclic Lists: The last node links back to the first one.
Trade-offs: Fast relative operations (e.g., "insert after") in constant time, but slow linear-time absolute access and extra memory for pointers.
50
Abstract Data Structures (ADS)
An ADS is defined by its interface (supported operations) rather than its concrete implementation.
51
Stack
A "LIFO" (Last In, First Out) structure with push, pop, and top operations.
52
Queue
A "FIFO" (First In, First Out) structure with inject, out, and front operations.
53
Deque
A Double Ended Queue that generalizes both stack and queue, allowing operations at both ends.
54
Amortized Complexity Analysis
This method analyzes the total cost of a sequence of $m$ operations rather than evaluating each operation in isolation. It is useful when some operations are expensive but rare.
55
Potential Function Method
Assigns a non-negative potential $\Phi$ to the state of the structure; the amortized cost of the $i$-th operation is $a_i = t_i + \Phi_i - \Phi_{i-1}$.
56
Accounting Method
"Credits" are assigned to objects during cheap operations to pay for expensive future operations.
57
Total Cost Method
Simply computes the total cost of $m$ operations to find the average.
58
Unbounded Arrays
Unbounded arrays emulate dynamic growth using a bounded array that is reallocated when full.
Growth Rule: If the array is full ($n = w$), allocate an array $2\times$ larger and copy the elements.
Shrink Rule: If the number of elements is significantly smaller than the capacity (e.g., $n = w/4$), reallocate to a smaller array.
Complexity: While the pessimistic cost of a single pushBack is $O(n)$, the amortized cost over a sequence of $m$ operations is constant, $O(1)$.
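The growth rule can be sketched directly; a minimal Python class (class and method names are illustrative; the shrink rule is omitted for brevity):

```python
class UnboundedArray:
    """Doubling-based unbounded array: pushBack costs O(n) when a
    reallocation happens, but O(1) amortized over any push sequence."""
    def __init__(self):
        self.capacity = 1
        self.size = 0
        self.data = [None] * self.capacity

    def push_back(self, x):
        if self.size == self.capacity:       # full: allocate 2x and copy
            self.capacity *= 2
            new_data = [None] * self.capacity
            new_data[:self.size] = self.data
            self.data = new_data
        self.data[self.size] = x
        self.size += 1
```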
59
Comparative Analysis of Sequence Operations
Costs per operation, by implementation (SList, DList, UArray = unbounded array, CArray = cyclic array):
Indexing [.]: SList O(n), DList O(n), UArray O(1), CArray O(1)
First/Last: SList O(1), DList O(1), UArray O(1), CArray O(1)
Insert/Remove: SList O(1)¹, DList O(1)², UArray O(n), CArray O(n)
pushBack: SList O(1), DList O(1), UArray O(1)³, CArray O(1)³
popFront: SList O(1), DList O(1), UArray O(n), CArray O(1)³
60
¹ Only insertAfter/removeAfter. ² Requires an external pointer. ³ Amortized cost.
61
Definition of Priority Queue (PQ)
A Priority Queue is an Abstract Data Structure where each element has an associated "priority." It differs from a standard queue because it does not follow the FIFO rule; instead, it allows access to the element with the highest (or lowest) priority.
Core Operations:
insert(T e): Adds a new element with an assigned priority.
findMin(): Returns the element with the minimum priority.
delMin(): Returns and deletes the element with the minimum priority.
62
Naive Implementations and Complexities
Unsorted Sequence (List/Array): insert: O(1); delMin: O(n).
Sorted Sequence (List/Array): insert: O(n); delMin: O(1).
63
Binary Heaps
A Binary Heap is a complete binary tree that satisfies the heap-order condition: for every non-root node $x$, the priority of the parent is less than or equal to the priority of $x$.
Array Representation: Because it is a complete tree, it can be stored in an array without pointers. If the root is at index 1: parent[i] = i/2, left_child[i] = 2i, right_child[i] = 2i + 1.
Operations:
insert: Add at the bottom and perform upheap — $O(\log n)$.
delMin: Move the bottom element to the root and perform downheap — $O(\log n)$.
construct: Building a heap from an unsorted sequence can be done in $\Theta(n)$ time.
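A minimal min-heap sketch with upheap/downheap. Note it stores the root at index 0 (so parent = (i-1)//2), a common variant of the index-1 scheme above:

```python
class BinaryHeap:
    """Min-heap in an array; root at index 0, so parent(i) = (i-1)//2."""
    def __init__(self):
        self.a = []

    def insert(self, x):                  # upheap: O(log n)
        self.a.append(x)
        i = len(self.a) - 1
        while i > 0 and self.a[(i - 1) // 2] > self.a[i]:
            self.a[i], self.a[(i - 1) // 2] = self.a[(i - 1) // 2], self.a[i]
            i = (i - 1) // 2

    def del_min(self):                    # downheap: O(log n)
        a = self.a
        a[0], a[-1] = a[-1], a[0]         # move bottom element to the root
        m = a.pop()
        i, n = 0, len(a)
        while True:
            c = 2 * i + 1                 # left child
            if c >= n:
                break
            if c + 1 < n and a[c + 1] < a[c]:
                c += 1                    # pick the smaller child
            if a[i] <= a[c]:
                break
            a[i], a[c] = a[c], a[i]
            i = c
        return m
```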
64
Applications of Priority Queues
HeapSort: A sorting algorithm with $\Theta(n \log n)$ time complexity. By placing each extracted minimum in the space released at the end of the array, it can achieve $O(1)$ space complexity.
Greedy Algorithms: Priority queues are typically used to select the "next best" element efficiently, as in Huffman Codes, Dijkstra's shortest-path, and Prim's minimum spanning tree algorithms.
65
Extended Priority Queues
Addressable PQ: Supports decreaseKey(handle, newPriority) and delete(handle) using a handle to the element.
Mergeable PQ: Supports merge(PQ1, PQ2) to combine two queues into one.
66
Binomial Heaps
A Binomial Heap is a collection of Binomial Trees sorted by degree.
Properties: An $n$-element heap has $O(\log n)$ trees.
Efficiency: Its primary advantage is the fast merge operation, which takes $O(\log n)$ time, analogous to binary addition. All other standard operations (insert, delMin, delete) also take $O(\log n)$ time.
67
Costs per operation, by implementation:
insert: Unsorted O(1), Sorted O(n), Binary Heap O(log n), Binomial Heap O(log n)
findMin: Unsorted O(n), Sorted O(1), Binary Heap O(1), Binomial Heap O(log n)
merge: Unsorted O(1), Sorted O(n), Binary Heap O(n), Binomial Heap O(log n)
68
Dictionary Definition
A Dictionary is an Abstract Data Structure (ADS) that represents a mapping from keys to values. It primarily supports three operations:
search(K key): Returns the value associated with the given key.
insert(K key, V value): Adds a new key-value pair.
delete(K key): Removes the entry associated with the key.
69
Hashing and Hashtables
Hashtables provide fast dictionary operations by using a hash function ($h: U \rightarrow [0..m-1]$) to map large keys into a smaller array of size $m$.
70
Collision Handling
When two keys map to the same index ($h(k_1) = h(k_2)$), a collision occurs.
71
Chain Method
Each array index points to a linked list of elements that hash to that position. Under a uniform load assumption, it guarantees average $O(1)$ time if $m = O(n)$.
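A minimal chained hashtable sketch (class name, fixed table size `m=8`, and the `None` not-found convention are illustrative; Python lists stand in for linked lists):

```python
class ChainHashtable:
    """Dictionary via chaining: each slot holds the (key, value) pairs
    whose keys hash to that index. Average O(1) if m = O(n)."""
    def __init__(self, m=8):
        self.m = m
        self.slots = [[] for _ in range(m)]

    def insert(self, key, value):
        chain = self.slots[hash(key) % self.m]
        for i, (k, _) in enumerate(chain):
            if k == key:                  # overwrite an existing key
                chain[i] = (key, value)
                return
        chain.append((key, value))

    def search(self, key):
        for k, v in self.slots[hash(key) % self.m]:
            if k == key:
                return v
        return None                       # key not present
```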
72
Open Hashing
All elements are stored in the array itself. If a position is occupied, the algorithm scans for a free index using strategies like linear probing, quadratic probing, or double hashing.
73
Special Techniques
Universal Hashing: Randomly picking a hash function from a specific family to avoid "malicious" data patterns that cause many collisions.
Perfect Hashing: A scheme that guarantees worst-case constant time $O(1)$ for searching.
74
Ordered Dynamic Sets
An extension of the dictionary that also supports operations based on the order of keys, such as minimum(), maximum(), successor(key), and predecessor(key).
75
Binary Search Tree (BST)
Condition: For every node, all keys in the left subtree are smaller and all keys in the right subtree are larger.
Complexity: Guarantees average $O(\log n)$ time for all operations. However, the worst case is linear, $O(n)$, if the tree becomes unbalanced (e.g., degenerates into a "vine").
76
AVL Tree
A self-balancing BST where the height difference between the left and right subtrees of any node is at most 1. Guarantees worst-case $O(\log n)$ time through the use of rotations to maintain balance.
77
Self-organizing BST (Splay Tree)
Uses a splay operation (a sequence of rotations) to move frequently accessed or recently modified keys to the root. Guarantees amortized $O(\log n)$ complexity.
78
Comparative Analysis of Implementations
Search costs and order-operation support, by implementation (Avg / Worst / Order Operations?):
Unordered Sequence: O(n) / O(n) / No
Direct Addressing: O(1) / O(1) / Yes (but limited)
Hashtable (Chain): O(1) / O(n) / No
BST: O(log n) / O(n) / Yes
AVL Tree: O(log n) / O(log n) / Yes
Self-org. BST: O(log n)* / O(log n)* / Yes
(* amortized)
79
Undirected Graph
An ordered pair $G=(V,E)$, where $V$ is a set of vertices and $E$ is a set of edges. Each edge $e=\{v,w\}$ is an unordered pair of vertices.
80
Directed Graph (Digraph)
An ordered pair $G=(V,E)$, where each edge (arc) is an ordered pair $(v,w)$, indicating direction from $v$ to $w$.
81
Degree of a Vertex
In an undirected graph, the number of edges incident to a vertex $v$, denoted $\deg(v)$.
82
In-degree and Out-degree
In a digraph, $\deg^-(v)$ is the number of incoming arcs and $\deg^+(v)$ is the number of outgoing arcs.
83
Handshaking Lemma
The sum of all vertex degrees in an undirected graph is equal to twice the number of edges.
84
Isomorphism
Definition: Two graphs $G$ and $H$ are isomorphic if there is a bijection between their vertex sets that preserves adjacency.
Properties: Isomorphic graphs must have the same number of vertices, the same number of edges, and the same degree sequence.
85
Adjacency Matrix
A square matrix where the entry at $(i, j)$ is 1 if there is an edge between vertex $v_i$ and $v_j$, and 0 otherwise.
86
Incidence Matrix
A matrix showing the relationship between vertices (rows) and edges (columns).
87
Graphs vs. Relations
A digraph is a geometric representation of a binary relation on the set $V$.
88
Path
A sequence of vertices where each adjacent pair is connected by an edge.
89
Cycle
A path that starts and ends at the same vertex.
90
Connectedness
An undirected graph is connected if there is a path between every pair of vertices.
91
Strongly vs. Weakly Connected
A digraph is strongly connected if there is a directed path between every pair of vertices. It is weakly connected if the underlying undirected graph is connected.
92
Tree
A connected undirected graph with no cycles.
93
Rooted Tree
A tree in which one vertex is designated as the root.
94
Binary Tree
A rooted tree where each internal node has at most two children.
95
Characteristics
You should be able to specify the height, depth, and number of leaves for a given rooted tree.
96
Directed Graph (Digraph)
Defined as $G = (V, E)$, where $V$ is a set of vertices and $E \subseteq V \times V$ is a set of arcs (ordered pairs).
97
Undirected Graph
Similar to a digraph, but edges are unordered pairs $\{u, v\}$.
98
Terminology
An edge $e = (u, v)$ is said to be incident to $u$ and $v$, while $u$ and $v$ are adjacent to each other.
99
Variants
Self-loops: Generally not allowed.
Multi-graph: Allows multiple edges between the same vertices.
Hypergraph: A generalization where edges are $n$-tuples rather than pairs.
100
General Tree
A connected, undirected graph with no cycles.
101
Rooted Tree
A tree with one designated vertex called the root; it is often viewed as a directed graph where arcs point away from the root.
102
Binary Tree
A rooted tree where each node has at most two children (left and right).
103
Adjacency Matrix
A 2D array where $A[i][j] = 1$ if an edge exists from $v_i$ to $v_j$.
Pros: Constant time $O(1)$ to check whether an edge exists.
Cons: High space complexity $O(V^2)$; inefficient for sparse graphs.
104
Adjacency List
An array of lists where list $i$ contains the neighbors of $v_i$.
Pros: Space-efficient for sparse graphs, $O(V+E)$.
Cons: Checking for a specific edge $(u, v)$ takes $O(\deg(u))$ time.
105
Binary Tree Traversals
These recursive methods define the order in which nodes are visited:
Pre-order: Root → Left Subtree → Right Subtree.
In-order: Left Subtree → Root → Right Subtree.
Post-order: Left Subtree → Right Subtree → Root.
106
Breadth-First Search (BFS)
Method: Explores neighbors layer by layer using a queue.
Properties: Computes the shortest path (minimum number of edges) from the source to all reachable vertices.
Complexity: $O(V + E)$.
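A minimal BFS sketch returning edge-count distances (the `bfs_distances` name and the adjacency-dict representation are illustrative):

```python
from collections import deque

def bfs_distances(adj, s):
    """Layer-by-layer exploration with a queue; dist[v] is the minimum
    number of edges from s to v. Runs in O(V + E)."""
    dist = {s: 0}
    q = deque([s])
    while q:
        u = q.popleft()
        for v in adj[u]:
            if v not in dist:            # first visit = shortest layer
                dist[v] = dist[u] + 1
                q.append(v)
    return dist
```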
107
Depth-First Search (DFS)
Method: Explores as far as possible along each branch before backtracking; uses a stack or recursion.
Timestamps: Each vertex $v$ receives a discovery time $v.d$ and a finishing time $v.f$.
Edge Classification:
Tree edges: Part of the DFS forest.
Back edges: Connect to an ancestor in the tree (indicate cycles).
Forward edges: Connect to a descendant.
Cross edges: Connect nodes in different branches or trees.
108
BFS vs. DFS Comparison
Data Structure: BFS uses a Queue (FIFO); DFS uses a Stack (LIFO) or recursion.
Primary Use: BFS for shortest paths (edge count); DFS for connectivity, cycle detection, and topological sorting.
Complexity: Both $O(V+E)$.
109
Topological Sort
An ordering of vertices in a Directed Acyclic Graph (DAG) such that for every arc $(u, v)$, $u$ comes before $v$.
High-level idea: Run DFS to compute finishing times $v.f$ for all vertices, then sort the vertices in decreasing order of finishing time.
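The DFS-based idea can be sketched directly: append each vertex when it finishes, then reverse. Naming is illustrative; the input is assumed to be a DAG given as an adjacency dict:

```python
def topological_sort(adj):
    """DFS-based topological sort of a DAG: output vertices in
    decreasing order of DFS finishing time."""
    order, visited = [], set()

    def dfs(u):
        visited.add(u)
        for v in adj.get(u, []):
            if v not in visited:
                dfs(v)
        order.append(u)          # u "finishes" here

    for u in adj:
        if u not in visited:
            dfs(u)
    return order[::-1]           # reverse finishing order
```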
110
Strongly Connected Components (SCCs)
Subsets of a digraph where every node is reachable from every other node in the subset.
DFS Application: Run DFS, reverse all arcs, then run DFS again in decreasing order of the first run's finishing times.
111
Minimum Spanning Tree (MST) Problem
Definition: Given an undirected connected graph $G=(V, E)$ with positive weights on edges, find a tree $T=(V, E')$ that connects all vertices in $V$ such that the total sum of edge weights is minimized.
Spanning Property: The resulting subgraph must be a tree (no cycles) and must "span" all original vertices.
Input: An undirected connected graph with a weight function $w: E \rightarrow \mathbb{R}^+$.
112
Cut Property
For any cut in the graph (a partition of vertices into two sets), the edge with the minimum weight crossing that cut belongs to the MST.
113
Cycle Property
For any cycle in the graph, the edge with the maximum weight in that cycle does not belong to the MST.
114
Prim's Algorithm
Idea: Grows the MST one vertex at a time, starting from an arbitrary root.
Mechanism: It maintains a connected tree and always adds the cheapest edge that connects a vertex in the tree to a vertex outside the tree.
Similarity: It is very similar to Dijkstra's algorithm, but uses edge weights as priorities instead of total distances to the source.
Suitability: Generally a good choice for most cases, especially dense graphs.
115
Kruskal's Algorithm
Idea: Grows the MST by considering edges in increasing order of weight.
Mechanism: It adds the next cheapest edge as long as it does not create a cycle. The partial solution is a forest (a collection of trees) that eventually merges into a single MST.
Efficiency: Can be faster than Prim's on sparse graphs where $m = O(n)$.
Streaming Mode: It can work "on-line" as edges arrive through a network, even if they aren't pre-sorted.
116
Union-Find Abstract Data Structure
Purpose: Essential for the efficient implementation of Kruskal's algorithm to keep track of connected components and detect cycles.
Operations:
Find(x): Determines which component element $x$ belongs to.
Union(x, y): Merges two separate components into one.
Fast Implementation: Uses techniques like "union by rank" and "path compression" to achieve nearly constant amortized time per operation.
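Both techniques fit in a short class; a minimal Python sketch (the boolean return of `union`, used for cycle detection, is an illustrative convention):

```python
class UnionFind:
    """Union by rank + path compression: near-constant amortized time."""
    def __init__(self, n):
        self.parent = list(range(n))
        self.rank = [0] * n

    def find(self, x):
        if self.parent[x] != x:
            self.parent[x] = self.find(self.parent[x])  # path compression
        return self.parent[x]

    def union(self, x, y):
        rx, ry = self.find(x), self.find(y)
        if rx == ry:
            return False                 # same component: edge closes a cycle
        if self.rank[rx] < self.rank[ry]:
            rx, ry = ry, rx
        self.parent[ry] = rx             # attach the shorter tree (by rank)
        if self.rank[rx] == self.rank[ry]:
            self.rank[rx] += 1
        return True
```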
117
The Shortest Paths Problem
Definition: Given a graph $G=(V, E)$ with edge weights $w: E \rightarrow \mathbb{R}$ and a starting node $s$, find the shortest distance $\mu(s,v)$ and the parent node for every vertex $v$ to reconstruct the path. Non-existence: A shortest path may not exist if there is no path from $s$ to $v$, or if the graph contains a negative cycle, which allows for infinitely decreasing path lengths. Property: A subpath of a shortest path is itself a shortest path.
118
The Concept of Relaxation
The Idea: Most shortest-path algorithms use edge relaxation to iteratively improve the currently known shortest distance to a node. Relaxation Operation: For an edge $(u,v)$, if the current distance to $u$ plus the weight of the edge $(u,v)$ is less than the current distance to $v$, update $v$'s distance and set $u$ as its parent. Key Lemma: After a sequence of relaxations that includes a shortest path as a subsequence, the node's distance attribute will equal the actual shortest-path distance.
119
Variants and Algorithms
Depending on the graph's properties, different algorithms are used for maximum efficiency:
120
Case 1: Directed Acyclic Graph (DAG) Algorithm
Topological Sort followed by a single pass of relaxation. Complexity: Linear time, $O(V + E)$.
121
Case 2: Non-negative Weights (Dijkstra's Algorithm)
Mechanism: A greedy approach that uses a priority queue to always scan the unscanned node with the minimum distance.
Complexity: $O((V + E) \log V)$ with a Binary Heap; $O(E + V \log V)$ with a Fibonacci Heap.
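A compact Dijkstra sketch with a binary heap (the lazy-deletion trick of skipping stale queue entries is a common variant; naming and the adjacency-dict format are illustrative):

```python
import heapq

def dijkstra(adj, s):
    """Dijkstra with a binary heap: O((V + E) log V).
    adj maps u -> list of (v, weight) with non-negative weights."""
    dist = {s: 0}
    pq = [(0, s)]
    while pq:
        d, u = heapq.heappop(pq)
        if d > dist.get(u, float("inf")):
            continue                     # stale entry: u already settled
        for v, w in adj.get(u, []):
            if d + w < dist.get(v, float("inf")):
                dist[v] = d + w          # relaxation of edge (u, v)
                heapq.heappush(pq, (dist[v], v))
    return dist
```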
122
Case 3: Arbitrary Weights (Bellman-Ford Algorithm)
Mechanism: Relaxes all edges $V-1$ times. This is necessary because the shortest path can have at most $V-1$ edges. Cycle Detection: It can detect negative cycles by checking if any edge can still be relaxed after $V-1$ iterations. Complexity: $O(V \cdot E)$.
123
(*) All-Pairs Shortest Paths (Johnson's Algorithm idea)
Goal: Find shortest paths between all possible pairs of nodes. Mechanism for graphs with no negative cycles: Add an artificial node and run Bellman-Ford once to compute "node potentials". Use these potentials to reduce weights so they are all non-negative. Run Dijkstra's algorithm $V$ times (once from each node as a source). Complexity: $O(V \cdot E + V^2 \log V)$.
124
Invariant
Never changing; a property that remains true throughout a computation.
125
Loop Invariant
A loop invariant is a condition or statement about program variables that remains true before and after every iteration of a loop, acting as a crucial checkpoint to prove an algorithm's correctness and understand its behavior. It helps verify that a loop achieves its goal by maintaining a specific property (e.g., a sub-array is sorted, a sum is correct) throughout its execution, even as individual variables change.
126
Hand-shake Theorem
Definition: In any undirected graph, the sum of the degrees of all vertices is equal to twice the number of edges.
Formula: $\sum_{v \in V} \deg(v) = 2|E|$.
Implication: Every edge has two endpoints, so it contributes exactly 2 to the total degree count of the graph.
127