Study Guide Flashcards

Question

25. Explain how standing queues develop in network buffers at bottleneck links. Why is a standing queue NOT correctly identified as congestion?

Answer 1

Queues develop at bottleneck links as a result of the bottleneck’s reduced forwarding speed. As some of the packets in the queue are forwarded, the TCP sender will begin to receive ACKs and send more packets, which arrive at the bottleneck link buffer, refilling the queue. The difference in the bottleneck link speed and the link RTT (driving the congestion window of the TCP flow) will result in a certain number of packets consistently occupying the buffer, until the flow completes, which is referred to as the standing queue. Standing queues are NOT congestion because it results from a mismatch in congestion window and the bottleneck link size. A standing queue can develop in single flow environments, and under usage limits that would eliminate actual congestion.

Answer 2

CoDel assumes that a standing queue of the target size is acceptable, and that at least one maximum transmission unit (MTU) worth of data must be in the buffer before preventing packets from entering the queue (by dropping them). CoDel monitors the minimum queue delay experienced by allowed packets as they traverse the queue (by adding a timestamp upon arrival). If this metric exceeds the target value for at least one set interval, then packets are dropped according to a control law until the queue delay is reduced below the target, or the data in the buffer drops below one MTU. Dropping a flow’s packet triggers a congestion window reduction by the TCP sender, which helps to eliminate buffer bloat.

Answer 3

The HEAD method in HTTP requests a document just like the GET method except that the server will respond to a HEAD request with only the HTTP response header; the response body (which would normally contain the document data) is not included. This saves the delay of transmitting the actual document, e.g., if it is a large file, but allows the browser to check the Last-Modified field in the response header to find out if it's been changed since the time when the cached version was retrieved.

Answer 4

a. 404 Not Found ? The requested file does not exist on the server. That is, the file indicated by the path part of the GET method line cannot be found at that path. b. 302 Moved Temporarily (also sometimes called 302 Found) ? The requested file is not at this location (i.e., the path part of the GET method line), but the browser should instead use the URL provided in the Location field of the response to retrieve the file. However, the file may be found at this location in the future (unlike a Moved Permanently response), so the URL in the Location field should be used this once, but not necessarily again in the future. c. 200 OK ? The operation in the request message succeeded. What that operation is exactly depends on the request method. For example, if the request method was GET then 200 OK means that the document was retrieved and its content should be in the body of the 200 OK response. (200 OK responses to other methods do not necessarily contain a body, though. This also depends on what the method was.)

Answer 5

a. Last-Modified This is the date and time that the requested document file was last modified on the server. It can be used to check if a cached copy is fresh (newer than the Last-Modified time) or stale (older than the Last-Modified time, indicating that it's been changed since the cached copy was retrieved). b. Host This is the domain name of the web request (e.g., from the domain part of the URL). One way this may be used is if a single web server (with a single IP address) is hosting websites for more than one domain. The web server can check the Host field to see which domain's pages should be retrieved for each request it gets. c. Cookie This is included in request messages that are sent to a domain that previously gave the browser a cookie. That cookie would have been provided by the Set-Cookie field in a response message, and after that (until the cookie expires) the browser should include the exact same cookie given by Set-Cookie in any request message it sends to the same domain. This allows the server to know that a request is coming from the same client that made another earlier request. For example, when you request to view your shopping cart, the web server may use cookies to know that you are the same person who earlier clicked on an item to add to your cart, so it can show you a cart containing that item.

Answer 6

DNS-based redirection is much faster than HTTP redirection, as the latter requires a couple extra round trips to servers. (It's actually more than just one extra round trip because you need to establish a TCP connection to a second different server.) It also gives the CDN provider more control over who will be redirected where than a technique like IP anycast would. Finally, it is not too difficult to implement (even if slightly more complex than the other two) and it uses tools that are widely supported (i.e., DNS) and do not need any modifications to support this technique (i.e., DNS works out of-the-box).

Answer 7

A BitTorrent client sends data only to the top N peers who are sending to it, plus one peer who is optimistically unchoked. Let's say for example purposes that N=4. Your BitTorrent client will choose the 4 peers who are sending to it at the fastest rate and it will send data to them in return. It will not send to other peers, and they are said to be choked. Thus it provides tit-for-tat by sending to those who send the most to it, and choking those that are not sending to it, or are sending slowly. However, this creates a problem where two peers who might be able to send to each other are mutually choked. Neither will begin sending to the other because the other is not sending to it. Therefore, each client will optimistically unchoke one peer at any given time for a brief period. If the client sends fast enough to the optimistically unchoked client to get on its top-4 then the peer will send data back in return. If the client receives enough data from the peer for it to be in the top-4 then that peer becomes one of the new top-4 and the slowest of the previous top-4 will be choked. Thus they both end up in each other’s top-4. (The peer is no longer "optimistically" unchoked, and is merely unchoked. A new peer is selected to be optimistically unchoked.) On the other hand, if the client does not get into its peer's top-4, or if it does but the peer does not send fast enough in return to get in the client's top-4, then they will not end up in each other’s top-4. After some time, the client will stop optimistically unchoking that peer and stop sending to it. It will choose a new peer to optimistically unchoke. This process repeats forever (until the client has the entire file, that is) in order to keep exploring different peers for better matches than the client's current top-N. The game theoretic result is that clients will end up sending to peers that are able to send back about the same amount – fast peers will get paired up, while slow peers are matched with each other. This happens because a fast peer will readily drop a slow peer from its top-N in favor of another fast peer, matching fast peers together. Slow peers will not get matched with fast peers because the fast peers will soon learn to choke them, but they will pair up with other slow peers because neither peer can find a better match who is willing to unchoke them.

Answer 8

A lookup will require O(N) hops in this case. Suppose a constant size of 1, as an example. Each node only knows how to find the next one, so it basically forms a ring topology. In the worst case, the requested item is on the last node in the ring before getting back to the node that originated the request. So the request has to go all the way around the ring, taking N-1 hops. Based on similar reasoning, if a larger, constant number of nodes is in the finger table, a proportionately smaller amount of time may be required. However, for any given constant size finger table, as the number of nodes in the system grows, the number of hops required will still be on the order of O(N).

Answer 9

O(log N) entries in the finger table means that each node knows about the node halfway around the ring back to it, about the node halfway to that one, the one halfway to that one, and so on until the last entry in the finger table that is just the next node. This means that for any given item that could be on any node, each node knows the address of at least one node that is at least half way around the ring from itself to the item. Since each hop cuts the distance to the item in half, the number of hops required to get to the item from any starting point in the DHT is O(log N). (This should be understood by analogy to binary search, divide-and-conquer, etc.)

Answer 10

1. IS-IS control – this is used to calculate routes that allow routers to later forward data packets, but does not carry data for any application 2. IP data – the actual IP packets that are forwarded by routers are the packets that contain application data 3. UDP data – similar to b), these UDP packets contain application data 4. DHCP control – this is used to automatically assign IP addresses to end hosts (and sometimes subnet and DNS server locations as well), which is required for that end host to then be able to send and receive data packets, but DHCP messages do not contain any application data themselves 5. 802.11 (Wi-Fi) data – this is a link layer protocol that carries data for applications or higher level protocols (which would be considered “data” by the link layer, even if they are not data at the application layer)

Answer 11

One scenario in which SDN is helpful is when something breaks in the network (at the software/configuration level). Since the control plane is separate and policies are centralized in the SDN controller, it is easier to see the “big picture” of what your network configuration is actually doing and you can find and fix problems more easily. Another scenario is when you want to update your network. Instead of buying all new hardware to get the latest control plane features, you simply update your software in the SDN controller. Similarly, updating policies is easier as you just update the configuration expressed by the DNS controller software, and you don't have to go around to each network device and update its individual piece of the global policy separately (and hope that you didn't miss one or accidentally misconfigure one in the process!). Finally, SDN is useful in research or testbed network. Because SDN is flexible, you can create new control techniques or try different policies to experiment with them, without having to build a new piece of hardware that implements the new behavior. This is useful not only because experiments to test new ideas are easier and less expensive, but also because it allows rapid iteration on those ideas if it becomes apparent some further refinement on the idea is needed.

Answer 12

Network virtualization is useful in multi tenant data centers (or “the cloud”) in order to provide each tenant with the illusion that they have a private network connecting their servers/Vms, and possibly to allow them some ability to configure their virtual network without affecting other tenants. It is also useful in R&D environments (e.g., universities or anyplace else research is done) in order to isolate networking experiments from the rest of the general-purpose traffic in their enterprise. Thus experimental techniques that be tried without causing problems for the rest of the network. Finally, it is useful in computer networking classes. Similar to the research scenario, we want to try doing some different things that may not be a good idea on the production network in order to learn more about how networking works. Virtualization allows us to try things without breaking the network for everyone else. Other answers may be possible, but these are three major use cases. Network virtualization is generally bad in situations where you can consider it overkill, or where the costs outweigh the benefits. For example your home network connecting to your ISP or the corporate network you use at work are poor candidates for network virtualization. Networks that are highly sensitive to latency are not good candidates. For example systems with system critical cyber-physical devices would likely not be able to trade a layer of virtualization to obtain the flexibility provided. For example, a network of hosts and physical devices used to launch manned space vehicles, conduct air traffic control, control a nuclear reactor, etc. are bad candidates for network virtualization.

Answer 13

The Pyretic API provides a high-level abstraction for SDN programmers. The OpenFlow API exposed by devices supporting it is a low level API, on the level of assembly language. It is inordinately difficult to develop sophisticated SDN applications with the OpenFlow API. Additionally, the Pyretic runtime provides an efficient runtime that automatically installs generated low level rules on hardware devices throughout the network.

Answer 14

First, using the Pyretic API, the programmer specifies a high level network policy. The Pyretic runtime connects via sockets to OpenFlow clients on the network. The Pyretic runtime interprets packets traversing these network clients against the policy, and using its socket connection installs OpenFlow rules to implement policy behavior. Additionally, these connections allow the Pyretic runtime to perform other actions, like proactively installing rules to reduce network latency, reading counters, etc.

Answer 15

A. flood() Returns one packet per local port on the network spanning tree. B. match(dstip=‘192.168.1.15’) & match(srcip=‘192.168.1.120’) Two separate match predicates are composed, the result matches any packet that has destination IP = 192.168.1.15 and source IP – 192.168.1.120 C. match(dstip=‘10.0.0.8’) >> fwd(12) A single match predicate sequentially composed with another, the result of which matches packets any packet bound for IP 10.0.08 and forwards it along port 12. This effectively “filters out” all traffic not bound for IP 10.0.0.8. D. match(dstip= ‘10.0.0.1’) >> ( match(srcip=‘10.0.0.15’) >> drop() + match(srcip= ‘10.0.0.25’) >> modify(dstip=‘10.0.0.30’) ) This policy implements a complex policy. First, all traffic not bound for IP 10.0.0.1 is filtered. Any packets bound for 10.0.0.1 is then subject to parallel composition. If the packet is from IP 10.0.0.15, it is dropped. If the packet is from 10.0.0.25, it is returned, with the destination IP rewritten to 10.0.0.30.

Answer 16

-What are the two things that need to be measured, and how could each be measured? We need to measure the topology, including not only the connectivity but also the capacity of each link and router. This could be done by routers self-reporting, similar to how they exchange information in a Link State protocol, but in practice is probably more often simply entered as data by a network engineer. We also need to measure the traffic, or offered load. This can be done using the “simple counters” measurement technique that we learned about earlier, since we want to know how much traffic is on each part of the network but don't necessarily need the details of specific flows. -What are two ways that control could be implemented? The “traditional” way to implement control is by adjusting link weights, which indirectly affects the routes calculated by the routing protocol. In practice, link weights are more often used this way than to represent any “real” property of the network, like bandwidth or link latency. Another way to implement control is by using SDN to directly control the routes that are used on the network.

Answer 17

LOCAL_PREF, the local preference parameter AS_PATH length, as determined by counting the number of ASes in the AS_PATH MULTI_EXIT_DISC, the MED value IGP metric to the NEXT_HOP, i.e., equal “hot potato” routing distance

Answer 18

This changes the flat layer 2 addressing (MAC addresses) into a hierarchical addressing (pseudo-MAC addresses). This means that switches only need to store a forwarding entry for each host in the same pod plus one for each other pod, rather than needing an entry for each host on the entire network. (Notice that hierarchical addressing is the same thing that allows IP to scalable at layer 3, so the idea is to push that concept down into layer 2.)

Answer 19

* Network load balancing – prevents bottleneck links and heavily loaded aggregation or core switches * Higher capacity – since the network is balanced, more hosts can reasonably be hosted on a network with the same number of switches * Shorter paths – shorter average number of hops between any two hosts results in faster network performance * Incremental expansion – allows adding switches to the network without reconfiguring the existing network infrastructure or adding additional “higher-level” switches

Answer 20

* Does not handle heterogeneous switch devices well, except when expanding the network with switches larger than those originally used. * Long cable runs between random switch pairs may be necessary, but are inconvenient and difficult to install

Answer 21

The Fabric Manager is primarily responsible for maintaining network configuration soft state. Using this soft state, the Fabric Manager performs ARP resolution, provides multicast capability to the network, and achieves fault tolerance goals. The Fabric Manager is a user process, running on a dedicated machine. This machine may be located on the network itself, or it can reside on a separate control network.

Answer 22

A PMAC encodes the position of an end host in a fat-tree network. This encoding consists of four components in the format pod.position.port.vmid . The first component encodes the pod number the end host and the edge switch reside in, and the position number encodes the end host’s position in the pod. The port component encodes the switch’s physical port number the end host is attached to. The vmid component encodes a unique ID for each virtual machine that is present on the end host. The edge switch maintains a mapping for each VM, which uses its own AMAC (actual MAC) address. This permits multiplexing of virtual hosts resident on a single physical host. The use of PMACs greatly simplify layer 2 forwarding due to their hierarchical nature. Switches no longer need a forwarding table entry per virtual host. A single forwarding table entry can be used to aggregate hosts, enabling forwarding behavior that exploits longest prefix match. Using AMACs, switch state size is O(n), where n is the number of virtual hosts in the data center, whereas state size is O(k) for PMACs, where k is the number of ports on switches used to construct the fat tree network.

Answer 23

To create a Jellyfish topology, we need to know three values: N, the number of racks / switches, k, the number of ports per switch, and r, the number of ports to be used to connect to other switches. Next, an approximation algorithm is used to generate a RRG (Random Regular Graph) using N, k, and r as input. The result is a blueprint for the Jellyfish topology that can be used to physically cable the switches and servers.

Answer 24

To incrementally add a new server rack, it is not necessary to generate a new RRG with N+1, k, and r. At a high level, we can add the new rack by iteratively selecting connections between other ToR switches (not otherwise connected to the new ToR switch) and replacing that connection with two new connections, each to the new switch. This maintains the previous connectivity of the topology, and also consumes two of the r ports on the new ToR switch dedicated to connecting to other ToR switches. This process is repeated until one or zero or the r ports remain. It is important to note that after expansion, the new topology cannot be expected to be uniformly random, as it would be if a new RRG was created and the entire data center re-cabled appropriately.

Answer 25

A BGP message could be sent to a router by some host (e.g., a remote attacker) that is not the router's legitimate neighbor ○ Note that although BGPSec provides session authentication, this kind of attack can also be prevented without BGPSec by using the “TTL Hack”. An AS could lie about being the origin of a particular subnet (i.e., claiming that the AS contains a subnet when it does not, in fact) ○ BGPSec prevents this by providing certificates that sign the origin claim An AS could lie about the AS-path to a particular subnet (i.e., claiming that there is a path through the AS when there is not, or that there is a more efficient path than there really is) ○ BGPSec prevents this by providing a chain of signed paths, each partial path in the chain being signed by the AS that advertised that part of the path

Answer 26

When an attacker performs a BGP hijack and leaves its own AS out of the path, it can ensure that even traceroute cannot discover it (the missing AS, that is) by simply not decrementing the TTL field on the traceroute when it passes through the attacking AS. To traceroute, it then looks like that AS isn't actually there.

Answer 27

There are several ways this could happen, but the most common is DNS poisoning. The attacker can SPAM DNS responses with a bad domain→IP address mapping to a local DNS server. If the attacker is lucky, or keeps trying for long enough, at some point the local server will issue a query that a SPAMed response could potentially be a legitimate response to (i.e., it has an ID that matches the request). When this happens, the local server will accept the attacker's response and not only reply to the request with the bad mapping but also cache it and use it to respond to new requests for the same domain name. Once the bad entry is in place, hosts that want to reach that domain will instead go to the IP address given by the attacker. The attacker's machine at that IP could then intercept traffic as a MITM or simply spoof the legitimate server (e.g., to collect login credentials).

Answer 28

ARP poisoning works similarly to DNS poisoning, except that there is not ID value that the attacker needs to guess (or SPAM enough guesses that one of them might be right). Not only that, a host will accept an ARP response even if no ARP request for that IP→MAC mapping was ever made – such a response is referred to as a “gratuitous ARP response”. An attacker could send gratuitous ARP responses for a particular IP address to hosts on its local network so that those hosts send messages to the attacker's MAC instead. For example, the attacker may send gratuitous ARP to a host for the network's gateway router, ensuring all packets headed for outside the local network instead come to the attacker's host instead. It could also send a gratuitous ARP to the router for the host's IP address, ensuring that return traffic is also sent to the attacker's host. This is particularly powerful because it's very easy to do and can let an attacker become a MITM for virtually all traffic to/from a target host. It's also a little harder to detect because the IP addresses are still the correct IP addresses for all the machines involved – only the MAC address have changed. The main drawback compared to something like DNS poisoning is that the attacker must be on the same local network as the target in order to do this. However, users connected to the same “public hotspot” are indeed all on the same local network, making them particularly vulnerable to this sort of attack.

Answer 29

The server does not allocate resources for the TCP connection immediately upon receiving a SYN packet, but instead waits for the ACK (final part of the 3-way handshake) to allocate those resources. In order to prevent attackers from simply doing an “ACK flood” instead of a SYN flood, the server's SYN/ACK response to the SYN packet contains a special “SYN cookie” that it uses as the connection's initial sequence number. When the server gets an ACK, it can calculate whether or not the sequence number in that ACK could have been legitimately generated as a SYN cookie. (The ACK number is the SYN cookie +1, so the server subtracts 1 to get the candidate SYN cookies that it tests.) If the ACK sequence number (SYN cookie) checks out, then the server knows 1) that the client has engaged in the entire 3-way handshake, rather than sending spurious ACKs, and 2) the client's IP address given by the IP headers is it's legitimate address, because otherwise it wouldn't have received the SYNACK that contains the SYN cookie. (There are some more details about how exactly it is able to programmatically verify the legitimacy of a SYN cookie extracted from an ACK message without having stored any data after the SYN and SYN/ACK steps, as well as how it is able to prevent replay attacks. However, we'll leave this answer at this moderate level of detail for now.)

Answer 30

The /12 subnet contains 2^(32-12) = 2^20 = 1048676. So 1048576/1048576 = 1 packet to observe.

Answer 31

- BGP does not validate information in routing announcements, so a manipulator can announce any path they want and claim ownership of a victim’s IP prefix. - Origin Authentication uses a trusted database for verification so an AS can’t claim ownership of a victim’s IP prefix, but they can still announce a path that ends at the proper AS, although the path does not physically exist. - soBGP uses origin authentication and a trusted database to guarantee that any path physically exists, but the manipulator can advertise a path that exists but is not actually available. - S-BGP uses path verification, which limits a single manipulator to announcing available paths, but they could announce a shorter, more expensive, provider path while actually forwarding traffic on a cheaper, longer customer path. - Data plane verification prevents an AS from announcing a path and forwarding on another, so the manipulator must actually forward traffic on the path he is announcing. - Defensive filtering polices the BGP announcements made by stubs. With the model in the paper, each provider keeps a prefix list of the IP prefixes owned by its direct customers that are stubs. If a stub announces a path to any prefix it doesn’t own, then it is dropped. In this way, if all providers correctly implement this it eliminated attacks by stubs.

Answer 32

Announcing longer paths can be better than announcing shorter ones. In the example given in Figure 9 of the paper, advertising the shortest path will only pick up traffic from one small provider. Announcing a longer path to the large provider, will attract more traffic overall as the large provider will prefer this path over the shorter, peer path as it will be cheaper. It is better for the manipulator to attract traffic from larger AS. This strategy will work against any secure routing protocol, except when launched by stubs in a network with defensive filtering, because it is only implementing a different export policy than usually used. Announcing to fewer neighbors can be better than announcing to more. In this strategy, by not exporting to certain Tier providers, customer paths to the victim can be eliminated and influential ASes will be forced to choose shorter peer paths over a longer customer path because the customer path was not made known to them. This will work against any secure protocol as it is just using a clever export policy to manipulate traffic. The identity of the ASes on the announce path matters since it can be used to strategically trigger BGP loop detection. With false loop prefix hijack, the manipulator claims an innocent AS originates the prefix to his provider. But when the false loop is announced, BGP loop detection will cause the AS to reject the path, removing the customer path from the network. This will force large ISPs to choose shorter peer paths. Unlike the first two attacks, this one will only work against BGP, origin authentication or soBGP because it involves false advertising of the path announced by an innocent AS.

Answer 33

Malicious ASes change their providers often to avoid being detected or to avoid the negative consequences of their customers activities. Among these providers, they are also known to connect to Providers with lax security policies and / or long response times to abuse complaints. Even still, Malicious ASes have longer periods of downtime, due to depeering from their neighboring ASes and detection avoidance strategies they employ. ASWatch captures these activities by taking snapshots of AS relationships periodically and observing the changes in relationships over time. These activities are then used to feed the reputation engine that identifies malicious ASes.

Answer 34

Malicious ASes conduct a wide variety of abusive actions, many of which can be countered with simple blacklisting. Examples of this would be DoS, spamming, and phishing. If a malicious AS consistently advertises its entire IP address space, it runs a higher risk of having the entire IP space blacklisted when these activities are detected. Small fragments of advertised space allow malicious activities to continue their activities in a fresh IP space fragment when they are blacklisted.

Answer 35

Botnets conducting a crossfire attack do not need to spoof their IP addresses, and as a result defenses based on detecting spoofed IP addresses fail. Additionally, the traffic sent by these botnets to overload links is not unsolicited, the traffic flows from one participating host to another. Furthermore, the attack overloads links in aggregate, meaning many low intensity flows combine to DoS the target links. These links are harder to differentiate from legitimate traffic, which prevents flow monitoring efforts from detecting these attacks.

Answer 36

Rolling attacks are implemented by an attacker to indefinitely continue an attack on a target area. Continuing to flood the same set of target links will ultimately have negative effects on the attack when router failure detection mechanisms are tripped. Additionally rolling attacks will make the crossfire attack even harder to detect by changing the attack vector without changing the overall target area.

Answer 37

Off-path adversaries can’t observe DNS queries and responses. They will trigger specific DNS lookups, but must generate numerous packets in hopes of matching the request the resolver will accept as they must guess the transaction ID and other entropy. On-path adversaries can passively observe the actual lookups requested by a resolver and can directly forge DNS replies. As long as the resolver receives the forged reply before the legitimate one, it will accept the forged reply. In-path adversaries can both block and modify packets and can block the legitimate packet. Hold-On can’t help here as the legitimate packets can be blocked.

Answer 38

Because the legitimate reply cannot be blocked by on-path adversaries, the “Hold-On” period can be used to wait for the legitimate reply to arrive. The stub resolver/forwarded first learns the expected RTT and TTL associated with legitimate traffic to remote recursive resolver. Then after issuing a DNS query, it starts its Hold-On timer. If a DNSSEC-protected response is expected, local signature validation is done for each reply and returns the first fully validated reply to the client or a DNSSEC error if the Hold-On timer expires before one is validated. If there is no DNSSEC, the resolver compares the timing of the reply to the expected RTT and compares the TTL field in the header to the expected TTL. If a reply is validated it will return this reply to the client, but if there are mismatches, it ignores the response and continues to wait. If the timer expires, it will send the last reply received that was not validated.

Answer 39

Network Virtualization refers to abstracting the network away from the physical equipment, which can be accomplished without SDN (we did this in Project 1 using Mininet!). On the other hand, SDN refers separating the control plane from the data plane by using a centralized logic controller. SDN does not necessarily imply Network Virtualization is employed. Network virtualization software like Mininet allows SDNs to be tested in a virtual environment by using logical processes to emulate physical network devices, including OpenFlow capable switches. By emulating the physical equipment, control plane logic for an SDN can be tested without the need for physical equipment and complicated data collection methods.

Study Guide Flashcards

(63 cards)