What does a filesystem keep track of?
Files: Where the data is actually stored
Directories: Groups of files
Metadata: Information on the format and permissions related to a file
What is the primary job of a filesystem?
To make sure data is always accessible and intact.
What is the storage model of the Google File System (GFS)?
How do reads work in GFS?
How do writes work in GFS?
Does the master keep a persistent record of chunk locations? If not, what does it do?
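No. It keeps chunk locations only in memory. It asks each chunkserver which chunks it holds at startup, and stays up to date afterwards through periodic polling (HeartBeat messages).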
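A minimal sketch of that idea, not GFS's actual code: the master holds an in-memory map that is rebuilt from chunkserver reports. The names Master, report, and lookup are hypothetical, and each report is assumed to list every chunk the server currently holds.

```python
from collections import defaultdict


class Master:
    """Toy master that keeps chunk locations only in memory (nothing on disk)."""

    def __init__(self):
        # chunk handle -> set of chunkserver ids currently reporting a replica
        self.locations = defaultdict(set)

    def report(self, server_id, chunk_handles):
        """Apply a full report from one chunkserver (at startup or on a poll)."""
        reported = set(chunk_handles)
        # Forget this server for any chunk it no longer reports.
        for handle, servers in self.locations.items():
            if handle not in reported:
                servers.discard(server_id)
        # Record every chunk the server says it holds.
        for handle in reported:
            self.locations[handle].add(server_id)

    def lookup(self, handle):
        """Return the servers believed to hold the chunk, in sorted order."""
        return sorted(self.locations.get(handle, ()))


master = Master()
master.report("cs1", ["chunk-a", "chunk-b"])  # startup report from cs1
master.report("cs2", ["chunk-b"])             # startup report from cs2
master.report("cs1", ["chunk-a"])             # later poll: cs1 lost chunk-b
print(master.lookup("chunk-b"))
```

Because the map is rebuilt from reports, a restarted master in this sketch only needs to hear from the chunkservers again. It does not have to keep a stored copy in sync with what is really on their disks.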
What happens when a node fails in GFS?
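How does the master know that a chunk replica it intends to use is up to date?
The master maintains a version number for each chunk to distinguish up-to-date replicas from stale ones. Before an operation on a chunk, the master makes sure the version is advanced on the replicas that are reachable, so a replica that was down at the time keeps an older number and can be recognized as stale.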
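The version check can be sketched as follows. This is an illustration under assumptions, not the real protocol: VersionedMaster, Replica, advance_version, and is_stale are hypothetical names, and the version is simply an integer that starts at 0.

```python
class Replica:
    """Toy chunkserver replica that remembers the version of each chunk."""

    def __init__(self, name):
        self.name = name
        self.versions = {}  # chunk handle -> version this replica has seen


class VersionedMaster:
    """Toy master that tracks the latest version of every chunk."""

    def __init__(self):
        self.versions = {}  # chunk handle -> latest version known to master

    def advance_version(self, handle, reachable_replicas):
        """Bump the chunk version and tell only the replicas that are reachable."""
        new_version = self.versions.get(handle, 0) + 1
        self.versions[handle] = new_version
        for replica in reachable_replicas:
            replica.versions[handle] = new_version
        return new_version

    def is_stale(self, handle, replica):
        """A replica is stale if its version is behind the master's."""
        return replica.versions.get(handle, 0) < self.versions.get(handle, 0)


master = VersionedMaster()
r1, r2 = Replica("cs1"), Replica("cs2")
master.advance_version("chunk-a", [r1, r2])  # both replicas reachable
master.advance_version("chunk-a", [r1])      # cs2 is down and misses the bump
print(master.is_stale("chunk-a", r1), master.is_stale("chunk-a", r2))
```

In this sketch the replica that missed the second bump is the only one reported as stale, so the master would know not to use it for the next operation.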
How consistent is GFS?
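It has a relaxed consistency model. Namespace changes such as file creation are atomic, but concurrent writes to the same region can leave it consistent across replicas yet undefined in content. Record appends are atomic and happen at least once.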