Define Apache Cassandra.
A distributed NoSQL database designed for handling large amounts of data across many servers.
True or false: Cassandra uses a master-slave architecture.
FALSE
Cassandra employs a peer-to-peer architecture with no single point of failure.
What is the primary data model of Cassandra?
Cassandra uses a wide-column store data model.
Fill in the blank: CQL stands for _______.
Cassandra Query Language
What is a keyspace in Cassandra?
A keyspace is a namespace that defines data replication settings for tables.
Define replication factor.
The number of copies of data stored across the cluster nodes.
True or false: Cassandra supports multi-data center replication.
TRUE
It allows data to be replicated across different geographical locations.
What does Tunable Consistency mean?
It allows users to configure the level of consistency required for read and write operations.
Fill in the blank: Partitioning in Cassandra is done using a _______.
Partition key
What is a column family?
A collection of rows that share the same structure in Cassandra.
Define SSTable.
A file format used by Cassandra to store immutable data on disk.
True or false: Cassandra uses eventual consistency.
TRUE
This means that data will eventually become consistent across all replicas.
What is the purpose of gossip protocol?
To allow nodes to communicate and share information about the state of the cluster.
Fill in the blank: Tombstones are used to mark _______ in Cassandra.
Deleted data
What is a materialized view?
A precomputed query result stored as a table for efficient access.
Define lightweight transactions.
Transactions that provide conditional updates using a consensus protocol.
True or false: Cassandra supports joins between tables.
FALSE
Cassandra does not support joins; data modeling must account for this.
What is the role of a Cassandra node?
A single instance of Cassandra that stores data and participates in the cluster.
Fill in the blank: Data modeling in Cassandra focuses on _______.
Query patterns
What is scalability in Cassandra?
The ability to add more nodes to handle increased load without downtime.
Define compaction.
The process of merging SSTables to optimize storage and read performance.
True or false: Cassandra can handle unstructured data.
TRUE
It is designed to store various data types, including unstructured data.
What does nodetool do?
A command-line utility for managing and monitoring Cassandra nodes.
Fill in the blank: Cassandra is optimized for _______ workloads.
Write-heavy