Spark Context
RDDs
(Resilient Distributed Datasets)
Creating an RDD
A file based RDD
> val mydata = sc.textFile(“purplecow.txt”)
> mydata.count()
RDD Operations
RDD Operations: ACTIONS
Example:
for (line <- mydata.take(2))
println(line)
RDD Operations: TRANSFORMATION