Name 6 action commands we learned in class
.colllect() .take(num=filestaken) .foreach() .count() .countByValue() .saveAsTextFile()
Name 9 transformation commands we learned in class
What is the pyspark syntax to start a spark context
conf = SparkConf().setAppName('example').setMaster('local[*]')
sc = SparkContext(conf=conf)What are different syntax to load a file into a Spark environment?
df = spark.read.load(‘FOO’, format=’FILEFORMAT’, inferSchema = ‘true’, header = ‘true’)
alternatively you can load a file via:
df = spark.read.json('FOO')
df = spark.read.csv('FOO')What are the five categories of spark?
1) SparkCore – SparkContext
2) SparkSQL – SparkSession
3) Spark Streaming
4) MLlib
5) graphx