“…Hadoop processes a dataset with two kinds of operations, map and reduce. By contrast, Spark provides a broader set of operations, divided into transformations and actions and summarized in Table . Transformations include map(), filter(), flatMap(), groupByKey(), reduceByKey(), and join(); actions include count(), collect(), reduce(), and save().…”
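
The split between transformations and actions can be illustrated with a minimal plain-Python sketch. It is not Spark's API: `ToyDataset` is a hypothetical stand-in that only borrows the operation names from the excerpt, and it evaluates eagerly, whereas Spark defers transformations until an action runs. The sample input and the variable names are likewise assumptions chosen for illustration. In the sketch, a transformation returns a new dataset and an action returns an ordinary value.

```python
from collections import defaultdict
from functools import reduce as fold


class ToyDataset:
    """Hypothetical in-memory stand-in for a Spark dataset (not Spark's API)."""

    def __init__(self, items):
        self._items = list(items)

    # --- Transformations: each returns a new ToyDataset ---
    def map(self, f):
        return ToyDataset(f(x) for x in self._items)

    def filter(self, pred):
        return ToyDataset(x for x in self._items if pred(x))

    def flatMap(self, f):
        # f returns an iterable per element; the results are flattened.
        return ToyDataset(y for x in self._items for y in f(x))

    def groupByKey(self):
        # Expects (key, value) pairs; yields (key, [values]) pairs.
        groups = defaultdict(list)
        for k, v in self._items:
            groups[k].append(v)
        return ToyDataset(groups.items())

    def reduceByKey(self, f):
        # Combines the values of each key with the binary function f.
        return ToyDataset((k, fold(f, vs)) for k, vs in self.groupByKey()._items)

    def join(self, other):
        # Inner join of two (key, value) datasets on the key.
        right = defaultdict(list)
        for k, w in other._items:
            right[k].append(w)
        return ToyDataset(
            (k, (v, w)) for k, v in self._items for w in right.get(k, [])
        )

    # --- Actions: each returns a plain Python value ---
    def count(self):
        return len(self._items)

    def collect(self):
        return list(self._items)

    def reduce(self, f):
        return fold(f, self._items)


# Word count: a chain of transformations followed by actions.
lines = ToyDataset(["spark map reduce", "hadoop map reduce"])
words = lines.flatMap(str.split)                 # transformation
pairs = words.map(lambda w: (w, 1))              # transformation
counts = pairs.reduceByKey(lambda a, b: a + b)   # transformation

word_counts = dict(counts.collect())             # action
total_words = words.count()                      # action
longest = words.reduce(lambda a, b: a if len(a) >= len(b) else b)  # action
```

With only map and reduce available, as in the Hadoop model described above, the same word count would be written as one map step that emits (word, 1) pairs and one reduce step that sums them per key. The additional transformations let the pipeline be written as a chain of small steps.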