Tasks and conclusion
Post-training tasks:
- Try setting up your own 3-node Hadoop cluster.
  - A VM-based solution can be found here
 
- Write a simple Spark or MapReduce (MR) job of your choice and use it to understand how analytics are generated from data.
  - A sample dataset can be found here
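
As a starting point for the second task, the sketch below mimics the three phases of a MapReduce word count (map, shuffle, reduce) in plain Python, so the logic can be understood before it is ported to Spark or Hadoop. It is a local simulation only and does not use any Hadoop or Spark API. The function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) and the sample lines are assumptions made for illustration; they do not come from the training material or its dataset.

```python
from collections import defaultdict


def map_phase(records):
    """Map: emit a (word, 1) pair for every word in every input line."""
    for line in records:
        for word in line.lower().split():
            yield word, 1


def shuffle_phase(pairs):
    """Shuffle: group the emitted values by key, as the framework would."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce: sum the values collected for each key."""
    return {key: sum(values) for key, values in groups.items()}


def word_count(records):
    """Run the three phases in order on an iterable of text lines."""
    return reduce_phase(shuffle_phase(map_phase(records)))


if __name__ == "__main__":
    # Hypothetical input lines, not the sample dataset from the training.
    sample = ["hadoop stores data", "spark processes data"]
    for word, count in sorted(word_count(sample).items()):
        print(word, count)
```

On a real cluster the framework performs the shuffle step itself, so a job usually supplies only the map and reduce logic. Swapping the word-splitting in `map_phase` for a field extracted from each record (for example, a category column) turns the same pattern into a simple per-key aggregation.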