pyspark-hdfs-notebook This is a simple notebook that reads a file from a remote Hadoop cluster and counts words contained