WHAT IS RDD ?
RDD is the spark's core abstraction which is resilient distributed dataset.
It is the immutable distributed collection of objects.
RDD Creation
RDD vs Dataframe vs Dataset