Which data structure is used to represent distributed datasets in Spark?
Apache Parquet
Resilient Distributed Dataset (RDD)
Overlook minor misbehaviors
Impose harsh punishments for any infraction

Big Data Technologies Exercises are loading ...