Spark의 RDD를 분석한다.
1. Spark RDD
1.1. Job, Stage, Task
1.2. Lazy Evaluation
2. 참조
- https://people.eecs.berkeley.edu/~matei/papers/2012/nsdi_spark.pdf
- https://datastrophic.io/core-concepts-architecture-and-internals-of-apache-spark/
- https://blog.k2datascience.com/batch-processing-apache-spark-a67016008167
- https://stackoverflow.com/questions/41340612/do-stages-in-an-application-run-parallel-in-spark
- https://jaemunbro.medium.com/apache-spark-%EC%A1%B0%EC%9D%B8-join-%EC%B5%9C%EC%A0%81%ED%99%94-c9e54d20ae06
- https://alklid.github.io/dlog/2017/10/12/spark-01/index.html
- https://pizzathief.oopy.io/spark-rdd