What are RDDs in PySpark?
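Question type: Technical interview question
This is a technical interview question that commonly comes up in interviews at Australian IT companies.
Difficulty: easy
Tags: interviewbit, pyspark, topic-specific, spark, data-engineering
Reference answer summary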
RDD stands for Resilient Distributed Dataset. RDDs are the elements that run and operate on multiple nodes to perform parallel processing on a cluster. Since RDDs are suited for p...
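To make the idea of parallel processing over multiple nodes concrete, here is a minimal sketch of creating an RDD and operating on it. It assumes a local Spark installation with the `pyspark` package available. The names `build_squares_rdd`, `main`, and `rdd-sketch` are hypothetical and chosen only for illustration; `parallelize`, `map`, `reduce`, and `getNumPartitions` are standard RDD calls.

```python
from operator import add


def build_squares_rdd(sc, numbers, num_partitions=4):
    """Hypothetical helper: distribute `numbers` and square each element.

    `sc` is expected to be a pyspark SparkContext.
    """
    # parallelize() splits the local collection into partitions that
    # Spark can process in parallel on different executors.
    rdd = sc.parallelize(numbers, num_partitions)
    # map() is a lazy transformation: nothing runs until an action is called.
    return rdd.map(lambda x: x * x)


def main():
    # Imported here so the helper above can be read without Spark installed.
    from pyspark import SparkContext

    # "local[2]" runs Spark in-process with two worker threads (an assumption
    # for this sketch; a real cluster would use a different master URL).
    sc = SparkContext("local[2]", "rdd-sketch")
    try:
        squares = build_squares_rdd(sc, range(1, 11))
        print(squares.getNumPartitions())  # number of partitions requested
        # reduce() is an action: it triggers the computation and
        # combines the per-partition results into a single value.
        print(squares.reduce(add))  # 385, the sum of squares of 1..10
    finally:
        sc.stop()

# Call main() to run this locally (requires pyspark and a Java runtime).
```

The split between `map` (a lazy transformation) and `reduce` (an action that triggers execution) is the part interviewers usually probe: Spark records the transformations as lineage and only computes when a result is requested.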