Is it possible to create PySpark DataFrame from external data sources?
Is it possible to create PySpark DataFrame from external data sources?
题目类型: 技术面试题
这是一道技术面试题,常见于澳洲IT公司面试中。
难度: hard
标签: interviewbit, pyspark, topic-specific, spark, data-engineering
参考答案摘要
Yes, it is! Realtime applications make use of external file systems like local, HDFS, HBase, MySQL table, S3 Azure etc. Following example shows how we can create DataFrame by reading data from a csv f...
本题提供 STAR 原则详细解答和技术解析,登录匠人学院学习中心即可查看完整答案。