Spark Interview Questions Summary

Alibaba interview: https://www.jianshu.com/p/11578fd6e272
https://www.jianshu.com/p/c8a271448dcd
Big data development interview-MMMM: https://www.jianshu.com/p/fec32e92e06c

OGG CDC: reading Oracle logs-M

https://blog.csdn.net/dkl12/article/details/80447154
https://www.csdn.net/gather_28/MtTaQg3sMDI5OS1ibG9n.html

Flume-M

Source types: spooldir, avro, exec
Channel types: memory, file, jdbc, kafka
Sink types: avro, hdfs
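
To make the component types above concrete, here is a minimal sketch of a Flume agent configuration that wires one spooldir source to one memory channel and one hdfs sink. The agent name (a1), the component names (r1, c1, k1), the spool directory, the NameNode address and the capacity values are all placeholders chosen for illustration, not values from any of the linked posts.

```properties
# Hypothetical agent "a1": one source, one channel, one sink.
a1.sources = r1
a1.channels = c1
a1.sinks = k1

# spooldir source: picks up files dropped into a directory (path is an assumption).
a1.sources.r1.type = spooldir
a1.sources.r1.spoolDir = /var/log/incoming
a1.sources.r1.channels = c1

# memory channel: fast, but buffered events are lost if the agent process dies.
a1.channels.c1.type = memory
a1.channels.c1.capacity = 10000
a1.channels.c1.transactionCapacity = 1000

# hdfs sink: writes events as plain data files (NameNode host and port are assumptions).
a1.sinks.k1.type = hdfs
a1.sinks.k1.hdfs.path = hdfs://namenode:8020/flume/events/%Y-%m-%d
a1.sinks.k1.hdfs.fileType = DataStream
# Use the agent's local time to resolve %Y-%m-%d instead of a timestamp header.
a1.sinks.k1.hdfs.useLocalTimeStamp = true
a1.sinks.k1.channel = c1
```

Replacing the memory channel with a file channel trades throughput for durability, which is the usual follow-up question when these channel types come up in an interview.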
Flume reading the binlog, combined with Kafka
https://blog.csdn.net/qq_33792843/article/details/84537669

Maxwell: reading MySQL data into HDFS in real time

https://blog.csdn.net/qq_33290422/article/details/80225432
https://b
