K2B-7-1HadoopJun 1, 2020
The Apache Hadoop ecosystem — HDFS (distributed storage), YARN (cluster resource scheduler), MapReduce (batch compute), and the surrounding stack (Hive, Spark, HBase, Kafka). What each piece does, how the pieces fit, where Hadoop still wins in the cloud-native era, and the certification ecosystem (Cloudera, formerly Hortonworks).