Big data tools: Hadoop, Spark, H2O
-
Big data tools: Hadoop, Spark, H2O
After completing the topic learners will be able to recommend industry-grade big data tools.Overview of this topic:
- Tools for manipulating large datasets and performing analytics efficiently
- Utilization of distributed systems like Hadoop for fault tolerance and parallel computing
- Implementation of MapReduce for splitting, applying, and combining data operations
- Integration of streaming data solutions like Apache Kafka Streams and Apache Flink for real-time analytics
- Leveraging machine learning platforms such as H2O and Apache Spark MLlib for scalable algorithms
- See lesson's video;
- Learn from slides & notes;
- Take a quiz: Big Data TEST A;
- For support use chatbot bellow or chat with the notes.
-
Opened: Tuesday, 25 November 2025, 7:00 PM
Topic 1: Fundamentals of Big Data Analytics
Topic 2: Main big data tools: Hadoop, Spark, H2O