Training · Cloudera Open Source

Cloudera Training

Hands-on, in-depth courses for the core open source projects powering Cloudera Data Platform — Apache NiFi, Kafka, Impala, and Kudu — packed with operations, development, and tuning know-how that engineers can put to work immediately.

Curriculum overview

Each of the four open source projects is covered end-to-end, from architecture to operational tuning, drawing on best practices accumulated from real production deployments.

01
The standard for dataflow management

Apache NiFi

A complete course on Apache NiFi for enterprise data ingestion, transformation, and delivery — covering architecture, DataFlow design, operational tuning, and processor extension.

Curriculum

19 topics
  • 01Data pipeline architecture
  • 02Key NiFi capabilities
  • 03Core NiFi components
  • 04Important NiFi concepts
  • 05NiFi Processor
  • 06FlowFile structure
  • 07Relationship
  • 08Yield, Penalize, Rollback, Commit
  • 09Scheduling
  • 10Process Group
  • 11Queue
  • 12Funnel
  • 13Data Provenance
  • 14Site-To-Site
  • 15Controller Service
  • 16Using NiFi Processors
  • 17Developing and extending NiFi Processors
  • 18NiFi operations and optimization
  • 19Practical NiFi DataFlow design
02
Distributed streaming platform

Apache Kafka

Learn Apache Kafka end-to-end — from installation and internals to producer/consumer behavior, mirroring, operations, and monitoring.

Curriculum

12 topics
  • 01Introduction to Kafka
  • 02Kafka installation
  • 03Sending Kafka messages
  • 04Consuming Kafka messages
  • 05Kafka internals
  • 06Topics and Partitions
  • 07Message delivery
  • 08Kafka Connector
  • 09Kafka mirroring
  • 10Kafka administration
  • 11Kafka monitoring
  • 12Kafka operational configuration
03
High-performance MPP SQL engine

Apache Impala

Cover Impala architecture, installation requirements, Iceberg integration, query profile analysis, performance tuning, and security configuration.

Curriculum

16 topics
  • 01Introduction to Impala
  • 02Impala architecture
  • 03Installation and configuration requirements
  • 04Impala key ports
  • 05Impala data types
  • 06Impala client access
  • 07Impala Coordinator
  • 08Impala Catalog & StateStore
  • 09Supported file formats and storage
  • 10Iceberg Integration
  • 11Impala Query Profile
  • 12Impala system monitoring
  • 13Admission Control
  • 14Performance tuning
  • 15Impala security
  • 16HBase integration
04
Fast analytical columnar store

Apache Kudu

Learn Kudu's positioning and architecture, schema design, Impala integration, API usage, security, and administrative CLI with hands-on labs.

Curriculum

9 topics
  • 01Introduction to Kudu
  • 02Kudu's positioning
  • 03Kudu architecture
  • 04Kudu schema design
  • 05Impala and Kudu
  • 06Impala and Hive
  • 07Kudu API
  • 08Kudu security
  • 09Kudu command line
Training · Cloudera Open Source
Contact us