Apache Beam Key Concepts
Apache Beam has the following key concepts.
- Pipeline
- Modeling processing data
- PCollection
- Immutable dataset
- PTransform
-
Various transforming functions
- ParDo
- GroupBy
- Combine
- Flatten
- Partition
- Side Input
- Additional data to support transformation
- Runner
- Selecting platform to run logics such as Spark, Flink and etc
- Windowing
- Defining time windows - What
- Triggers
- When aggregation will run - When
- Schema
- Defining data format