Divide large datasets into smaller, organized parts based on column values. Flow uses Hive partitioning convention, creating a directory structure where each folder represents a partition value.
output
├── color=blue
│ ├── sku=PRODUCT01
│ │ └── products.csv
│ └── sku=PRODUCT02
│ └── products.csv
├── color=green
│ ├── sku=PRODUCT01
│ │ └── products.csv
│ ├── sku=PRODUCT02
│ │ └── products.csv
│ └── sku=PRODUCT03
│ └── products.csv
└── color=red
├── sku=PRODUCT01
│ └── products.csv
├── sku=PRODUCT02
│ └── products.csv
└── sku=PRODUCT03
└── products.csv
Write partitioned data with overwrite mode. When writing, ALL files within each affected partition directory are removed and replaced. Partitions NOT in the current dataset remain untouched.