Intermediate

Pipeline Builder

Discover the no-code Pipeline Builder tool for creating transforms visually with a drag-and-drop interface.

1 ยท No-Code Transforms

Pipeline Builder lets you build transforms without writing code. You drag input datasets onto a canvas, add transformation nodes (filter, join, aggregate, pivot, derive column), and connect them to an output dataset. Under the hood, Pipeline Builder generates the same Spark execution plan as a Python or SQL transform โ€” so there is no performance trade-off.

2 ยท Common Operations

Pipeline Builder supports: - Filter โ€” keep rows matching a condition. - Join โ€” combine datasets on shared keys. - Aggregate โ€” group and summarise (count, sum, avg). - Derive Column โ€” add computed columns with expressions. - Pivot / Unpivot โ€” reshape wide to long or vice versa. - Union โ€” stack datasets vertically. Each node shows a live data preview so you can verify results at every step.

3 ยท When to Use Pipeline Builder vs. Code

Pipeline Builder shines for straightforward ETL: filtering, joining, and aggregating. For complex logic โ€” custom UDFs, machine learning, recursive algorithms โ€” Code Repositories give you full flexibility. Many teams use Pipeline Builder for 80% of their transforms and drop into code for the remaining 20%. Both approaches produce standard Foundry datasets.

โœ… Key Takeaways

  • Pipeline Builder lets you build transforms visually โ€” no code required.
  • It generates the same Spark execution plan as Python or SQL transforms.
  • Live previews at every node make debugging straightforward.
  • Use Pipeline Builder for standard ETL; use code for complex logic.

๐Ÿง  Knowledge Check