Building a Transformation Pipeline
One of the most powerful features of Saddle Data is the ability to chain multiple in-flight transformations together. This guide will show you how to build a simple yet practical transformation pipeline.
Let's say we have a users table with the following columns:
idnameemailis_active(boolean)created_at
Our goal is to create a new table in our destination that only contains the id and email of active users, with the email column renamed to user_email.
Here is the pipeline we will build:
- Schema Mapping: Select only the
idandemailcolumns and renameemailtouser_email. - Filter Rows (Transformation): Keep only the rows where
is_activeistrue.
Building the Flow
- Configure Mapping: In the Schema tab, select the
userstable. Uncheck all columns exceptidandemail. In the destination mapping for theemailcolumn, change the name touser_email. - Add Filter: In the Design tab, add a Filter Rows transformation to your pipeline. Configure it to keep rows where
is_activeequalstrue.
When this Flow runs, Saddle Data will first filter the data according to your transformation rules and then apply the schema mapping, resulting in a clean, targeted table in your destination.