Easyflow docs
  • 🚩Introduction
  • Automation
    • 🚀Get Started
    • 🤖Workflows
    • ⚡Run & Invoke
      • Manual
      • Triggers
      • Scheduler
      • WebHook (Instant)
      • Rest API endpoint
      • Transactional vs Queue
      • Online \ Offline
    • 🎶Workflow orchestration
      • Step operations
      • Step configuration
      • Data mapping
    • 👣Steps and conditions
      • Multi steps
      • Parallel steps
      • Conditions
      • Loop & iteration
    • 🎲Dynamic content
      • Variables
      • Function expressions
    • 📦Schema
      • Schema builder
      • Auto schema detection
      • Use cases
    • 🐞Debugging
  • Visualisation
    • 🚀Get Started
    • 🧮Visual designer
      • Dashboard designer
      • Card designer
    • 🔌Datasources
      • Partner connectors
      • Existing connections
      • URL (CSV or JSON)
      • Data pipeline (Flow)
      • Data-blocs
      • Push to visualiser
    • 🎻Data pivoting
    • 📈Chart types
    • 🎨Customisation
      • Card appearance
      • Theme & branding
    • 🤝Preview & sharing
      • URL share
      • Scheduled snapshots
      • Send to TV
      • Preview mode
    • ⌛Refresh settings
  • Data Pipeline
    • 🚀Get Started
    • 🗞️Pipeline (ETL)
    • ⚡Automated runs
    • 🌻Transformation
      • Average by group
      • Change text casing
      • Clean data
      • Combine columns
      • Combine tables
      • Compare dates
      • Convert array of objects to array of arrays
      • Count by group
      • Extract text from column
      • Fill in blanks
      • Filter rows
      • Find and replace
      • Find maximum by group
      • Find min/max per row
      • Find minimum by group
      • Flatten rows
      • Format dates
      • Format numbers
      • Insert date & time column
      • Insert growth rate column
      • Insert if/else column
      • Insert if/else blank column
      • Insert math column
      • Insert row numbers
      • Insert rows
      • Insert running total column
      • Insert text column
      • Limit rows
      • Merge duplicate rows
      • Pivot columns
      • Remove duplicate rows
      • Rename columns
      • Select columns
      • Reorder columns
      • Sort rows
      • Split column
      • Stack tables
      • Sum by Group
      • Unpivot columns
      • Use regex
    • 🕶️Preview results
    • 🎻Orchestration
    • 📤Output
      • For Google Datastudio
      • For Easyflow visualisation
      • Array of arrays
      • Array of objects
  • Connectors
    • 🚀Get Started
    • 🌟Built-in Connectors
      • CSV
      • Data bloc
      • Data mapper
      • Delay (Wait)
      • HTTP
      • For Loop (Iteration)
      • Functions
        • Array functions
        • Conditions
        • Convertors
        • General functions
        • Date & Time
        • Filters
        • Math functions
        • Object functions
        • Operators
        • Text functions
      • Receive emails | IMAP
      • RSS
      • Send emails | SMTP
      • SQL Syntax
      • Trasform
      • Variables Setter
      • WebHook
      • Workflow (Sub flow)
    • 🌠Partner connectors
  • System
    • 👫Teams
    • 🙂Profile
    • 🧑‍💻Monitoring
    • 🔋Account Usage
    • 💵Billing and Pricing
Powered by GitBook
On this page
  • Input/output
  • Settings
  • Helpful tips

Was this helpful?

  1. Data Pipeline
  2. Transformation

Combine tables

PreviousCombine columnsNextCompare dates

Last updated 2 years ago

Was this helpful?

The Combine Tables step joins multiple tables into one by matching rows between those tables. It is similar to a VLOOKUP in Excel or Google Sheets. In SQL, it's like a join clause.

The principle is simple: if we have two tables of data that are related to each other, we can use the Combine Tables step to join them into one table by matching rows between those tables. This step can handle combining two tables at a time. Once we set it up, we can use it repeatedly. Even if the row amount changes, our step will continue working.

Input/output

The Combine Tables step requires two data inputs to combine. In our example below, we have two tables feeding into it. The first one is a table of "First Name", "Last Name", "Company", "Email" and "Country". The second one is a table of "Countries" and "Capital".

After using the Combine Tables step, our output data (shown below) has combined the "Countries" and "Capital" columns from the second table to the first table using "Countries" as the matching identifier.

Settings

Helpful tips

  • The default setting for the Combine Tables step is the most common way to combine tables. It keeps the entire primary table and finds matches in the other tables, fitting them in to their matched rows as we go. If a row doesn't have a match in the primary table, it won't show up in the results.

  • To combine three or more tables together, chain together multiple Combine Tables steps to merge your data.

After connecting two datasets into this step, Pick the datasets in the "First Dataset" and "Second Dataset" fields, then type your required join type "INNER", "OUTER", "LEFT" or "RIGHT". Then, in the identifier textboxes, you need to type the column name or index that will be used for the matching between the two datasets. "Identifier (First)" from the "First dataset" and "Identifier (Second)" from the "Second dataset". Once you're done, click the Preview results button from the bottom bar to display the newly-combined tables. If the datasets you want to combine don't have a shared column of identical values, then you can use the step to make one in each table. You'd do this by connecting your import datasets to an Insert row numbers step for generating a column with incrementally-increasing numbers.

🌻
Insert row numbers