Extract text from column

The Extract text from column step extracts a portion of text based on a matching character or offset. You may use this to pull out company names from emails, remove part of an ID, or extract the timezone from any date/time stamp.

Input/output

In this case, we're looking to extract email domains from our customer email addresses. Our input data has five columns: "First Name", "Last Name", "Company", "Email" and "Country".

After using the Extract text from column step, it gives us a new column named "Domain Names" where extracted email domains are listed (taken from the "Email" column's values). This new column is filled with domain names.

Settings

First, Pick the dataset, then specify the column that you'd like to extract text from.

Then, give your new column a name. This step will always create a new column with your extracted data.

Next, select an Operation. The options are:

  • find all text after

  • offset after

  • find all text before

  • offset before

  • offset after from the the end of text

  • offset before from the end of text

Last updated