Logo DWBI.org Login / Sign Up
Sign Up
Have Login?
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Login
New Account?
Recovery
Go to Login
By continuing you indicate that you agree to Terms of Service and Privacy Policy of the site.
SSIS

SSIS- Derived Column Transform

 
Updated on Sep 30, 2020

This is a Fast Forward tutorial on Derived Column Transformation in Microsoft SQL Server Integration Services.

Derived Column

The Derived Column is very flexible and powerful Transformation used to create new column values by applying expressions to input columns of the transformation. An expression can contain any combination of variables, functions, operators, and columns from the transformation input. The result can be added as a new column or inserted into an existing column as a replacement value. The Derived Column transformation can define multiple derived columns, and any variable or input columns can appear in multiple expressions.

Implementation

Lets go straight to implementation. In the previous article we have learnt the basics related to SSIS Project, Packages, Connections, Control Flow and Data Flow. So lets design the Control Flow as below. Here we have taken a Execute SQL Task, to delete data from the Staging target table before the Data Flow Task. The Data Flow Task is responsible for the data movement logic.

Control Flow
Control Flow

Double click the Execute SQL Task to open the Editor. Next we set the Name as Execute_SQL_Task_DELETE_STAGE_TBL. Next we select the Connection Type followed by the Connection Object. Next we select the SQLSourceType as Direct Input. Next we put the desired SQLStatement that we want to execute prior to the Data Flow run. By default, the ByPassPrepare is set to True. In this case we can optionally set it to false as we are not using parameterized query. Find the screen shot below:

Execute SQL Task
Execute SQL Task

Below is the the SQL statement that will be executed prior to the Data Flow. It will delete the existing records in the target staging table.

SQL Statement
SQL Statement

Now let us define the Data Flow Task to specify the data movement logic.

Data Flow Task- Properties
Data Flow Task- Properties

Here we use a OLE DB Source, a Derived Column transformation and finally a OLE DB Destination.

Data Flow
Data Flow

Double-click to open the OLEDB Source Editor. Next we select the OLEDB connection object name from the OLEDB connection manager browser. Next select the database Table or View.

OLEDB Source- Connection Manager
OLEDB Source- Connection Manager

Here we select all the columns of the EMP table as our source.

OLEDB Source- Columns
OLEDB Source- Columns

Lets keep the Error Outputs as default.

OLEDB Source- Error Output
OLEDB Source- Error Output

Next let's have a look at the Advanced Editor. Right-click the OLEDB Source and select Show Advanced Editor.

OLEDB Source Advanced Editor- Connection Managers
OLEDB Source Advanced Editor- Connection Managers

Below OLEDB Source Advanced Editor- Component Properties.

OLEDB Source Advanced Editor- Component Properties
OLEDB Source Advanced Editor- Component Properties

Below OLEDB Source Advanced Editor- Column Mappings

OLEDB Source Advanced Editor- Column Mappings
OLEDB Source Advanced Editor- Column Mappings

Below OLEDB Source Advanced Editor- Input & Output Properties

OLEDB Source Advanced Editor- Input & Output Properties
OLEDB Source Advanced Editor- Input & Output Properties

Next we Double-click the Derived Column transformation to open the Editor. The Editor layout shows three panes- Left we have the input Columns to the transformation along with the set of System and User defined variables. On the right we have the set of Functions and operators to be used for column manipulation. Below is the area where we define the new sets of columns as Output. Here we define the Derived Column Name, Data Type, Length, Precision, Scale and the corresponding Expression.

Derived Column Transformation Editor
Derived Column Transformation Editor

let us define a new column as load_date to be populated with system date, using function GETDATE(). Next we define output column commission as- if input commission is NULL then we evaluate it as 0 else we populate the source commission value.

Next right-click the Derived Column transform and select Show Advanced Editor.

Derived Column Advanced Editor- Component Properties
Derived Column Advanced Editor- Component Properties

Below Derived Column Advanced Editor- Input Columns. Here we can rename a column name using Output Alias. Here we select the Usage Type for column comm as READONLY as this column is used to evaluate commission and we don't want this column as output; instead we need the derived column as the output.

Derived Column Advanced Editor- Input Columns
Derived Column Advanced Editor- Input Columns

Below Derived Column Advanced Editor- Input & Output Properties.

Derived Column Advanced Editor- Input & Output Properties
Derived Column Advanced Editor- Input & Output Properties

Finally open the OLEDB Destination Editor. Select the connection object and the target table.

OLEDB Target- Connection Manager
OLEDB Target- Connection Manager

Next we go to the Mappings tab and define the target column mapping.

OLEDB Target- Mappings
OLEDB Target- Mappings

Lets leave the OLEDB Target- Error Output as default.

OLEDB Target- Error Output
OLEDB Target- Error Output

Next right-click the OLEDB Destination and select Advanced Editor.

OLEDB Target Advanced Editor- Connection Managers
OLEDB Target Advanced Editor- Connection Managers

Below OLEDB Target Advanced Editor- Component Properties

OLEDB Target Advanced Editor- Component Properties
OLEDB Target Advanced Editor- Component Properties

Below OLEDB Target Advanced Editor- Column Mappings

OLEDB Target Advanced Editor- Column Mappings
OLEDB Target Advanced Editor- Column Mappings

Below OLEDB Target Advanced Editor- Input & Output Properties

OLEDB Target Advanced Editor- Input & Output Properties
OLEDB Target Advanced Editor- Input & Output Properties

Finally we save the Data Flow, Control Flow and Package. Next Execute the Package.

Execute Package- Derived Column
Execute Package- Derived Column

Use Derived Column transform to evaluate and manipulate input column values and generated desired output columns as a part of data movement logic.

Top 10 Articles