Post by Category

DWH (1)

My 2 cents on Foreign Keys and Data Warehousing

May 1, 2017

As a data warehouse architect/designer, we always have to deal with the question: “should I use foreign keys ? Should I enforce them ?” Well, yes! for the f...

Design (1)

My 2 cents on Foreign Keys and Data Warehousing

May 1, 2017

As a data warehouse architect/designer, we always have to deal with the question: “should I use foreign keys ? Should I enforce them ?” Well, yes! for the f...

ETL (6)

Bulk load between technologies

October 16, 2016

Data migration is a very common task in (big) data engineering. In the big data landscape, you would probably look for sqoop to handle such a task. But in th...

Creating an execution dashboard for Talend jobs

April 3, 2017

Talend Open Studio is a great open source solution. Its “Cost vs. functionalities” is quite good. But because this is the free version, as opposed to the en...

Implementing a bridge table with Talend

January 19, 2017

In Kimball’s multi-dimensional data model, a bridge table is an analytical solution to a multi-valued dimension fact in a fact table: when a fact in a fact t...

Incremental data loading with Talend

March 8, 2017

One of the inevitable challenge in any data integration architecture is choosing the right loading technique. Push change data capture (CDC) is more than oft...

Variables declaration ordering error in Informatica and Talend

July 19, 2017

Understanding how in-flow variables are declared by your ETL tool is important. Probably all of them requires the user to declare the variables in the right ...

Workflow manager

November 15, 2016

If you have used Informatica, or to some extent Talend Studio (that is, not the free version), you know that you can chain jobs together. Now, if you need to...

Monitoring (1)

Creating an execution dashboard for Talend jobs

April 3, 2017

Talend Open Studio is a great open source solution. Its “Cost vs. functionalities” is quite good. But because this is the free version, as opposed to the en...

SQL (1)

Hierarchical and recursive queries in (T-)SQL

March 23, 2017

“Hierarchical and recursive queries in SQL”… this is quite a hot topic, so hot that it has its own wikipedia entry. The typical “hello world” is employees a...

TSQL (1)

Hierarchical and recursive queries in (T-)SQL

March 23, 2017

“Hierarchical and recursive queries in SQL”… this is quite a hot topic, so hot that it has its own wikipedia entry. The typical “hello world” is employees a...

Talend (2)

Implementing a bridge table with Talend

January 19, 2017

In Kimball’s multi-dimensional data model, a bridge table is an analytical solution to a multi-valued dimension fact in a fact table: when a fact in a fact t...

Incremental data loading with Talend

March 8, 2017

One of the inevitable challenge in any data integration architecture is choosing the right loading technique. Push change data capture (CDC) is more than oft...

Workflow (1)

Workflow manager

November 15, 2016

If you have used Informatica, or to some extent Talend Studio (that is, not the free version), you know that you can chain jobs together. Now, if you need to...

transformations (1)

String aggregation with databases and ETL tools

November 27, 2017

String aggregation is the process of concatenating strings (usualy as a list with a separator) based on a common key. Databases implement their own function ...