
migration, Data transformation , Business intelligence, Datawarehouse, Big data and Analytics.
ETL is Extract transform and Load whereas ELT is extract load then transform. Both have their own pros and cons but now a days ELT is more popular than ETL. This is why because size of data is growing & growing for transformation. ELT also supporting Data virtualization concept where actual original data will reside on system without any modification and transformation will less.
SQL server has reach sets of tools for ETL & ELT . Here I am sharing some important Differences between ETL and ELT which we should know before jumping on features.
SrlNo
|
Dimension
|
ETL
|
ELT
|
1 | Technology Adoption | ETL is a well-developed process used for over 20 years | ELT is a new technology |
2 | Compliance | This makes it easier to satisfy GDPR, HIPAA, and CCPA compliance standards. | This could violate GDPR, HIPAA, and CCPA standards. |
3 | Data size vs. complexity of transformations | ETL is best suited for dealing with smaller data sets that require complex transformations. | ELT is best when dealing with massive amounts of structured and unstructured data. |
4 | Implementation Complexity | ETL experts are easy to procure when building an ETL pipeline. | As a new technology, the tools to implement an ELT solution are still evolving. Moreover, experts with the requisite ELT knowledge and skills can be difficult to find. |
5 | Order of the extract, transform, load process | Data transformations happen immediately after extraction within a staging area. After transformation, the data is loaded into the data warehouse. | Data is extracted, then loaded into the target data system first. Only later is some of the data transformed on an “as-needed” basis for analytical purposes. |
6 | Transformation process | Transformations happen within a staging area outside the data warehouse. | Transformations happen inside the data system itself, and no staging area is required. |
7 | Unstructured data support | ETL can be used to structure unstructured data, but it can’t be used to pass unstructured data into the target system. | ELT is a solution for uploading unstructured data into a data lake and make unstructured data available to business intelligence systems. |
SQL Server have reach sets of data transformation features available since 2000 to now.
Functionality
|
SSIS
|
BULK INSERT
|
BCP
|
INSERT … SELECT
|
Azure Data
Factory
| |
SQL Dest.
|
OLE DB Dest
| |||||
SQL Server Version
|
2005 - Now
|
2005 – Now
|
7.0, 2000 to Now
|
6.0, 7.0, 2000 to Now
|
2008 to Now
|
2016
|
Protocol
|
Shared Memory
|
TCP/IP
Named Pipes
|
In Memory
|
TCP/IP
Shared Memory
Named Pipes
|
In Memory
|
Cloud Pipeline & Link Service
|
Speed
|
Faster / Fastest
|
Fast / Fastest
|
Fastest
|
Fast
|
Slow / Fastest
|
Fastest
|
Data Source
|
Heterogeneous
|
Heterogeneous
|
Data File Only
|
Data File Only
|
Any OLE DB
|
Heterogeneous / Hybrid
|
Transformation
|
Yes
|
Yes
|
No
|
No
|
Yes
|
Yes
|
ETL & ELT
|
ETL
|
ETL
|
E & L
|
E & L
|
ETL
|
ETL & ELT
|
Invoked from
|
DTEXEC / BIDS
|
DTEXEC / BIDS
|
Transact-SQL
|
Command Line
|
Transact-SQL
|
SaaS/API/ Web Service
|
Use Case
|
DW
|
DW
|
Small Scale DW
|
VLDB
|
VLDB
|
Big Data
|
Thanks for reading
Plz dont forget to like Facebook Page..https://www.facebook.com/pages/Sql-DBAcoin/523110684456757
Thanks for the amazing post.
ReplyDeleteSSIS Upsert