Pre-SQL and Post-SQL for Source and Target in Ingestion
Pre-SQL statements are SQL statements executed on the source and target connections before the pipeline runs, while Post-SQL statements are executed after the pipeline runs. They are useful for performing operations such as insert, delete, and update before and after the load.
For example, to load data from a CSV file into a database table that has not been created yet, use the Pre-SQL option in the target tab to create the table before the data is written. Similarly, to delete specific rows from a source table before the data is fetched, use the Pre-SQL option in the source tab of the selected table.
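The target-side case can be sketched outside Guzzle with plain Python. This is only an illustration, not Guzzle's implementation: `sqlite3` stands in for the target datastore, and the CSV content, the `customers` table, and the `load_csv_with_pre_sql` helper are all hypothetical names chosen for the example.

```python
import csv
import io
import sqlite3

# Hypothetical CSV content standing in for the source file.
CSV_TEXT = "id,name\n1,alice\n2,bob\n"

# Hypothetical Pre-SQL for the target: create the table before the load.
TARGET_PRE_SQL = "CREATE TABLE IF NOT EXISTS customers (id INTEGER, name TEXT)"


def load_csv_with_pre_sql(conn, csv_text, pre_sql):
    """Run the Pre-SQL on the target connection, then load the CSV rows."""
    conn.execute(pre_sql)  # Pre-SQL runs before any data is written
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    conn.executemany(
        "INSERT INTO customers (id, name) VALUES (:id, :name)", rows
    )
    conn.commit()
    return len(rows)


conn = sqlite3.connect(":memory:")  # assumption: SQLite as the target
loaded = load_csv_with_pre_sql(conn, CSV_TEXT, TARGET_PRE_SQL)
row_count = conn.execute("SELECT COUNT(*) FROM customers").fetchone()[0]
print(loaded, row_count)
```

Without the Pre-SQL step, the `INSERT` would fail because the `customers` table would not exist yet, which is the situation the target Pre-SQL option is meant to cover.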
Guzzle supports Pre-SQL and Post-SQL statements for both Source and Target in Ingestion. They are mainly used for pre- and post-formatting of data in a database.
The table below lists the datastore technologies for which Pre-SQL and Post-SQL statements are supported for Source and Target in Ingestion:
Datastore technologies | Pre/Post SQL for source | Pre/Post SQL for target |
---|---|---|
Delta | Yes | Yes |
Hive | Yes | Yes |
Azure SQL | Yes | Yes |
Azure Synapse Analytics | Yes | Yes |
JDBC | Yes | Yes |
Guzzle executes Pre-SQL and Post-SQL statements in the following order:
1. Source Section → Pre-SQL source
2. Source Section → Read table or SQL
3. Source Section → Post-SQL source
4. Target Section → Pre-SQL target
5. Target Section → Write data into target
6. Target Section → Post-SQL target
This can be summarized as follows:
In the source section:
- The Pre-SQL statements are executed on the source table.
- The table is read by Guzzle.
- The Post-SQL statements are executed on the source table.
In the target section:
- The Pre-SQL statements are executed on the target table.
- The data is written into the target table.
- The Post-SQL statements are executed on the target table.
This order of execution applies across all connectors in Guzzle.
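The order of execution described above can be sketched as a small simulation. This is a minimal illustration under stated assumptions, not Guzzle's code: two in-memory `sqlite3` databases stand in for the source and target, and the `run_ingestion` function, the configuration keys, and the `orders`, `orders_copy`, and `load_audit` tables are hypothetical.

```python
import sqlite3


def run_all(conn, statements):
    """Execute each statement in list order on the given connection."""
    for stmt in statements:
        conn.execute(stmt)
    conn.commit()


def run_ingestion(src, tgt, cfg):
    """Run the six documented steps in order and return a log of step names."""
    log = []
    run_all(src, cfg["source_pre_sql"])
    log.append("source pre-sql")
    rows = src.execute(cfg["source_read_sql"]).fetchall()
    log.append("source read")
    run_all(src, cfg["source_post_sql"])
    log.append("source post-sql")
    run_all(tgt, cfg["target_pre_sql"])
    log.append("target pre-sql")
    tgt.executemany(cfg["target_write_sql"], rows)
    tgt.commit()
    log.append("target write")
    run_all(tgt, cfg["target_post_sql"])
    log.append("target post-sql")
    return log


# Hypothetical source data.
src = sqlite3.connect(":memory:")
src.execute("CREATE TABLE orders (id INTEGER, status TEXT)")
src.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [(1, "new"), (2, "cancelled"), (3, "new")],
)
tgt = sqlite3.connect(":memory:")

# Hypothetical configuration; the keys are illustrative, not Guzzle settings.
cfg = {
    "source_pre_sql": ["DELETE FROM orders WHERE status = 'cancelled'"],
    "source_read_sql": "SELECT id, status FROM orders ORDER BY id",
    "source_post_sql": ["UPDATE orders SET status = 'exported'"],
    "target_pre_sql": [
        "CREATE TABLE orders_copy (id INTEGER, status TEXT)",
        "CREATE TABLE load_audit (row_count INTEGER)",
    ],
    "target_write_sql": "INSERT INTO orders_copy VALUES (?, ?)",
    "target_post_sql": ["INSERT INTO load_audit SELECT COUNT(*) FROM orders_copy"],
}

log = run_ingestion(src, tgt, cfg)
target_rows = tgt.execute("SELECT id, status FROM orders_copy ORDER BY id").fetchall()
source_rows = src.execute("SELECT id, status FROM orders ORDER BY id").fetchall()
audit = tgt.execute("SELECT row_count FROM load_audit").fetchall()
print(log)
print(target_rows, source_rows, audit)
```

In this sketch the target receives the rows as they were at read time (status `new`), because the source Post-SQL update runs only after the read, while the source Pre-SQL delete has already removed the cancelled order.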
When there are multiple SQL statements, they are executed in the order in which they appear in the interface.
To execute multiple SQL statements, add each one in the next input field; they are all executed in sequence, first in, first out. For example, in the above figure there are two Pre-SQL statements: the INSERT INTO statement is executed first, followed by the DELETE FROM statement.
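To see why the first-in, first-out order matters, the following sketch runs an INSERT INTO and a DELETE FROM statement in both orders. It is an illustration only: `sqlite3` stands in for the datastore, and the `staging` table, the two statements, and the helper functions are hypothetical, not the statements from the figure.

```python
import sqlite3


def run_in_order(conn, statements):
    """Execute the statements one by one, in the order they are listed."""
    for stmt in statements:
        conn.execute(stmt)
    conn.commit()


def remaining_rows(statements):
    """Run the statements against a fresh, empty staging table and return its rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE staging (id INTEGER, note TEXT)")
    run_in_order(conn, statements)
    return conn.execute("SELECT id, note FROM staging").fetchall()


insert_stmt = "INSERT INTO staging (id, note) VALUES (99, 'temp')"
delete_stmt = "DELETE FROM staging WHERE id = 99"

as_listed = remaining_rows([insert_stmt, delete_stmt])       # INSERT, then DELETE
reversed_order = remaining_rows([delete_stmt, insert_stmt])  # DELETE, then INSERT
print(as_listed)       # []
print(reversed_order)  # [(99, 'temp')]
```

The same two statements leave the table in different states depending on their order, so the order of the input fields in the interface is part of the pipeline's behavior.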