- Purpose of the article: In this blog, we have explained how we can Load JSON data from Azure BLOB to snowflake using Matillion ETL.
- Intended Audience: This POC/blog will help the organization employees working on Matillion ETL.
- Tools and Technology: Matillion, Azure BLOB, Snowflake
- Key words: implement JSON data loading to snowflake using matillion.
Description:
The JSON file in Azure Blob storage should be loaded into Snowflake Datawarehouse using Matillion ETL.
Data Flow:-
data:image/s3,"s3://crabby-images/2e2ed/2e2edcbe3404430fadfdf389702048e0ff5d98da" alt=""
SOLUTION:
Orchestration Job:
data:image/s3,"s3://crabby-images/9ab26/9ab26bf9111522f7c4c7b3aeb15e977882e9c15a" alt=""
Take Orchestration Job in Matillion, Add Azure Blob Storage Component to Load the data from Blob to Snowflake.
Step 1:
When trying to load a JSON file from Azure blob to Snowflake, create one table with one column by using a variant datatype to that column.
data:image/s3,"s3://crabby-images/cf35e/cf35e4ba31682ab33606155a206f9e2545deafec" alt=""
This helps load the raw JSON data into that table.
Step 2:
data:image/s3,"s3://crabby-images/509e2/509e22e70ded4c65882dd52b085cbf1a9e48797c" alt=""
Give a valid storage location to access the file in Azure blob and provide Target table name, which was created in Snowflake to load the raw JSON data.
Mention the pattern as .*JSON to read JSON files and to avoid reading different file formats from the Azure blob location.
Step 3:
Validate and run the entire job.
data:image/s3,"s3://crabby-images/265f2/265f250b7525bde27138b7c6d001e1ceb769a4ef" alt=""
Check the table in Snowflake to see whether the JSON data is loaded.
Transformation Job:
data:image/s3,"s3://crabby-images/48584/485846baafeb8a720f67f938a1d6af92dcc9162f" alt=""
Step 1:
After loading raw JSON data into one table, Load the whole JSON data in the format of rows and columns.
data:image/s3,"s3://crabby-images/641ec/641ec55af7fbc0c2a019770a709aa5719d0bf0a1" alt=""
Add the transformation job in Matillion ETL; take the Table Input component and mention the Target table in which raw JSON data was loaded into the table.
Step 2:
Take the Nested data component to flatten nested data into rows
This is done by taking nested data as key: value pairs (such as a JSON dictionary) and using those keys as column names.
data:image/s3,"s3://crabby-images/043c7/043c78683bd6aff331a2afc7cefb6d8898729e25" alt=""
Provide required columns to extract JSON data
Step 3:
Take the Table output component to load extracted JSON data into rows and columns.
data:image/s3,"s3://crabby-images/8e731/8e7318beed1889ecd77af3e609c7f0e7e3d1b7c8" alt=""
Create the table in Snowflake with the required columns to load JSON data and map the columns to load match column names with the target table.
Output:
data:image/s3,"s3://crabby-images/5cc84/5cc845ee808fd340d0bd5b7583367234a1510475" alt=""
Entire Flow:
data:image/s3,"s3://crabby-images/cd973/cd9735ae9cf74110546fb6fcc5d5a247e8c6fc2f" alt=""
Author Bio:
data:image/s3,"s3://crabby-images/5ccdd/5ccdd5d4ac588f776dbc3da259a242cd6095b89d" alt="Picture of Rukmini Vuppala"
Rukmini Vuppala
Associate Software Engineer
Hello, I am Rukmini Vuppala. I am as Associate Software Engineering in MOURI Tech for the past 1.5 years. I have good experience in Azure, Matillion, Snowflake, SQL