Purpose of the article: In this blog, we have explained how we can implement snow pipe in AWS cloud.
Intended Audience: This POC/Blog will help which kinds of people, like developers working on AWS and snowflake cloud
Tools and Technology: AWS services, Snowflake
Key words: Creating snow pipe by using snowflake on AWS
Objective:
To establish the Snowflake Snow pipe on an AWS stage.
Snow pipe:
- It’s a Snowflake continuous data ingestion service.
- It loads data within minutes once after files are added to a stage and submitted for ingestion.
- It enables loading data from files as soon as they’re available during a stage.
Advantages of Snow pipe:
- Snow pipe constantly offers fresh business data across all departments while avoiding workload issues.
- It is extremely cost-effective and charges customers per second with supported computing time.
- Snow pipe is extremely adaptable and enables simple customizations to load data.
The architecture of Snow pipe in AWS:
data:image/s3,"s3://crabby-images/7c9d0/7c9d0a3b939c897b36195c4db8c17d0d62c06e47" alt=""
Steps to implement Snow pipe:
Step 1: Steps to make S3 Bucket in AWS:
- Log into the AWS Management Console.
- From the house dashboard, choose buckets
data:image/s3,"s3://crabby-images/c0bab/c0bab16ba3e9801bcdc1e81e288c511959d68459" alt=""
- Click on the’ create bucket’ option
- Set the bucket name as healthcaredata and choose the region for storing it.
- For block public access settings, select the choice ‘Block all public access’ and set Bucket versioning and Default encryption to Disable. And now create the bucket.
data:image/s3,"s3://crabby-images/fb59a/fb59a6e9cf9061a5bbc34f3e17687dcd5f817588" alt=""
data:image/s3,"s3://crabby-images/2bedd/2bedd0a49eb522e750c120e671abaf845b619e23" alt=""
Step 2: Steps to make Policy in IAM:
- From the house dashboard, choose IAM (Identity & Access Management)
- Choose a policy and click create a new policy
- Choose JSON and use code below given.
data:image/s3,"s3://crabby-images/4beb4/4beb4b6e03ea280871a37241024e5ba3afb6c319" alt=""
- Click on Next tags and Next review; set the policy name healthcare-Policy and make the policy.
Step 3: Steps to make User in IAM:
- From the house dashboard, choose IAM (Identity & Access Management).
- Choose the user, click on the create user option, and set the username as snowpipeingest-user.
data:image/s3,"s3://crabby-images/34bcb/34bcbecef3fa5167a13a13f48a2c8468612193a2" alt=""
- Choose Programmatic access in Access type and click on the Next Permission.
- Click on Attach existing policies directly then select the policy created earlier named as healthcaredata-Policy and click on Next tags.
data:image/s3,"s3://crabby-images/d7549/d754959d2461cf323ce018d5089c17cd38f90ad5" alt=""
- Create the user and download the CSV file, which has the access key ID and secret access key.
data:image/s3,"s3://crabby-images/2fb85/2fb85405215d22174a314bc4705b7ca0ab4b868b" alt=""
data:image/s3,"s3://crabby-images/5b772/5b772d2af8a46e35492933515f1707b1ba2b2f99" alt=""
Step 4: Creating a table in Snowflake:
- Redirect to the Snowflake account.
- Create a table in Snowflake to store the knowledge.
data:image/s3,"s3://crabby-images/6e5d0/6e5d0c7fd27b5029a0a9b0c29ae8d0bce5e60eff" alt=""
3. Create a stage using S3 bucket name with a proper file format that’s visiting be loaded.
data:image/s3,"s3://crabby-images/bbce8/bbce8435a0167610a8d6a9fed606c1dbe5d0c5c8" alt=""
- Creating a pipe (HEALTHCARE_SNOWPIPE_LOAD) using the stage layer (HEALTHCAREDATA_STAGE)
data:image/s3,"s3://crabby-images/59cb5/59cb5d7c175a3730b025853d1f90552fe87113ee" alt=""
- Use the show pipes command and replica the Arn code from the notification channel column values
SHOW PIPES;
- Redirect to the S3 bucket page and choose the event notification under the properties.
- Click on the Event notifications and choose create event notifications.
- Set the event name as snowdailyingest-event and choose the All object create an event from Event Types.
data:image/s3,"s3://crabby-images/84dbb/84dbb9682ee4b54dbe9b03320ae5187c98497d55" alt=""
9. Upload the files into s3 bucket.
10. Redirect to snowflake and query the table to verify to ingest the data
data:image/s3,"s3://crabby-images/830d1/830d1c35c2cd2b621868e8a0afd336c1c7704f5d" alt=""
data:image/s3,"s3://crabby-images/d44aa/d44aa3ea1232f5023c9ccc5ba28902714aa61661" alt=""
In the Snowflake table, files are successfully loaded.
Step 5: Status of Snow pipe:
The status of the pipe to retrieve the current status.
data:image/s3,"s3://crabby-images/3b99b/3b99b7cb88fa177ce0bae5a630a7ba5af11fb4cd" alt=""
Author Bio:
data:image/s3,"s3://crabby-images/07dfb/07dfb512d0874507474b1867d2c8da727b8ae1f5" alt="Picture of Jyothi MODI"
Jyothi MODI
Associate Software Engineer
An energetic, diligent and motivated person with a passion for innovation, Jyothi has 1+ years of experience in CHEP US automation project. She is also technically adept in Snowflake, AWS, Python, Flask, Matillion, ML, and Ms SQL.