📈 Introduction:

Remember when you built your first end-to-end data pipeline in the mini-project? Now it's time to take it to the next level by automating and optimizing that pipeline with Terraform, CI/CD, and automated tests. This advanced project will sharpen your technical skills and make your data pipeline more robust, scalable, and maintainable, using practices that are standard on production data engineering teams.

🎯 Project Definition:

Build and enhance an ETL data pipeline that extracts hourly weather information for a selected location, and automate and optimize how that pipeline is deployed. Use Terraform to manage your GCP infrastructure, set up CI/CD pipelines for smooth, automated deployments, and add automated tests to verify the correctness and reliability of your pipeline. Store the retrieved data in BigQuery and schedule the pipeline to run automatically every hour.
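
To make the "automated tests" part concrete, here is a minimal sketch of a transform step and a unit test for it. Everything in it is an assumption for illustration: the project does not prescribe a weather API, so the input field names (`dt`, `temp_c`, `humidity`), the output column names, and the function name `transform_observation` are all hypothetical. The test is written in the style that a runner such as pytest would pick up, but it is a plain function with assertions.

```python
from datetime import datetime, timezone
from typing import Any


def transform_observation(raw: dict[str, Any], location: str) -> dict[str, Any]:
    """Flatten one raw hourly record into a row ready to load into a table.

    The input keys (dt, temp_c, humidity) are hypothetical; adapt them to
    whatever the weather API you choose actually returns.
    """
    # Interpret the Unix timestamp as UTC so rows are comparable across runs.
    observed_at = datetime.fromtimestamp(raw["dt"], tz=timezone.utc)
    return {
        "location": location,
        "observed_at": observed_at.isoformat(),
        "temperature_c": float(raw["temp_c"]),
        "humidity_pct": int(raw["humidity"]),
    }


def test_transform_observation() -> None:
    # A hand-written sample record, not real API output.
    raw = {"dt": 0, "temp_c": "12.5", "humidity": 80}
    row = transform_observation(raw, "Berlin")
    assert row == {
        "location": "Berlin",
        "observed_at": "1970-01-01T00:00:00+00:00",
        "temperature_c": 12.5,
        "humidity_pct": 80,
    }


if __name__ == "__main__":
    test_transform_observation()
```

Keeping the transform as a pure function, with no network or BigQuery calls inside it, is one way to let a CI/CD pipeline run tests like this on every commit without needing cloud credentials.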

🦸 Who is this project for?

🎯 Learning Objectives:

By the end of this project, you'll be able to: