Data engineer with experience in optimizing databases like PostgreSQL, building ETL pipelines in AWS and GCP, and managing infrastructure with Terraform. I work with ETL orchestration tools like Apache Airflow and implement data models for scalable analytics and reporting. My experience includes building real-time data processing solutions, cache systems, and search architectures for high-volume applications. I focus on delivering high-performance solutions that support data-driven decision-making.
Pinned
- impact-insight (Public)
  GCP data pipeline predicting power outages in São Paulo based on weather data. Ingests CSV data into BigQuery, pulls real-time weather via Cloud Functions and Pub/Sub, and uses ARIMA and LightGBM m…
  HCL
- deftunes-pipeline-aws (Public)
  An end-to-end data pipeline for De Ftunes’ music purchase analytics, designed to ingest, transform, and model data for efficient analysis of song purchases, user behavior, and service trends. Utili…
  Python
- user-activity-recommendation-pipeline-aws (Public)
  End-to-end batch and streaming data pipeline on AWS to process user ratings and activity data. Leverages Amazon RDS, Glue, S3, Kinesis, and PostgreSQL with pgvector for real-time recommendation gen…
  HCL
- classic-car-retail-data-pipeline-aws (Public)
  Data pipeline on AWS using RDS, Glue, S3, and Athena, demonstrating the data lifecycle from ETL to visualization with Terraform as IaC.
  Jupyter Notebook