About Me
Hi! I’m Dániel Varga. Thank you for visiting my GitHub page. I’m currently focused on training and building hands-on projects in Data Engineering and Cloud Engineering. My goal is to develop strong foundations in key tools and concepts mainly in data engineering and cloud infrastructure. These includes the following skills:
🎓 Certifications
🎓 Learning Journey
💼 Contact
🛠️ Featured Projects
Through the projects featured on my site, I aim to grow my expertise and gain practical experience in solving real-world data challenges. Feel free to explore my work and connect if you want to collaborate.
-
Drift Detective is a Python library for tracking schema evolution and detecting structural drift in tabular datasets using versioned JSON snapshots.
-
From API to Database: Dockerized Airflow ETL Pipeline for Weather Data
ETL pipeline implemented in Apache Airflow that exctracts data from OpenWeatherMap API, then process it in Python, and store it in a PostgreSQL database for analytics and reporting. The setup deployed in a multi-container Docker environment.
-
Python ETL Project: Scraping, Transforming, and Loading Book Data
A Python-based ETL pipeline that scrapes book data, transforms and normalizes it, then loads it into a PostgreSQL database using Docker Compose.
-
2022 Airlines Departure Data Warehouse in PostgreSQL
PostgreSQL-based data warehouse project using the 2022 US Airlines Domestic Departure dataset. Implemented Star schema design to enable efficient analytical queries.



