Recent posts

Certification Exam Resources

3 minute read

I’ve put together a collection of resources I’ve used while preparing for various certification exams – covering everything from AWS to Databricks. The focus is mainly on data engineering and machine learning topics.

Databricks Generative AI Associate Exam Study Notes

19 minute read

In this blog post, I’m sharing my experience taking the Databricks Generative AI Associate exam – from study notes to resources that made a difference. Whether you’re just starting your prep or looking for extra insights, this guide will help you find the right resources to get prepared.

Handle Irregular Bursts of Files using EventBridge and Glue Workflow

7 minute read

Exploring ways of handling irregular and sudden bursts of multiple files for data processing using event driven architecture on AWS. This blog posts showcases how to use S3 notification with EventBridge to trigger a Glue Workflow that has number of events and batch window trigger conditions.

Personal Project - pytransflow

33 minute read

I’m thrilled to present pytransflow, a Python library I developed in my free time. pytransflow simplifies record-level processing through transformation flows defined in YAML files. I hope you find this library engaging and that it sparks your interest to both use and contribute to it.


Read more