Writing on data engineering, analytics and leadership

All of my long-form thoughts on data engineering, analytics, business intelligence and building data teams collected in chronological order.

Designing Data Warehouses for Better Understanding

Foreword Earlier this year I started working on an open source data handbook with instructions to help early stage companies set up data infrastructure and build data teams. This post introduces one of the core sections of the handbook called “Good Conventions” - which is aimed at helping you design...

Creating an Open-Source Data Handbook

I'm thrilled to introduce "The Ultimate Data Handbook," a project I've been passionately working on. Cheeky, I know. You can find it online here - The Ultimate Data Handbook. This cheekily named handbook is designed specifically for startups, providing a comprehensive guide to building data i...

Building an AI Data Analyst with LLMs - Part 3

This post is the third in a series where I talk about building an Open Source AI Data Analyst using LLMs. In Part 1, we speak about the motivation behind building such an agent. In Part 2, we list some of the challenges that stand in our way and start thinking of a solution. In this article, we’...

Building an AI Data Analyst with LLMs - Part 2

This post is the second in a series where I talk about building an Open Source AI Data Analyst using LLMs. You can read the first part here. If you’re familiar with python, you can follow along with my experiments on my Github repo. Recap In the first part of this series, we talked about the need ...