About

Building systems that
move data forward

Harsha Reddy
Bangalore, India

As an engineering leader, I develop and optimize real-time data systems with a focus on reliability, scalability, and cost efficiency. Currently at Bureau, I drive data platform strategy and build AI-ready infrastructure.

What I Do

My work is centered on enabling organizations to process and leverage vast amounts of structured and unstructured data for intelligent decision-making. I've architected petabyte-scale data processing platforms, built real-time analytics systems supporting 10B+ monthly interactions, and enabled Agentic AI workflows that drive autonomous decision-making in production.

I care deeply about data quality, observability, and making data accessible to the teams that need it. A well-designed pipeline is one that the on-call engineer never has to think about.

Why I Write

This blog is where I share what I'm learning and building. I write about the practical side of data engineering — the patterns that work, the mistakes I've made, and the tools I reach for when the stakes are high. No fluff, just real experience from production systems.

Career

Principal Software Engineer (Current)

Bureau

Mar 2026 — Present · Bangalore

Driving data platform strategy, scalability, and real-time analytics.

Big Data Lead

Zzazz

May 2024 — Mar 2026 · Bangalore

Architected a globally distributed real-time analytics platform capturing 10B+ monthly interactions. Led the engineering of an enterprise-grade web analytics platform on a 100% open-source stack.

Tech Lead, Big Data

Gameskraft

Feb 2023 — May 2024 · Bangalore

Led the design and development of a high-performance Lakehouse platform powering company-wide analytics across product, growth, and business teams.

Senior Software Development Engineer

OYO Rooms

Sep 2021 — Feb 2023 · Bangalore

Achieved a 90% reduction in platform costs. Integrated Apache Druid for sub-second query performance on billions of events. Migrated the entire data platform from AWS to Azure.

Senior Data Engineer

TripAdvisor

Mar 2021 — Aug 2021 · Gurugram

Led migration from on-premise to cloud, shifting from enterprise solutions to open-source for improved scalability and cost-efficiency.

Data Engineer

Dailyhunt

Jan 2021 — Mar 2021 · Bangalore

Built a scalable big data architecture parsing and enriching 9-10 billion events and 3+ TB of data daily.

Software Development Engineer II

OYO Rooms

Jul 2018 — Dec 2020 · Gurugram

Implemented an early transactional data lake before Delta Lake existed. Built real-time data quality monitoring and automated alerting systems.

B.Tech

IIT Bhubaneswar

2014 — 2018 · Bhubaneswar

Bachelor of Technology.

Skills & Tools

Data Engineering

Apache Spark, Apache Kafka, Apache Airflow, Apache Druid, Apache Flink, dbt, Delta Lake, Debezium / CDC, Trino / Presto

Cloud & DevOps

AWS, Azure, GCP, Kubernetes, Terraform, Docker, Prometheus, Grafana, CI/CD

Languages

Python, Java, SQL, TypeScript, Scala, Bash

AI & Data

Agentic AI, LangChain / LangGraph, Vector DBs (Qdrant), RAG Pipelines, Feature Stores, Elasticsearch

Beyond Work

🏸

Badminton

Court regular

✈️

Travelling

New places, new perspectives

🏄

Surfing

Chasing waves

🪂

Paragliding

Sky's the limit