Healthcare R&D Data Platform
Principal Solutions Architect
Built end-to-end data pipelines on Databricks for a pharmaceutical company's research and development teams. The challenge was making scientific data ...
Hello, I'm Binu
// Data Engineer · Solutions Architect · Bookworm · Occasional Cook
Building data pipelines at scale — the kind that process millions of records without breaking at 3am.

class Binu:
    role = "Principal Solutions Architect"
    stack = ["Databricks", "PySpark", "AWS"]
    os = "CachyOS (btw)"

    def build(self, data):
        # the fun part
        return Pipeline(data).scale()

I build end-to-end data systems on Databricks, Snowflake, and AWS. Before that, I was building full-stack apps, and that experience still shapes how I think about data. For me, it's not just about pipelines, but about how the data flows all the way to the end user.
Started the journey — barely knew React, eager to learn everything.
Full-stack development with Hasura, Azure Functions, Next.js. Built ecommerce platforms and internal tools.
Shifted to data engineering while keeping full-stack roots. AWS CDK, AppSync, and first Databricks projects.
Leading data engineering initiatives. Databricks, PySpark, building pipelines that process millions of records.
Solutions Architect
Worked with a team to build a full-stack portal for managing access across multiple AWS accounts within an organization. The old way involved long-liv...
Full-Stack Developer
Worked with a team to design and build an ecommerce platform where each customer gets their own instance with a dedicated subdomain — fully white-labe...

When PySpark UDFs aren't enough — how plain Python saved a data pipeline that Spark couldn't handle alone.
How building robust data pipelines transforms the way pharmaceutical R&D teams discover and validate insights.

Exploring the power of combining Spark with Delta Lake for reliable, performant data lakehouse architectures.

A step-by-step guide to setting up AWS CDK Pipelines with GitHub as the source using CodeStar Connections.

Getting started with serverless computing on Azure — from setup to deployment.

A walkthrough of building data pipelines for healthcare research — from messy source data to clean, queryable datasets.

Live session covering the full data engineering workflow on AWS — from ingestion to analytics.
96 books on Goodreads and counting. Brandon Sanderson's Cosmere universe has most of my attention — intricate magic systems, interconnected worlds, the kind of world-building that rewards re-reads. Stormlight Archive is my happy place, for all its chaos. Robert Jordan's Wheel of Time is up there too (Mat Cauthon is the best character, fight me). Also: Harry Potter (always), Dan Brown's thrillers, and whatever catches my eye next.
Goodreads Profile
My Spotify is a mess in the best way: Malayalam film soundtracks, country music, Romanian pop, Lana Del Rey, Avicii, Irish folk, and whatever else I stumbled into that week. Yes, country. Luke Bryan, Johnny Cash, the whole Forever My Girl soundtrack. Songs across languages I don't even speak. I follow the music, not the genre.
Spotify Profile
Experimental. Whatever catches my eye on Instagram gets tried. No cuisine loyalty, no meal prep discipline, just curiosity and a reasonably stocked kitchen.
Usually with an audiobook or music. Sometimes a podcast. The walking part isn't a cameo — it's how I think through problems or stop thinking about them entirely.
StarTalk Radio is my background noise of choice. Astrophysics, AI, black holes, the occasional argument about time travel. Neil deGrasse Tyson makes it surprisingly easy to fall down the rabbit hole.
StarTalk
Seven-plus years of daily driving Linux: Manjaro first, now CachyOS. Work, personal, everything. There's something satisfying about an OS that lets you break things and fix them on your own terms.
Let's build something together.
I'm always up for conversations about data engineering, distributed systems, or building things that work at scale. Also happy to talk about Sanderson theories, music recommendations, or whatever else.