Blog
Short technical notes on GenAI, data engineering, and platform reliability.
May 2026
Scaling RAG Pipelines in Production
Lessons learned in deploying enterprise-grade RAG pipelines, managing vector database latency, and ensuring data freshness.
April 2026
Spark Job Tuning That Actually Reduced Costs
How partition strategy, adaptive execution, and skew handling brought measurable runtime improvements.
March 2026
Practical Data Quality Contracts
A lightweight framework for schema checks, freshness SLAs, and ownership boundaries.
February 2026
Airflow DAG Patterns for Stable Production Runs
Patterns that improved reliability, observability, and developer velocity in orchestration workflows.
January 2026
Evaluating LLM Performance: Beyond Human Review
Implementing automated evaluation metrics for LLM-based data applications.