Projects
Each project targets a specific data engineering skill — pipelines, infrastructure, streaming, and analytics. Click through for details and links to live demos or source code.
Data Engineering Planned
Real-Time Streaming Pipeline
Kafka + Flink pipeline processing live event data at scale
KafkaFlinkJavaDockerPostgreSQL
ML / AI In Progress
Healthcare Analytics — NL-to-SQL
Natural language queries against a real star-schema warehouse with SCD2, dbt artifact context, and transparent SQL reasoning
DuckDBdbtPythonStreamlitAnthropic APIPlotlyCloudflare R2GitHub Actions