Architecture & Platform
Databricks DAIS 2026 HackathonTechnical reference — pipeline, ML models, Genie AI/BI, consumption layers, and CI/CD. Read the backdrop →
70
Customers
$9.2M
Revenue
12
SKUs
24mo
History
8
Gold Views
3
ML Models
Data Pipeline — Bronze → Silver → Gold → ML
Next.js App on Vercel
Public · business stakeholders
Databricks Genie AI/BI
Internal · data teams
ML Models
Customer Segmentation
K-Means Clustering (k=5)
deal2delivery_customer_segments@champion
Churn Prediction v2
XGBoost Classifier
deal2delivery_churn_model@champion
Demand Forecast
XGBoost Regressor
deal2delivery_demand_forecast@champion
Genie AI/BI — Natural Language Analytics
Databricks Genie space backed by all Gold views. Business users ask questions in plain English — Genie generates SQL and returns data-driven answers. A self-improving evaluation loop runs LLM-as-a-Judge scorers on every conversation.
LLM-as-a-Judge Scorers
App Pages
Story →
The business backdrop — NovaTech Electronics, the deal-to-delivery gap, pain points and Databricks solutions.
Dashboard →
Live KPIs, monthly revenue & units trend charts, GPT-4o insight strip.
Demand Forecast →
24-month actuals + 6-month XGBoost forecast overlay. Declining SKU alerts. Product performance table.
Simulator →
Drag a slider −50% to +100% to model demand scenarios. Unit delta + estimated revenue impact.
Inventory →
SAP MARD stock vs 6-month ML forecast per SKU. Critical / Warning / OK gap classification.
Customer Risk →
XGBoost churn scores, K-Means RFM segments, one-click GPT-4o risk explanation per customer.
CI/CD — GitHub Actions + Databricks Asset Bundles
feature/*
—PR to develop
develop
devPush (auto-deploy)
main
stagingPush (auto-deploy)
manual
prodworkflow_dispatch
Performance — Three-Layer Cache
Layer 1
Next.js ISR
API responses cached at Vercel CDN. Tab switches within 5 min hit zero Databricks queries.
Layer 2
Databricks SQL Result Cache
Identical queries return in ~200ms from warehouse cache — no re-execution on Delta.
Layer 3
Delta Cache
Hot table data cached on cluster SSD. Repeated reads skip cloud storage entirely.
Deal2Delivery
Databricks DAIS 2026 Community Virtual Hackathon