Scored 0.390/0.409 on the Vesuvius Challenge Surface Detection — the 1st place team hit 0.614 with nnU-Net. That gap sent us down a rabbit hole.
We built four independent 3D segmentation pipelines to understand what actually matters: standard U-Net, deep supervision + focal loss, multi-scale fusion, and finally nnU-Net itself. Each pipeline taught us something — V1 showed us our GPU was idle 90% of the time (CPU feature bottleneck), V2 proved skeleton recall beats clDice, V3 NaN'd at epoch 55 from FP16 overflow, and nnU-Net is training on HF Jobs right now.
The 1st place post-processing pipeline (binary closing, height-map patching, LUT hole plugging) turned out to be a bigger lever than we expected. We only had access after the competition closed and writeups were published.
Built an agentic Clinical Decision Support system powered by MedGemma 27B
Paste a patient case → get a full clinical decision support report with differential diagnoses, drug interaction checks (via real OpenFDA/RxNorm APIs), guideline recommendations from a 62-guideline RAG corpus, and automated care gap detection.
6-step agentic pipeline, all streaming in real time:
Parse free-text → structured patient data Clinical reasoning → ranked differential diagnosis Drug interaction check → real FDA database queries Guideline retrieval → RAG over 14 medical specialties Conflict detection → gaps between guidelines and patient care Synthesis → comprehensive CDS report Built with MedGemma 27B, FastAPI, ChromaDB, Next.js. No LangChain — custom orchestrator throughout.