Selected projects and research by Abe Flansburg — Principal Engineer & AI Architect

At Altitude Health

  • Clinical signal extraction & feature engineering pipeline — Owner — pipeline, infra, prompts, evaluation. KFP v2 pipelines compiled and submitted to Vertex AI Pipelines, triggered by Pub/Sub. Extracts clinical signals from encounter notes via Gemini batch prediction across 100k+ encounters per run, then engineers per-patient feature matrices with risk stratification — multi-tenant, multi-condition (diabetes, hypertension, CKD, obesity, dyslipidemia, CHF, ASCVD) in a single pass. Technologies: Vertex AI Pipelines, KFP v2, Gemini, BigQuery, Pub/Sub, Multi-tenant, Python.
  • Data platform infrastructure-as-code — Owner — Terraform across prod + staging. Terraform for the data platform across two GCP projects. GCS, BigQuery, KMS, Artifact Registry, Pub/Sub, IAM, monitoring, plus Workload Identity Federation for GitHub Actions CI/CD and Fivetran Private Service Connect. Prod/staging isolation chosen deliberately to keep state migration off the critical path of immutable resources like KMS keyrings and data-bearing buckets. Technologies: Terraform, GCP, BigQuery, KMS, Pub/Sub, WIF, Private Service Connect.
  • Internal Slack-fronted agentic knowledge base — Built end-to-end — bot, agent runtime, wiki, infra. Slack bot on Cloud Run dispatches to an Anthropic managed agent running in a sandboxed container. The agent works against a 300+ page LLM-owned wiki using custom tools (Slack search, BigQuery, git, URL fetch) plus built-in agent tools (bash, read/write, grep, web search). Ingests URLs, PDFs, meeting notes, and Slack threads into structured wiki pages that compound over time. Technologies: Anthropic managed agents, Cloud Run, Slack, BigQuery, Python, Terraform.
  • Cloud Composer (Airflow 3) platform — Migrated from a Tuva canonical model to dbt + Airflow on GCP. Terraform for Cloud Composer plus the production DAG repository. A layered local validation pipeline (py_compile → safe-mode discovery → tenants schema → full DagBag parse → factory unit tests with mocked GCP clients) runs before deploy so broken DAGs never reach Composer. Led the original migration off an overly complex Tuva canonical model. Technologies: Airflow 3, Cloud Composer, Terraform, dbt, BigQuery, Python, uv.
  • Internal viz publishing platform — Built — one hostname, two Cloud Run services. One hostname, split at the load balancer URL map. /ingest/* is public-but-rate-limited (Cloud Armor, shared-secret auth) and accepts JSON envelopes over HTTP or via an MCP server mounted at /mcp for agent clients. /* is org-only behind IAP and renders items in sandboxed per-item iframes — firehose feed, per-user feeds, shareable permalinks. Technologies: FastAPI, Cloud Run, GCS, IAP, Cloud Armor, MCP, Terraform.
  • Reproducible notebook environments — Standardized envs for signal extraction + pipeline EDA. Standardized uv-managed notebook environments for clinical signal extraction and feature-pipeline EDA. Reproducible across team members and across Colab / local Jupyter / VS Code — eliminates the "works on my notebook" failure mode that data engineering teams keep paying for. Technologies: Python, uv, Jupyter, Colab.

Co-authored research

  • Clinician Overrides as Implicit Preference Signals for Clinical AI in Value-Based Care (arXiv 2604.28010 — April 30, 2026). Authors: Prabhjot Singh, Abhishek Gupta, Chris Betz, Brett Ives, Sudeep Lama, Abe Flansburg, Jung Hoon Son. A framework that interprets clinician overrides of AI recommendations as data for training reward models — treating clinical disagreement as preference signal rather than as noise to discard. Available at https://arxiv.org/abs/2604.28010.
  • A Reinforcement Learning Framework for Chronic Disease Management (arXiv 2605.09818 — May 2026). Authors: (co-authored — see arXiv for full author list). Reinforcement learning applied to chronic disease management, addressing reward sparsity over long-horizon outcomes by incorporating learned preference signals from clinical interactions. Available at https://arxiv.org/abs/2605.09818.

Personal & open-source projects

  • CritPost — Production — agentic content analysis. An agentic platform that analyzes LinkedIn content for AI-generated tells, engagement bait, and signal-to-noise. SvelteKit frontend, Python FastAPI backend orchestrating up to 10 BAML-defined agents, Postgres on Cloud SQL, deployed on Cloud Run. Technologies: SvelteKit, FastAPI, BAML, Claude, GPT, Postgres, Cloud Run. critpost.com
  • GIFSlop — Public gallery — agent-generated GIFs. A Claude agent writes Python that draws frames with PIL. A Cloud Run service stitches them into GIFs and publishes to a public manifest. A SvelteKit static site polls the manifest and renders the cards. Two trigger paths: local CLI, or open a GitHub issue and a daemon runs the same agent loop. Technologies: Claude Code SDK, Python, FastAPI, Cloud Run, SvelteKit, Firebase, Terraform. gifslop.com
  • status-trend — Open source — terminal dashboard. A Go TUI built with Bubble Tea and Lip Gloss that aggregates outage and incident data from Claude, OpenAI, Google Cloud AI, Cohere, GitHub, and Vercel into one dashboard. Distributed via a Homebrew tap. Technologies: Go, Bubble Tea, Lip Gloss, Homebrew. github.com/aflansburg/status-trend
  • stratumflow — Open source — data engineering helpers. Python tooling that smooths internal data engineering workflows on Google Cloud. Open-sourced after enough colleagues asked for the helpers. Technologies: Python, GCP, BigQuery. github.com/aflansburg/stratumflow
  • rag-in-a-box — Teaching artifact — RAG, small enough to grok. A RAG implementation small enough to actually understand. Drop in a PDF or text, watch the embeddings visualize, query the vector store, see the inference call. Local embeddings via LM Studio, Claude for inference. Technologies: Python, Embeddings, Claude, LM Studio. github.com/aflansburg/raginabox
  • engabe.com — This site. SvelteKit + Tailwind 4 + Svelte 5 runes. Terminal aesthetic with a side of Rick & Morty (see /c-137). Features OpenAI streaming synthesis, Playwright web scraping, three.js portals, an mdsvex dispatch system, and a side gallery of original music compositions. Technologies: SvelteKit, Svelte 5, Tailwind 4, mdsvex, OpenAI, three.js. /music · /c-137
PROJECTS /

Selected work

What I've built recently, what it does, and the tools it runs on.For deeper conversations, please reach out.

// at altitude health · principal engineer · clinical AI + data platform

Clinical signal extraction & feature engineering pipeline

[ internal ]
Owner — pipeline, infra, prompts, evaluation

KFP v2 pipelines compiled and submitted to Vertex AI Pipelines, triggered by Pub/Sub. Extracts clinical signals from encounter notes via Gemini batch prediction across 100k+ encounters per run, then engineers per-patient feature matrices with risk stratification — multi-tenant, multi-condition (diabetes, hypertension, CKD, obesity, dyslipidemia, CHF, ASCVD) in a single pass.

Vertex AI PipelinesKFP v2GeminiBigQueryPub/SubMulti-tenantPython

Data platform infrastructure-as-code

[ internal ]
Owner — Terraform across prod + staging

Terraform for the data platform across two GCP projects. GCS, BigQuery, KMS, Artifact Registry, Pub/Sub, IAM, monitoring, plus Workload Identity Federation for GitHub Actions CI/CD and Fivetran Private Service Connect. Prod/staging isolation chosen deliberately to keep state migration off the critical path of immutable resources like KMS keyrings and data-bearing buckets.

TerraformGCPBigQueryKMSPub/SubWIFPrivate Service Connect

Internal Slack-fronted agentic knowledge base

[ internal ]
Built end-to-end — bot, agent runtime, wiki, infra

Slack bot on Cloud Run dispatches to an Anthropic managed agent running in a sandboxed container. The agent works against a 300+ page LLM-owned wiki using custom tools (Slack search, BigQuery, git, URL fetch) plus built-in agent tools (bash, read/write, grep, web search). Ingests URLs, PDFs, meeting notes, and Slack threads into structured wiki pages that compound over time.

Anthropic managed agentsCloud RunSlackBigQueryPythonTerraform

Cloud Composer (Airflow 3) platform

[ internal ]
Migrated from a Tuva canonical model to dbt + Airflow on GCP

Terraform for Cloud Composer plus the production DAG repository. A layered local validation pipeline (py_compile → safe-mode discovery → tenants schema → full DagBag parse → factory unit tests with mocked GCP clients) runs before deploy so broken DAGs never reach Composer. Led the original migration off an overly complex Tuva canonical model.

Airflow 3Cloud ComposerTerraformdbtBigQueryPythonuv

Internal viz publishing platform

[ internal ]
Built — one hostname, two Cloud Run services

One hostname, split at the load balancer URL map. /ingest/* is public-but-rate-limited (Cloud Armor, shared-secret auth) and accepts JSON envelopes over HTTP or via an MCP server mounted at /mcp for agent clients. /* is org-only behind IAP and renders items in sandboxed per-item iframes — firehose feed, per-user feeds, shareable permalinks.

FastAPICloud RunGCSIAPCloud ArmorMCPTerraform

Reproducible notebook environments

[ internal ]
Standardized envs for signal extraction + pipeline EDA

Standardized uv-managed notebook environments for clinical signal extraction and feature-pipeline EDA. Reproducible across team members and across Colab / local Jupyter / VS Code — eliminates the "works on my notebook" failure mode that data engineering teams keep paying for.

PythonuvJupyterColab
// research · co-authored, arxiv 2026
PAPER 01 · arXiv 2604.28010 — April 30, 2026

Clinician Overrides as Implicit Preference Signals for Clinical AI in Value-Based Care

Prabhjot Singh, Abhishek Gupta, Chris Betz, Brett Ives, Sudeep Lama, Abe Flansburg, Jung Hoon Son

A framework that interprets clinician overrides of AI recommendations as data for training reward models — treating clinical disagreement as preference signal rather than as noise to discard.

PAPER 02 · arXiv 2605.09818 — May 2026

A Reinforcement Learning Framework for Chronic Disease Management

(co-authored — see arXiv for full author list)

Reinforcement learning applied to chronic disease management, addressing reward sparsity over long-horizon outcomes by incorporating learned preference signals from clinical interactions.

// personal & open source · side work, tools, teaching artifacts

CritPost

Production — agentic content analysis

An agentic platform that analyzes LinkedIn content for AI-generated tells, engagement bait, and signal-to-noise. SvelteKit frontend, Python FastAPI backend orchestrating up to 10 BAML-defined agents, Postgres on Cloud SQL, deployed on Cloud Run.

SvelteKitFastAPIBAMLClaudeGPTPostgresCloud Run

GIFSlop

Public gallery — agent-generated GIFs

A Claude agent writes Python that draws frames with PIL. A Cloud Run service stitches them into GIFs and publishes to a public manifest. A SvelteKit static site polls the manifest and renders the cards. Two trigger paths: local CLI, or open a GitHub issue and a daemon runs the same agent loop.

Claude Code SDKPythonFastAPICloud RunSvelteKitFirebaseTerraform

status-trend

Open source — terminal dashboard

A Go TUI built with Bubble Tea and Lip Gloss that aggregates outage and incident data from Claude, OpenAI, Google Cloud AI, Cohere, GitHub, and Vercel into one dashboard. Distributed via a Homebrew tap.

GoBubble TeaLip GlossHomebrew

stratumflow

Open source — data engineering helpers

Python tooling that smooths internal data engineering workflows on Google Cloud. Open-sourced after enough colleagues asked for the helpers.

PythonGCPBigQuery

rag-in-a-box

Teaching artifact — RAG, small enough to grok

A RAG implementation small enough to actually understand. Drop in a PDF or text, watch the embeddings visualize, query the vector store, see the inference call. Local embeddings via LM Studio, Claude for inference.

PythonEmbeddingsClaudeLM Studio

engabe.com

This site

SvelteKit + Tailwind 4 + Svelte 5 runes. Terminal aesthetic with a side of Rick & Morty (see /c-137). Features OpenAI streaming synthesis, Playwright web scraping, three.js portals, an mdsvex dispatch system, and a side gallery of original music compositions.

SvelteKitSvelte 5Tailwind 4mdsvexOpenAIthree.js