Applied AI Engineer
I build workflow APIs, multi-agent systems, real-time voice pipelines, and Python backends for production AI use cases. Public proof includes live demos, shipped tooling, open source work, and 3,500+ automated tests across production repos.
Production AI infrastructure — from workflow APIs to multi-agent systems
FastAPI services, async workers, Server-Sent Events, integrations, and production-friendly interfaces for multi-step AI workflows.
Multi-agent orchestration, layered caching, model routing, conditional workflows, tool use, and systems that stay understandable to hiring managers.
WebSocket streaming, voice activity detection, STT/TTS, barge-in handling, eval-driven delivery, and test-heavy engineering to keep AI systems reliable.
Best first-click proof for hiring teams: EnterpriseHub, AI Workflow API, TechNova Voice Bot, and Multi-Agent Demo. The sections below include older supporting projects as well.
3 Claude-powered SMS bots handling lead qualification for a real estate firm. 500+ leads processed, under 500ms response time, bilingual EN/ES, zero downtime over 3-month production run.
Capabilities
Stack
Async document processing with hybrid retrieval, citation-aware answers, and agentic ReAct reasoning. 95.5% F1 on a 28-case CI-replayed golden baseline; eval corpus 72 cases (51 golden + 21 adversarial, incl. prompt injection).
Capabilities
Stack
Domain-specific agent mesh with 3-tier cache achieving 88% aggregate hit rate. 8 agent capabilities, circuit-breaker failover, per-agent model routing, OWASP-hardened security, and OpenTelemetry instrumentation.
Capabilities
Stack
9 pre-built MCP servers with A2A adapter, auto-caching, rate limiting, auth middleware. MCPTestClient for testing without live API keys. Reduces LLM tool integration from days to a single import.
PR #24551 -- Surfaces AuthenticationError, RateLimitError, and NotFoundError distinctly through the Router fallback chain instead of swallowing as generic Exception. Enables callers to implement appropriate recovery strategies per error type.
Also: open PRs in FastAPI (80K+ stars, #15217) and pgvector-python (#151)
Selected from 21 certifications totaling 1,831 hours across IBM, DeepLearning.AI, Microsoft, Duke, Google, Vanderbilt, and Anthropic.
Targeting teams building workflow APIs, agent systems, real-time AI applications, and developer tooling. Best fit: Applied AI Engineer, AI Engineer, AI Backend Engineer, and LLM Engineer roles.
US-based (Cathedral City, CA) · Canadian citizen, no sponsorship required
caymanroden@gmail.com