About

AI Engineer with 4 years shipping production AI and backend systems and 10 years of client-facing operations. I build production RAG, agent workflows, eval pipelines, and FastAPI services. Remote, US-based.

Who I Am

Based in the Inland Empire, California. 4 years building production AI systems -- FastAPI backends, retrieval pipelines, agent workflows, eval harnesses, and public demos. 10 years of client-facing operations before that, which still shapes how I scope work, communicate tradeoffs, and deliver under real constraints.

I hold 21 certifications from Google, IBM, Microsoft, DeepLearning.AI, Duke, Vanderbilt, and Anthropic (1,831 hours total), but the main proof on this site is shipped systems: DocExtract AI, Jorge Real Estate AI, EnterpriseHub, and mcp-server-toolkit on PyPI. Open source contributor to LiteLLM, FastAPI, and pgvector-python.

What I Do

Production AI Delivery

Applied AI Systems That Ship

I build AI systems that have to work inside real workflows: RAG pipelines, agent workflows, FastAPI services, and evaluation gates. Strongest public proof includes a Claude-powered lead qualification platform that processed 500+ leads, DocExtract at 95.5% F1, and mcp-server-toolkit on PyPI.

Backend And Reliability

Backend Systems Around LLMs

My strongest layer is the application layer around LLMs: retrieval, tool use, orchestration, backend APIs, eval harnesses, caching, and operational debugging. Across public production repos, I have 3,500+ automated tests and a consistent bias toward reliability over demo-only polish.

Stakeholder Ownership

Client-Facing Execution Without Role Drift

The strongest non-code part of my profile is delivery ownership. I came from 10 years of client-facing operations, so I am comfortable translating vague requirements into working systems, communicating constraints clearly, and making AI useful in live environments.

How I Work

1

Discovery

Understand your data, existing systems, constraints, and what "done" looks like. No scope creep — explicit deliverables before code starts.

2

Architecture

Design the system: data flow, component boundaries, API contracts, caching strategy, and deployment plan. You review before implementation.

3

TDD Build

Test-driven development. Tests first, then implementation, then refactor. CI runs on every push. You see progress in real-time via GitHub.

4

Delivery

Deployed with Docker, documented with examples, demo mode included. Handoff includes architecture docs, test coverage report, and a walkthrough call.

What I Bring to a Team

Strengths

  • Production Python AI systems with retrieval, orchestration, evals, and backend delivery
  • FastAPI, PostgreSQL, Redis, Docker, and API integration in real workflows
  • Public proof stack: 500+ leads processed, 95.5% F1 on CI-replayed golden baseline, 3,500+ tests, PyPI published
  • Open source contributions to LiteLLM, FastAPI, and pgvector-python
  • 10 years of client-facing ops applied to technical delivery and stakeholder communication

Target Roles

  • AI Engineer / Applied AI Engineer
  • AI Backend Engineer / LLM Platform Engineer
  • Selective Forward Deployed Engineer roles
  • Remote, US-based, no sponsorship required

Stack

Backend & APIs

Python, FastAPI, REST APIs, PostgreSQL, Redis, Docker, GitHub Actions CI/CD, GoHighLevel CRM integration

Evaluation & Reliability

pytest, pytest-asyncio, RAGAS, LLM-as-judge, adversarial fixtures, GitHub Actions CI/CD, OpenTelemetry, structured logging

Retrieval & Data

SQL, PostgreSQL, pgvector, Redis, embeddings, semantic search, BM25, reciprocal rank fusion, Streamlit, Plotly

AI APIs & Orchestration

Claude API, OpenAI API, Gemini API, RAG pipelines, agent workflows, tool use, structured output, LangGraph, prompt engineering

Let's Talk

Open to remote AI Engineer, Applied AI, AI Backend, and LLM Platform roles. Selectively open to forward-deployed work that still centers on building and shipping real systems.

View all certifications →