Agent workflows that finish work
I design systems where agents plan, execute, review, and leave artifacts that humans can trust.
AI Agent Systems Builder
I build production-grade AI systems, agentic workflows, evaluation platforms, and scalable full-stack products for teams that need reliable execution, not demos.

Working model
The strongest agentic products are built like operational systems: clear task boundaries, async execution, evaluation artifacts, human review, observability, and deployment discipline.
I design systems where agents plan, execute, review, and leave artifacts that humans can trust.
I care about scoring, review loops, retry behavior, and result provenance because they make AI systems operable.
I move across product surfaces, APIs, workers, data models, and deployment paths without losing the system shape.
My background spans LLM platforms, multimodal benchmarks, computer vision, and production automation.
Selected projects
A few representative projects from multimodal benchmarking and computer vision.
Multimodal benchmarking
An evaluation tool for comparing multimodal language models with a consistent prediction-to-ground-truth methodology.
Computer vision and applied ML
A patented machine-learning system for generating Digital Surface Models from imagery, reducing reliance on external elevation sources.