Who I Am
About me.
Case studies and notes on the tools I build.
What cloudwright generates from a one-line prompt and why it matters.
100 tasks, sigmoid scoring, 12 capability dimensions. What the workflow benchmark actually measures.
A side-by-side benchmark of seven retrieval strategies on user-supplied corpora.
What you can see in a graph that you can't see across four separate systems.
DPWH says the project is done. Sentinel-2 says nothing was built.
An MCP server that puts Philippine civic data inside any agent.
Why on-device LLMs make sense for regulated enterprise support, and what shipping one looks like.