I work on AI. For the past few years that has mostly meant evaluation - deciding what to measure, and building the infrastructure to measure it properly.

When I was at Groq, I built openbench - an open-source framework for running language-model benchmarks easily and reproducibly.

Now
Staff Software Engineer at Meta Superintelligence Labs
Before
Led evals at Groq; NVIDIA LPU, after the licensing deal
From
Boca Raton, FL. Now in Menlo Park, CA
Reading
Isles of the Emberdark - Brandon Sanderson