I help teams improve how people find, use, and maintain information.
ProcessUnderstand the work→Identify the friction→Improve the system→Measure the result
Featured
Case Study · AI Response Quality
Found that most low-scoring AI responses were incomplete, not incorrect
I evaluated 949 GenAI responses across seven product teams and found that low-scoring answers were usually incomplete rather than wrong, changing how teams diagnosed and fixed them.
Failure-pattern analysis of 278 low-scoring responses
Developed shared guidance for improving RAG content
I led a cross-functional working group of 20–30 contributors across five workstreams to identify which content characteristics improved retrieval and response quality, then turned the findings into shared guidance.
First draft of shared guidance released · 20–30 contributors · 5 workstreams
Improved a workflow's usability score from 39 to 57
I designed a measurement approach that helped the platform team identify what to fix, what to validate, and which workflow changes improved the authoring experience.
KitAI Answerability Audit KitFind why AI responses fail and whether the source content is part of the problem. Includes an audit template, scoring guide, and interpretation notes.