Case Study · Measurement

Using UX measurement to prioritize workflow improvements

Method SUS + feedback cycles

Featured study CCMS development pilot

Window Nov 2023–Jul 2025

Scale Up to 40 respondents

I designed a UX measurement program to help our content platform team prioritize improvements across a tangled authoring ecosystem. By combining standardized usability scores with direct feedback across repeated evaluation cycles, I created a baseline for deciding what to fix, what to validate, and where automation might help next.

The problem

The team supported an authoring workflow spread across many tightly interdependent systems. Nearly every step required writers to move somewhere else: one system for part numbers, another for metadata, another for review, and another for publishing. Because the workflow was fragmented across so many tools and rules, it could take a new writer close to a year to feel comfortable using it.

The team knew users were frustrated, but the signals were scattered across tickets, Slack questions, support conversations, and anecdotal feedback. What was missing was a structured way to decide what to fix first, identify low-hanging fruit, and prove whether development and workflow improvements were making the experience better.

The approach

I selected the System Usability Scale as the consistent measure across cycles, then paired it with structured user feedback to explain what was driving the scores. I designed the repeated evaluation process to identify priorities, test whether changes helped, and distinguish genuine improvement from differences in the participant population.

The program could be reused across related workflow studies, including CCMS development cycles, article publishing, and other authoring and publishing workflows.

1

Establish a baseline — Use the System Usability Scale (SUS) to create a consistent starting point for comparison across cycles.

2

Capture context — Collect direct user feedback to understand why users scored the experience the way they did — not just what the score was.

3

Identify priority improvements — Use quantitative scores alongside qualitative feedback to separate recurring friction from one-off complaints, and focus changes where they would matter most.

4

Repeat across cycles to measure change — Run multiple evaluation rounds to track whether workflow changes moved the experience, and distinguish genuine improvement from sample or population effects.

What the CCMS pilot data showed

I ran four measurement cycles for the CCMS development pilot from November 2023 to July 2025. Comparing SUS scores with participant feedback and experience levels showed not only whether the workflow was improving, but why the results changed between cycles.

Key finding In the CCMS pilot, repeated measurement showed improvement, then revealed why the later dip was not simple regression: newer users were struggling with the broader toolchain.

SUS scores improved from 39.0 in cycle 1 to 57.0 in cycle 3, a 46% increase. In cycle 4, the score dropped to 48.9 as the sample expanded from 14 to 40 respondents. That dip was not straightforward regression: the broader sample included newer users who rated the experience significantly lower. Users with more than six months in the system averaged 68.3; those with less tenure averaged 47.3. The program made this nuance visible, distinguishing a population effect from a performance decline.

What became visible

Once measurement was consistent and repeated, patterns emerged that hadn't been visible through anecdotal feedback. System performance — speed and reliability — had a clear effect on user perception and drove score shifts between cycles more than any single feature change.

Task-level data provided a parallel signal. Against the incumbent tool, the new system showed 86% task attempt rate with 100% completion on attempts. The incumbent's higher attempt rate masked a lower overall success rate — users were more likely to abandon tasks they didn't know how to complete rather than attempt and fail. Task satisfaction ratings also trended upward across the evaluation period, confirming directional improvement even while overall SUS scores remained in the lower range.

The work established a repeatable basis for future evaluation: a consistent scoring method, a feedback collection process, and a cycle cadence that any team could run forward. Without that structure, the next round of improvements would have no baseline to measure against.

The impact

The program gave the platform team a practical way to prioritize development work across the authoring ecosystem. Instead of relying only on tickets, anecdotes, or the loudest pain points, the team could identify recurring friction, connect user feedback to platform changes, and show whether improvements were moving the experience in the right direction.

It also made a deeper issue visible: experienced users had learned how to navigate the toolchain, while newer users experienced the same workflow as significantly harder to understand. That distinction helped separate general usability problems from onboarding, workflow-complexity, and toolchain-friction issues — and pointed toward different interventions for each.