Purpose
This note turns the evidence kit into an operating routine that a real pilot team can run.
It is for the people who have to set up, run, and maintain the measurement loop:
- technical enablement lead
- manager sponsor
- pilot coordinator
- technical lead supporting sampled review
Minimum viable tool stack
Use existing tools first.
Minimum viable setup:
- one survey or form tool
- one shared spreadsheet or table
- one issue-tracker view for pilot workflows
- one place to log exceptions or incidents
Do not add a new measurement platform unless the pilot has already outgrown the lightweight stack.
If the organization does not yet know how to instantiate that stack, use Pilot Evidence Reference Deployment - Microsoft 365 and Jira as the reference build.
Core roles
Technical enablement lead
Owns:
- kit setup
- weekly pulse review
- sampled case review coordination
- weekly action log
- pilot closeout summary
Manager sponsor
Owns:
- workflow participation support
- weekly soft-signal input
- follow-up on support gaps
- reinforcing that the kit is not used for performance management
Technical lead or reviewer
Owns:
- sampled case review quality
- verification-quality judgment
- escalation of unsafe patterns
Setup checklist before kickoff
- Choose 2-6 target workflows total across the pilot.
- Tag those workflows in the system of record (see the tagging sketch after this checklist).
- Create one shared evidence sheet.
- Decide the baseline window, usually 2-4 weeks of recent comparable work if available.
- Build the pre-pilot survey and weekly pulse in the chosen form tool.
- Confirm who reviews sampled cases each week.
- Confirm confidentiality language with the sponsor.
- Tell participants explicitly that the kit is for workflow learning, not employee ranking.
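A minimal sketch of the tagging step, assuming Jira is the system of record (per the reference build), the pilot workflows carry a label named ai-pilot, and credentials sit in environment variables; the label, variable names, and CSV layout are all illustrative assumptions, not part of the kit:

```python
# Pull pilot-tagged issues from Jira to seed the evidence sheet.
import csv
import os

import requests

JIRA_BASE = os.environ["JIRA_BASE_URL"]  # e.g. https://yourorg.atlassian.net
AUTH = (os.environ["JIRA_USER"], os.environ["JIRA_API_TOKEN"])

resp = requests.get(
    f"{JIRA_BASE}/rest/api/2/search",
    params={"jql": 'labels = "ai-pilot" ORDER BY updated DESC', "maxResults": 50},
    auth=AUTH,
    timeout=30,
)
resp.raise_for_status()

# One row per tagged workflow item, importable into the shared evidence sheet.
with open("pilot_workflow_items.csv", "w", newline="") as f:
    writer = csv.writer(f)
    writer.writerow(["key", "summary", "status", "updated"])
    for issue in resp.json()["issues"]:
        fields = issue["fields"]
        writer.writerow([issue["key"], fields["summary"],
                         fields["status"]["name"], fields["updated"]])
```

The CSV it writes can be imported into the evidence sheet to seed the workflow KPI tracker rows.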
Weekly operating rhythm
Early in the week
- update KPI rows for the target workflows
- review the weekly pulse responses
- note any immediate support needs or red flags
Midweek
- sample 5-10 cases across the pilot (see the sampling sketch at the end of this section)
- apply the rubric
- capture 1-2 strong examples and 1-2 weak examples if available
End of week
- write one short action summary
- decide refine, support, escalate, or hold
- communicate only the relevant summary, not raw individual responses
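A minimal sketch of the midweek sampling step, assuming the sampled case log exports to a CSV with case_id and workflow columns; the file and column names are illustrative:

```python
# Draw this week's review sample at random so it is not biased toward
# memorable or recently discussed cases.
import csv
import random

with open("sampled_case_log.csv", newline="") as f:
    cases = list(csv.DictReader(f))

# Default to 5-10 cases: roughly 10% of the log, clamped to that band.
k = max(5, min(10, len(cases) // 10))
k = min(k, len(cases))  # a small log just gets reviewed in full

for case in random.sample(cases, k):
    print(case["case_id"], case["workflow"])
```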
Action thresholds
These are not hard rules, but they are useful default triggers; a decision sketch after the lists below encodes them.
Escalate quickly if:
- there is a meaningful privacy, security, or policy exception
- a high-risk case scores 0 on verification quality or risk handling
- multiple cases show fluent but weakly explained output being trusted
Refine the workflow if:
- review burden rises for 2 consecutive weeks
- weekly pulse responses repeatedly report rework or confusion
- adoption quality is mixed while throughput appears better
Pause scaling if:
- a target workflow looks faster but hidden rework is rising
- managers or technical leads report growing unease that samples confirm
- participants with lower oversight readiness are moving faster without getting better at explaining the output
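A minimal sketch that encodes these defaults as a weekly decision aid; the signal names are illustrative assumptions, and the thresholds should be tuned to the pilot rather than treated as rules:

```python
def weekly_action(s: dict) -> str:
    """Map this week's signals to the default decision."""
    # Escalate quickly: safety, policy, or trust-without-verification findings.
    if (s.get("policy_exception")
            or s.get("high_risk_case_scored_zero")
            or s.get("fluent_but_weakly_verified_cases", 0) >= 2):
        return "escalate"
    # Pause scaling: apparent speed gains arriving with hidden costs.
    if (s.get("hidden_rework_rising")
            or s.get("sample_confirmed_unease")
            or s.get("low_readiness_speedup_without_explanation")):
        return "pause"
    # Refine: sustained friction signals.
    if (s.get("review_burden_up_weeks", 0) >= 2
            or s.get("pulse_reports_rework_or_confusion")
            or (s.get("mixed_adoption_quality") and s.get("throughput_up"))):
        return "refine"
    return "hold"

# Example: two weeks of rising review burden defaults to "refine".
print(weekly_action({"review_burden_up_weeks": 2}))
```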
Shared spreadsheet structure
Use Pilot Evidence Sheet - Initial Pilot Cohort as the working sheet model.
Recommended tabs (a scaffold sketch follows the list):
- pilot overview
- workflow KPI tracker
- sampled case log
- support and learning log
- weekly action log
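A minimal scaffold for that sheet, assuming openpyxl is installed; the header columns are illustrative starting points, not a prescribed schema:

```python
# Create the shared evidence workbook with the five recommended tabs.
from openpyxl import Workbook

TABS = {
    "pilot overview": ["workflow", "owner", "baseline window", "notes"],
    "workflow KPI tracker": ["week", "workflow", "kpi", "value", "notes"],
    "sampled case log": ["week", "case_id", "workflow", "rubric score", "notes"],
    "support and learning log": ["date", "who", "gap or lesson", "follow-up"],
    "weekly action log": ["week", "decision", "rationale", "owner"],
}

wb = Workbook()
wb.remove(wb.active)  # drop the default empty sheet
for title, headers in TABS.items():
    ws = wb.create_sheet(title=title)
    ws.append(headers)  # header row only; rows accumulate during the pilot

wb.save("pilot_evidence_sheet.xlsx")
```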
Reporting rule
Keep the output simple.
The weekly summary should answer five questions (a template sketch follows the list):
- what improved
- what got heavier
- what needs support
- what needs inspection
- what should not scale yet
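A minimal sketch of that summary as a fill-in template; the field names mirror the five questions, and the sample answers are placeholders:

```python
# Render the five-question weekly summary from short answers.
SUMMARY_TEMPLATE = """Week {week} pilot summary
Improved: {improved}
Got heavier: {heavier}
Needs support: {needs_support}
Needs inspection: {needs_inspection}
Should not scale yet: {hold}
"""

print(SUMMARY_TEMPLATE.format(
    week=6,
    improved="triage turnaround in workflow A",
    heavier="review burden in workflow B",
    needs_support="prompt patterns for newer participants",
    needs_inspection="fluent but thinly verified output in workflow B",
    hold="workflow C until rework stabilizes",
))
```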
Closeout routine
At pilot close:
- run the post-pilot survey
- compare baseline and pilot-period workflow signals (see the comparison sketch after this list)
- summarize the strongest good and bad examples
- name hidden costs, not just gains
- recommend refine, pause, or scale
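A minimal sketch of the baseline comparison as percent change per workflow signal; the example numbers are placeholders, and real values come from the workflow KPI tracker tab:

```python
def signal_deltas(baseline: dict, pilot: dict) -> dict:
    """Percent change per signal between the baseline window and the pilot period."""
    return {
        signal: round(100 * (pilot[signal] - base) / base, 1)
        for signal, base in baseline.items()
        if signal in pilot and base
    }

# Placeholder numbers: cycle time down 20% but rework up 50%, i.e. a gain
# arriving with a hidden cost that the closeout summary should name.
print(signal_deltas(
    {"cycle_time_days": 5.0, "rework_rate": 0.10},
    {"cycle_time_days": 4.0, "rework_rate": 0.15},
))
```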
What to avoid
- auditing everything
- building a scorecard for individuals
- chasing statistical proof in a tiny pilot
- keeping weak metrics because they are easy to collect
- hiding uncertainty in the summary language