← Dashboard ← Sprint Three: Test and Build

Sprint Four: Plan and Activate

Septapod Financial AI Strategy Engagement

Return & Assessment

Brent returns after the gap period. Assess what transferred: the independent pilot Septapod ran without Brent, check-in call themes, champion network activity, and capability transfer evidence. The gap period is a deliberate design feature, not dead time.

Module info Weeks 1-2 11 hrs facilitator 1.5 hrs CEO 0.5 hrs per exec
When: Weeks 1-2 of Sprint Four. Assessment draws on evidence from the gap period. Brent returns with observations from regular check-in calls and a point of view on what transferred.

Time per person

Facilitator (Brent) 11 hrs Return assessment facilitation, independent pilot review, capability transfer evaluation, outcome metrics comparison
CEO 1.5 hrs Assessment session (60 min), gap-period debrief (30 min)
Each senior exec 0.5 hrs Champion network assessment, gap-period observations
Internal owner 3 hrs Independent pilot debrief, capability transfer scorecard, champion network assessment

What actually happens

Brent returns and assesses what the gap period produced. The independent pilot Septapod ran without Brent is evaluated against the same criteria used for the Sprint Three pilots. Outcome metrics (rework rate, cycle time, error-catch rate) are compared between the two Sprint Three pilots and the independent pilot. Check-in call themes are reviewed for patterns. The capability transfer scorecard measures whether the distributed governance model, the per-pilot method, the signal watch-list, and the cross-functional AI responsibilities functioned without Brent.

Through-line

Generates
Independent pilot evaluation. Outcome metrics comparison (two pilots side by side). Check-in call theme inventory. Champion network assessment. Capability transfer scorecard with five indicators.
Value
Makes the gap period visible as evidence, not as a gap. Reveals whether the capability transfer model worked: did Septapod use the tools Brent taught them? Where did it break down? This is the data that makes Sprint Four's scaling decisions evidence-based.
How Septapod uses it
The assessment drives what Sprint Four prioritizes. Strong transfer means Sprint Four formalizes and scales. Weak transfer means Sprint Four diagnoses what broke and adjusts the approach before scaling.
Next step uses
Step 2 (Scenario Planning) uses pilot evidence and outcome metrics to ground scaling decisions. Step 3 (Annual AI Plan) uses the capability transfer scorecard to shape governance model updates.

Sprint Three Outputs (carried forward)

Data from Sprint Three's pilots and synthesis, loaded from your browser. Read-only.

No Sprint Three data found. Complete Sprint Three first, or continue without carried-forward outputs.

Sprint Three Pilot Summary

The two pilots run during Sprint Three, one co-led with Brent and one advised. Read-only, loaded from Sprint Three data. Displayed alongside the independent pilot for comparison.

No Sprint Three pilot data found. Complete Sprint Three's Operational Pilots first.

Independent Pilot (Gap Period)

Evidence from the pilot Septapod ran independently during the gap period. Same field structure as Sprint Three's pilot template for direct comparability. This is the test of capability transfer: did Septapod use the per-pilot method (Anthropic patterns, eval-driven loop, PAIR design) without Brent?

From Anthropic's "Building Effective Agents." Same six patterns used in Sprint Three. Which pattern did the independent pilot use? If the team did not formally select one, choose "Unknown" at the bottom.

Scenario probes (independent pilot)

Outcome Metrics (Superadditive)

Three metrics from Superadditive's "Metrics and Meat Shields." Compared side by side for the two Sprint Three pilots and the independent gap-period pilot. Do not measure token spend, prompts per day, tool usage hours, or percent of AI-generated content. Those reward the wrong behavior.

Rework Rate

Percentage of AI-assisted work that requires human correction or redo. Industry baseline is approximately 15%. Above this threshold, the AI is creating net additional work rather than reducing it.

Cycle Time

How long the workflow takes with AI compared to before. Measured as before/after comparison. A pilot that increases cycle time has not yet found its shape.

Error-Catch Rate

Percentage of AI errors that human reviewers catch before the output leaves the workflow. A rate of zero means reviewers stopped looking, not that the AI stopped making errors. Track this to detect automation complacency.

Check-In Call Summary

Themes from the regular check-in calls during the gap period. Not a transcript. Patterns: what came up repeatedly, what escalated, what was never mentioned. Each row captures a theme, not a meeting.

Theme 1

Champion Network Assessment

3-5 people across departments who provide peer-to-peer support for AI adoption. Assess gap-period activity: who stayed active, who went quiet, which departments engaged. Maps to the 35% staff-function resistance finding.

No champion network data carried forward from Sprint Three. Champions may be a new Sprint Four activity.

Champion 1

Capability Transfer Scorecard

Five indicators of whether Septapod built internal capability during the gap period. Each indicator is yes/no/partial with space for evidence. The scorecard tests the core premise of the engagement: that Septapod can operate independently after Sprint Four.

1. Did the team use the per-pilot method?

Anthropic agent patterns, eval-driven loop, PAIR human-side design. Did they follow the method or improvise?

2. Did the governance model function without Brent?

The distributed accountability model (four slots: pilot oversight, vendor evaluation, annual plan refresh, board reporting). Did the named people carry their accountabilities?

3. Was the signal watch-list used?

Did the people assigned to watch specific signals (market shifts, regulatory changes, technology capabilities) check them on cadence? Were any trigger conditions met during the gap?

4. Did shadow-AI behavior change from the Sprint One baseline?

Sprint One's shadow-AI audit (Step 3) established a baseline. Compare current state to that baseline. Improvement means the employee communication cadence and sanctioned-use policies took hold.

Sprint One shadow-AI baseline will load automatically.

5. Did the champion network provide peer-to-peer support?

Champions are the distributed support layer. Did they help colleagues adopt AI tools, answer questions, and reduce resistance?

Scenario Planning & Scaling

Build coherent futures from scenario probes accumulated during Sprint Three. Classify pilots by coupling tier. Decide what scales and what gets retired, with evidence thresholds and the 6-8 week stability constraint.

Module info Weeks 2-4 11 hrs facilitator 1.5 hrs CEO 1.5 hrs per exec 2 hrs internal owner
When: Weeks 2-4. Builds on the assessment from Step 1. Scenario planning and scaling decisions happen in facilitated sessions with the CEO and the executive team. Scenario probes accumulated during Sprint Three provide the raw material.

Time per person

Facilitator (Brent) 11 hrs Scenario facilitation, scaling framework sessions, coupling tier classification, what-scales / what-retires decisions
CEO 1.5 hrs Scenario exercise participation, scaling decision sign-off
Each senior exec 1.5 hrs Scenario construction, coupling tier input, scaling decisions for their domains
Internal owner 2 hrs Probe inventory curation, scenario session prep with Brent, scaling decision documentation
Each board member Not involved this step Scenario planning and scaling are internal team work. Board engagement returns in Step 3 with the Annual AI Plan co-creation.

What actually happens

Brent facilitates a scenario exercise using probes accumulated from Sprint Three pilots and the independent pilot. The team constructs 2-3 coherent futures and stress-tests current AI commitments against them. Each pilot is classified by coupling tier (low/medium/high) to determine governance requirements for scaling. Scaling decisions are made with evidence thresholds and the 6-8 week stability constraint. What gets retired is documented with rationale.

Through-line

Generates
Scenario probe inventory. 2-3 constructed scenarios. Per-pilot coupling tier classification. Scaling decisions with evidence and governance requirements. What-scales and what-retires lists.
Value
Scaling decisions are grounded in evidence and scenario-tested, not optimistic projections. Coupling tiers determine governance intensity: low-coupling workflows scale with logging, high-coupling workflows need human-in-the-loop. The stability constraint protects staff from change fatigue.
How Septapod uses it
The what-scales list becomes the operational roadmap for the next 6-12 months. The what-retires list frees up capacity and attention. Coupling tiers determine which governance requirements apply to each scaled workflow.
Next step uses
Step 3 (Annual AI Plan) incorporates scaling decisions, scenario stress-test results, and coupling tier classifications into the formal plan. The governance model v2.0 reflects coupling-tier governance requirements.

Scenario Probe Inventory

Probes accumulated from Sprint Three pilots and the independent pilot. Each probe captures a condition under which current assumptions break. Probes are the raw material for constructing coherent scenarios.

No scenario probes found in Sprint Three data. Add probes manually below.

Identity-Based Scenario Framework (Tighe)

Self-contained reference for running the scenario exercise with the executive team. This framework connects AI strategy decisions to organizational identity, which matters for a mission-driven credit union where values constrain what AI can do.

Framework Overview

Identity-based scenario planning starts from the premise that an organization's response to external change depends on who it believes itself to be. For Septapod, the cooperative charter and the member-ownership model are identity anchors that constrain AI strategy in ways that a conventional bank's identity does not.

Four Scenario Dimensions

Technology Trajectory

How does the AI capability landscape change? What becomes possible in 12-24 months that is not possible today? Where does the cost curve move?

Regulatory Environment

How do NCUA, state regulators, and federal AI policy (including EU AI Act Article 14, August 2, 2026) change the compliance requirements for AI in financial services?

Competitive Landscape

What do peer credit unions, fintechs, and large banks do with AI? Where does competitive pressure come from? What does member expectation look like in 2-3 years?

Member Behavior

How does member interaction with AI-powered services evolve? Where does trust erode or deepen? How does the membership profile shape acceptable AI use?

Facilitator Guidance

Run the exercise in a 90-minute facilitated session with the CEO and the executive team. Start with the probe inventory from Sprint Three. Ask each exec to pick the two probes most likely to materialize in 12-18 months. Cluster the responses. Build 2-3 coherent futures from the clusters. For each future, ask: what does Septapod's current AI plan look like in that world? Where does it hold? Where does it break?

Scenario Construction Workspace

Build 2-3 coherent futures from the accumulated probes. Each scenario describes what changes across the four dimensions and what that means for Septapod's AI commitments.

Scenario 1

Scaling Decision Framework

Per-pilot scaling assessment with coupling tier classification. Evidence thresholds are explicit: scaling requires pilot evidence, not optimism.

Coupling Tier Reference (Superadditive, "The Undo Button")

Low

Action affects one record. Other systems read it on their own schedule. Scales with logging.

Diagnostic: What else happens? Nothing immediate. Who sees results first? Only the user. Detection vs. cascade? Detection is instant, no cascade.

Medium

Action triggers downstream behavior within minutes. Agent recommends, human approves most cases.

Diagnostic: What else happens? Downstream actions trigger. Who sees results first? Multiple people or systems. Detection vs. cascade? Detection within minutes, cascade contained.

High

Real-time cascades before internal detection. Agent surfaces the decision but does not make it.

Diagnostic: What else happens? Immediate cascades. Who sees results first? External parties. Detection vs. cascade? Cascade outpaces detection.

People absorb substantial redesign with 6-8 weeks of stability between changes. Sequence expansions with enough stability between each.

Brent-Led Pilot

Pilot data loads from Sprint Three.

Outcome metrics load from Step 1.

Independent Pilot

Independent pilot data loads from Step 1.

What Scales

Workflows and pilots approved for broader deployment. Each entry includes the coupling tier, conditions for expansion, resources required, and governance requirements implied by the tier.

Item 1

What Gets Retired

Pilots or approaches that are retired based on evidence. Retirement is a success of the method, not a failure: it means the evaluation criteria worked and prevented bad investments from scaling.

Item 1

Annual AI Plan & Board

Formalize the Annual AI Plan from accumulated entries. Update the governance model with gap-period evidence. Design the board co-creation session. Formalize the champion network. Build the annual refresh mechanism. The engagement's handoff is already in progress, not a final-week task.

Module info Weeks 3-7 18 hrs facilitator 1 hr CEO 1 hr per exec 2.5 hrs board 4 hrs internal owner
When: Weeks 3-7. The Annual AI Plan formalizes entries that accumulated during Sprint Three plus Sprint Four scaling decisions. Board co-creation session is the culminating facilitated event. The engagement's handoff is already in progress, not a final-week task.

Time per person

Facilitator (Brent) 18 hrs Plan formalization, governance model update, board session facilitation, engagement wrap, pre-close readiness check (1 hr to assemble evidence and complete the four-prereq check)
CEO 1 hr Plan review, governance sign-off, board session participation
Each senior exec 1 hr Annual Plan draft review for their domain, board co-creation session participation
Each board member 2.5 hrs Pre-read material review, board co-creation session participation
Internal owner 4 hrs Refresh mechanism ownership, champion network formalization, ongoing governance, pre-close readiness evidence (1 hr providing artifacts for the four-prereq check)

What actually happens

The Annual AI Plan has been assembling continuously since Sprint Three. Sprint Four formalizes what already exists rather than writing from scratch. The governance model is updated with gap-period evidence. The board co-creation session brings the board from signal-watchers to governance participants. The champion network is formalized with roles and mandates. The annual refresh mechanism ensures Septapod can run the cycle without Brent.

Through-line

Generates
Formal Annual AI Plan (board-ready). Governance model v2.0. Signal watch-list refresh. Board co-creation session design. Formalized champion network. Annual refresh mechanism. Engagement completion summary.
Value
Septapod has a complete, board-approved AI plan that is refresh-ready. The governance model is evidence-tested, not theoretical. The engagement succeeds if there is no follow-on engagement.
How Septapod uses it
The Annual AI Plan guides AI decisions for the next 12 months. The refresh mechanism triggers the annual update. The governance model operates independently. The board has ownership over AI governance, not just oversight.
Next step uses
There is no next step. The engagement ends with Septapod operating independently. The refresh mechanism is the successor to the engagement.

Annual AI Plan Assembly

Aggregated from Sprint Three accumulated entries and Sprint Four scaling decisions. Seven sections cover the full plan. Each section has a text area for the narrative Brent writes during facilitation.

Sprint Three Annual AI Plan entries will load automatically.

1. What AI Septapod Uses and Why

2. What Septapod Does Not Use and Why

3. Scaling Roadmap

4. Resource Requirements

5. Governance Model

6. Risk Posture

7. Refresh Triggers

Governance Model v2.0

Updated from Sprint Three's governance model with gap-period evidence. Each accountability slot shows the Sprint Three original alongside the updated version.

No governance model was carried forward from Sprint Three. Build a new governance model here or complete Sprint Three first.

Pilot Oversight

Vendor Evaluation

Annual Plan Refresh

Board Reporting

Signal Watch-List Refresh

Carried forward from Sprint Three. For each signal: was it useful during the gap? Keep, modify, or drop. New signals from the scenario exercise can be added.

Signal 1

Board Co-Creation Session Design

The board has been receiving signals to watch since Sprint Three. They have enough exposure to participate in governance decisions, not just receive updates. This card designs the facilitated session, not a form the board clicks through.

Pre-Read Material

Session Structure

Block 1

Champion Network Formalization

Formalizes champion roles and mandates based on the gap-period assessment from Step 1. Connects to the governance model: champions are the distributed support layer.

No champion assessment data from Step 1. Complete Step 1's champion network assessment first, or build the network here.

Champion 1

Annual Refresh Mechanism

How Septapod runs the AI Plan refresh without Brent. The engagement succeeds if there is no follow-on engagement.

Refresh input checklist

Vendor Assumption Sheet (Superadditive Reference)

Six evaluation questions from Superadditive's "Japanophilia and Greenhouses" for AI platform procurement. Reference material for vendor decisions that arise during scaling.

1. Coordination Philosophy

Does the vendor assume centralized or distributed coordination? A platform that assumes all AI work routes through a single team conflicts with a distributed governance model.

2. Unit of Work

What does the vendor consider a single "job"? If the unit is a chat turn, the platform is optimized for conversational AI. If it is a workflow, it fits process automation. The mismatch between the vendor's unit and the credit union's unit creates friction.

3. Destination Assumption

Where does the vendor assume AI output goes? Into a human review queue, directly into a system of record, into a member-facing channel? The destination assumption determines governance requirements.

4. Relational Persistence

Does the platform remember context across interactions? A platform with no memory requires rebuilding context each session. A platform with persistent memory raises data governance and privacy questions.

5. Naming Metaphor

What does the vendor call its AI? "Assistant," "agent," "copilot," "advisor." The naming metaphor reveals the vendor's mental model of the human-AI relationship and shapes user expectations.

6. Exit Cost

What happens if Septapod stops using this vendor? Are prompts, workflows, and training data portable? A platform that locks in institutional knowledge creates dependency that conflicts with the annual refresh model.

Pre-Close Readiness Check

Four prerequisites that have to be in place before the engagement closes. Each one was implied by the design; this card makes them explicit so the engagement does not end leaving Septapod set up to fail the capability transfer test it took in Step 1. If any prerequisite is not met, Sprint Four needs to address it before closing.

Why these four

Sprint Four's capability transfer scorecard tests whether Septapod can run AI work independently. Five indicators were measured: per-pilot method used, governance functioning, signal watch-list active, shadow-AI behavior changed, champion network operating. For those indicators to show as "Yes" rather than "Partial" or "No," four things have to be in place when the engagement closes. Named these explicitly because implication will not survive the gap period.

1. Named internal owner with calendar commitment

Without a single named owner who has documented time on their calendar for post-engagement work, the Annual AI Plan refresh becomes everyone's job and therefore no one's. The refresh mechanism card above identifies the owner; this prereq confirms the calendar commitment is real.

2. Governance cadence ran 2+ cycles before the gap

The Distributed Governance model (Sprint Three Step 1) and its v2.0 update (Sprint Four Step 3) are paper constructs until the cadence runs. Two completed cycles under Brent's eye is the minimum to know the model handles real decisions. One cycle is not enough; the second cycle reveals what the first missed.

3. Champion network of 3-5 people with documented support activities

The Champion Network Formalization card above names the people and roles. This prereq confirms the support activities are real: each champion has done something visible during the engagement, not just received the badge.

4. Signal watch-list with named watchers who completed one monitoring cycle

The Signal Watch-List Refresh card above keeps the list current. This prereq confirms the watchers are practiced: each named watcher has run one monitoring cycle (checked their signal at the assigned frequency, reported what they saw, no action or action taken). A watcher who has never reported has not actually been a watcher.

Engagement Completion Summary

What the full engagement produced across all four sprints. What Septapod now owns and operates independently. Conditions under which Brent would return.

Sprint Four Summary

Click to expand