Hands-On Agentic AI for Leaders

Shift from chatting with AI to building production agent workflows you can deploy, govern, and reuse across your org.

Weekly time: ~6 hrs/week
For: C-suite, leaders, and senior ICs who learn by building, not by watching.
Prerequisites: A ChatGPT, Claude, or Gemini subscription. No coding required.
Structure: 30 days · 4 milestones · 11 checkpoints · 47 steps

Start in mouseover →

Opens the challenge in mouseover. Download mouseover first if you have not yet.

Week 1

Foundations & Workflow Identification

Audit recurring work, identify high-value workflow candidates, set up your AI builder stack.

End-of-week outcome: A ranked list of 3 workflow candidates plus a working builder environment.

Checkpoint 1 · ~2 hrs

Audit your recurring work

You will have: A ranked list of 3 candidate workflows scored on frequency × pain.

Show 4 steps

Step 1 List 10 things you do every week that take more than 20 minutes.
Step 2 Score each on frequency (how often) and pain (how draining or error-prone).
Step 3 Mark the ones you would describe out loud to a colleague the same way every time. These are workflow candidates.
Step 4 Pick the top 3 and write a one-sentence description of each.

Checkpoint 2 · ~3 hrs

Apply the 7-step workflow lifecycle

You will have: Your top candidate decomposed into a written workflow spec.

Show 5 steps

Step 1 Take the top candidate and write out the workflow as a numbered list of steps.
Step 2 Mark each step as "decision" (judgment required) or "mechanical" (rule-following).
Step 3 Identify which mechanical steps an LLM could do with a clear prompt.
Step 4 Identify which decision steps still need human review and what good looks like for each.
Step 5 Write a one-page workflow spec: inputs, steps, outputs, review points.

Checkpoint 3 · ~2 hrs

Set up the builder stack

You will have: A working environment where you can author, version, and test AI workflows.

Show 4 steps

Step 1 Install Cursor or VS Code. Get comfortable opening, saving, and committing files.
Step 2 Create a GitHub account if you do not have one and make a private repo for your AI assets.
Step 3 Pick your primary model: Claude, ChatGPT, or Gemini. Confirm you can access the API or platform of your choice.
Step 4 Run one short prompt as a saved file in your repo. Commit it. You now own that asset.

Week 2

Agent Skills & Deterministic Workflows

Package skills as files you own. Build workflows you can test, version, and hand off.

End-of-week outcome: Two production workflows: one deterministic, one human-in-the-loop.

Checkpoint 1 · ~3 hrs

Build your first agent skill

You will have: A reusable, file-versioned skill that beats raw chat on 5 real inputs.

Show 5 steps

Step 1 Pick a skill type from your workflow: research, summarisation, analysis, or drafting.
Step 2 Write the skill as a prompt-in-a-file with explicit role, inputs, format, and constraints.
Step 3 Run it on 5 real inputs you actually deal with. Capture the outputs.
Step 4 Compare against raw "ask ChatGPT" on the same inputs. Note where the skill version wins.
Step 5 Iterate the prompt once. Commit the new version. You now version a skill.

Checkpoint 2 · ~3 hrs

Build a deterministic workflow

You will have: A 2-to-3-skill chain that runs end-to-end on real input with a pass rate you can quote.

Show 4 steps

Step 1 Chain 2 or 3 of your skills with explicit handoffs (output of A becomes input of B).
Step 2 Define the fixed rules: what runs always, what skips, what triggers a stop.
Step 3 Run on 5 real inputs. Log pass and fail for each step.
Step 4 Write down the pass rate. This is your baseline.

Checkpoint 3 · ~2 hrs

Add a human review checkpoint

You will have: A collaborative workflow where a human approves before the final step ships.

Show 4 steps

Step 1 Identify the one step in your workflow where human judgment matters most.
Step 2 Design the handoff: what does the human see, what do they decide, where do they signal "approved"?
Step 3 Run the collaborative version with a real colleague on a real input.
Step 4 Note what the colleague pushed back on. Update the prompt so the AI catches it next time.

Week 3

Autonomous Systems

Deploy single-agent and multi-agent workflows for your highest-value work.

End-of-week outcome: A deployed single-agent workflow running unattended and a multi-agent design tested in dev.

Checkpoint 1 · ~3 hrs

Deploy a single-agent workflow

You will have: A workflow that runs unattended on a real input and produces a real output.

Show 4 steps

Step 1 Pick the platform that fits your use case: Claude Projects, custom GPT, M365 Copilot, or Gemini.
Step 2 Deploy your workflow with all the skills and rules from week 2.
Step 3 Trigger it on a real input. Measure how long it took, what it produced, where it diverged.
Step 4 Document the trigger, expected inputs, and expected outputs.

Checkpoint 2 · ~3 hrs

Design a multi-agent system

You will have: A documented 3-agent handoff with a tested failure recovery path.

Show 5 steps

Step 1 Pick a workflow too big for one agent. Often: research → synthesis → output.
Step 2 Design 3 agents with explicit roles, handoffs, and inputs/outputs.
Step 3 Implement the handoffs. Could be as simple as file passes or as rich as a tool-call chain.
Step 4 Run on one real input. Inject a known failure midway. Document the recovery path.
Step 5 Decide which workflows are worth this complexity and which are not.

Week 4

AI Registry & Governance

Make your skills, prompts, and agents discoverable, reusable team assets.

End-of-week outcome: A living AI Registry in Notion with a 30-day team rollout plan.

Checkpoint 1 · ~3 hrs

Build the AI Registry

You will have: A Notion database listing all your skills, prompts, and agents with status, owner, and use.

Show 4 steps

Step 1 Create a Notion database with columns: name, type (skill/prompt/agent), status, owner, use case, link to file.
Step 2 Import every asset you built in weeks 1 through 3.
Step 3 Tag each by maturity: draft, tested, production.
Step 4 Add a "last reviewed" date column so quality stays current.

Checkpoint 2 · ~2 hrs

Make it discoverable

You will have: A team-facing landing page on the registry with 3 example usages.

Show 4 steps

Step 1 Write a one-page intro to the registry: what is here, when to use it, who owns it.
Step 2 Add 3 example usages with the exact inputs and outputs.
Step 3 Share with 2 colleagues. Ask them to find a specific asset. Note what they fail to find.
Step 4 Fix the discoverability gaps you observed.

Checkpoint 3 · ~2 hrs

Plan the team rollout

You will have: A 30-day rollout plan with 3 candidate workflows and first office hours scheduled.

Show 4 steps

Step 1 Pick 3 workflows from your registry that are ready for team adoption.
Step 2 For each, write: target team, value sentence, expected adoption blockers.
Step 3 Draft a 30-day rollout plan: week 1 announce, week 2 train, week 3 use, week 4 retro.
Step 4 Schedule your first office hours session and post it.

Ready to start?

You have read the plan. The next thing to do is open the first checkpoint.

Start in mouseover →

Opens the challenge in mouseover. Download mouseover first if you have not yet.

Browse other 30-day challenges ← Back to all challenges