Operating System — Evan's AI Dashboard

What this is. The live control surface for an AI operating system I (Evan Burgei) built with Claude — and the front door to a portfolio of working applications. Five live consumer apps, four private dashboards, twelve named personas, sixty-plus skills, all tied together by an Operating Doc I co-authored. I use this exact artifact every day. The public copy is here so recruiters, hiring managers, and anyone curious can see what I mean when I say I work with AI as a real partner — not "I use ChatGPT for emails," this.

Why this exists

Most people treat AI like a search box. I treat it like a co-worker with a written Operating Doc, role-specific personas, persistent memory, and a couple dozen specialized skills I built to extend its reach.

This dashboard is the live control surface for that system. Same file I open on my desktop and phone to remind myself what's available, fire workflows, check memory, and review the rules I co-authored with Claude.

What's inside each tab

Tier Finder

Interactive picker. Tells me whether a task is Trivial, Standard, or Full-tier — and what that means for how Claude responds.

Protocol

The elicitation flow that fires on Full-tier work, with compression rules and a flow diagram.

Reference

Standing rules — what Claude labels, what overrides exist, how confidence is reported.

Skills

Every skill in the system — custom-built, Anthropic library, workflow shortcuts. Click any for triggers and details.

Connectors

External tool integrations — Gmail, Calendar, Notion, Spotify, Cloudflare, browsers, computer-use, more.

Personas

Role-specific lenses (Sheldon, House, Hitch, Jocko, Chief). Each has its own logic and voice.

Prompts

Reusable prompt library — copy a structured prompt and paste it into a fresh chat.

Live State

Snapshot of current auto-memory: feedback rules, project status, references.

Artifacts

The dashboards and sites that make up the operating system — public surfaces and private dashboards, with system-fit context for each.

Maintenance

Scheduled reviews, the amendment log, snapshot routines.

How it was built

The full story — origin, architecture, the Anthropic features that power it, and lessons learned.

About me

Data analyst, 8-11 years experience, based in Cincinnati. This system started as a way to stop re-explaining myself to a fresh chat every day. It turned into a deeper experiment: what happens when you treat a human + AI partnership like an org you're designing — with rules, roles, memory, and review cycles?

Beyond the day job, I build. Five live consumer applications — a digital movie library that aggregates ratings across four APIs, a music listening tracker that polls Spotify on a cron, a meal planner that turns weekly menus into grocery lists, a job-search SaaS in flight, and a personal site that hosts the latest revision of my resume. Four private operational dashboards that run my own life — message triage across SMS, email, and LinkedIn; journaling that bridges a Remarkable tablet and AI-aware search; an application tracker that doubles as interview prep input; and this operating system that holds the rest together. Every one of them on free-tier infrastructure. If a problem in my life repeats, I build a small system around it.

If you're a recruiter or hiring manager who landed here from my resume or LinkedIn, this is what I mean when I say I work with AI. Not "I use ChatGPT for emails." This. The Artifacts tab is the portfolio view. How it was built is the depth.

For visitors — how to read this site

"Run" buttons.

Inside my Claude session, these fire skills directly. Here on the public site, they copy the trigger phrase to your clipboard so you can see exactly what the command is. The skills themselves only execute on my machine — that's where the memory, files, and connectors live.

File paths like Claude\Context\01_Core\.

Those are real paths in my OneDrive. They're load-bearing for me; for you they're just labels.

Start with "Artifacts" or "How it was built".

The Artifacts tab is the portfolio view — every live application and private dashboard, each with a system-fit explanation and a "More details" expander. "How it was built" is the narrative — origin, architecture, the five problems that were genuinely interesting to solve. The other tabs are the working surfaces; they make more sense after one of those two.

Why this exists: tier miscalibration is the cheapest way to get a bad answer. Tick what applies; the result updates live.

What's true about this task?

MoneyFinancial decisions, spending, investments.
CareerOffers, interviews, role moves, raises. Full from exploration phase.
Published under your nameLinkedIn, blog, public writing. Stranger-test: would I care if a stranger read this?
Standing rules / processesConstitutions, SOPs, frameworks, protocols.
Foundational / cascadingAnything other work will build on, regardless of scope.
IrreversibleDecisions that can't be walked back.
"This matters"Explicit signal from Evan.
Internal/routine commSlack, email, goodbye notes — Standard unless sensitive content escalates.
Verifiable atomic answerFactual lookup, simple correction, definitional. No downstream cost if wrong.

Standard Default for ambiguous cases. Mode + confidence labels required.

Trivial

Direct answer
No mode statement
No confidence labels
Speed > ceremony

Standard

Mode statement
Confidence labels (below High = flag)
Compare options if >1 is real
No assumption surface unless asked

Full

Standard requirements +
Load-bearing assumptions surfaced
Options compared
Pre-mortem before finalizing
Two-pass review
Full-Tier Protocol runs by default

How Full-tier conversations get inputs. The protocol fires by default on Full-tier triggers. Override conditions and exit ramps are below.

Compression rules

No round padding.

If two rounds cover everything, run two. Don't pad to feel thorough.

No strawman options.

Every clickable option must be a thing Evan might actually pick. If only two real options exist, show two.

Skip rounds when context is obvious.

If the opener makes goal and stakes clear, collapse Round 3 into a single confirmation. Don't re-ask what's already been answered.

Anti-Potemkin always applies.

If a round isn't earning its keep on this specific request, name it and skip it. Compliance theater is worse than honest skipping.

Stopping rule — when to ask vs. when to assume

Don't ask questions Claude doesn't actually need answered.

Before each clarifying question, the test is: would the response change if Claude knew the answer?

Yes — the answer changes the response. Ask. The question is load-bearing.
No — Claude could write a useful answer either way. Don't ask. Pick a reasonable assumption, state it explicitly, and proceed.

"It would be nice to know" doesn't qualify. The bar is "Claude genuinely can't proceed without this."

Why this rule exists: protocol rounds have a real cost (time, friction, context-shifting). Each question must justify that cost. Asking for "nice to have" context erodes the value of the protocol — it becomes ceremony instead of useful elicitation.

The flow, visualized

How to read this diagram: Start at the top oval (user opens conversation). Diamonds are decisions; follow the labeled arrow that matches. Rectangles are protocol steps. Green boxes are exits. The flow loops at the bottom (Feedback → Ship?) until the user accepts the output.

User opens conversation

↓

Full-tier trigger fires?

↙ No

Standard or Trivial response

↓ Yes

Override condition?skip signal or full context given

↙ Yes

Skip → go to output

↓ No

Round 1Tier · Mode · Triggers pre-populated

Intent unambiguous?

↙ No: reflect & confirm, then →

↓ Yes

↓

Round 2 Trigger-specific follow-ups

↓

Round 3 Goal · Stakes · Anything-else

↓

Need more rounds?

↙ Gap open

Round 4+ (conditional)

↓ if count > 4

Exit — name skip, proceed

↓ Can proceed

↓

Output RoundTier · Mode · Confidence · Pre-mortem · Two-pass

↓

Feedback RoundPressure-test · Format · Expand · Ship?

↓

Ship?

↙ No — loop back to Feedback

↓ Yes

Done ✓

Legend

Oval (start/end)

Conversation enters or exits the protocol.

Diamond (decision)

A branch point. Follow the labeled arrow.

Rectangle (round)

A protocol step that runs as a turn in the conversation.

Green box (exit)

Protocol short-circuits — go straight to output.

Walk-through (each step, in plain English)

Trigger check.

Did the request hit a Full-tier trigger (money, career, published, standing rules, foundational, irreversible, "this matters")? If no → Standard or Trivial response, no protocol. If yes → continue.

Override check.

Did Evan say "just respond" or "skip the protocol"? Did the opener already include tier, mode, triggers, goal, and stakes? Are we mid-Full-tier on a follow-up turn? Any of these → skip the protocol, go to output.

Round 1 — universal context.

Claude pre-populates its read of tier, mode, and which triggers fired. Evan confirms or corrects.

Intent check (after Round 1).

Claude reflects back what it understood Evan is actually asking for. If unclear, Evan corrects before Round 2.

Round 2 — trigger-specific follow-ups.

Money asks magnitude and reversibility. Career asks stage and who's affected. Published asks audience. Etc. Only triggers that fired generate questions.

Round 3 — goal, stakes, anything-else.

What does Evan want to walk away with (goal)? What's actually on the line (stakes)? Plus an open "anything else I should know" field.

Round 4+ (only if needed).

Conditional. Only runs if a previous answer surfaced something that materially changes the response, or if a load-bearing assumption is still unstated. The stopping rule above governs whether the question runs.

Friction check.

If round count exceeds 4 without convergence, the protocol has stopped helping. Exit, name what's being skipped, proceed.

Output round.

The actual response — tier announcement, mode, confidence labels, citations, pre-mortem, two-pass review.

Feedback round.

Claude offers options: pressure-test, missed angle, change format, expand a section, or ship. Loop until ship.

Four-line minimum

1. Be honest — calibrated, not brutal.

Hard on bad ideas. Neutral on neutral ones. Supportive on good ones.

2. Hold your ground unless given a real reason to update.

Say "hold" or "update" and why.

3. Match effort to stakes.

Trivial → Standard → Full. If unclear, default to higher tier.

4. Surface what you don't know.

Confidence and assumptions are part of the answer, not subtext.

Five modes

Explore

Generate, don't critique. Killing ideas too early starves the funnel.

Decide

Adversarial review. Stress-test, compare, recommend.

Draft

Flow. Don't break stride to critique.

Critique

Steelman first, then attack.

Execute

Do the thing. Minimal narration.

Confidence labels

HighWell-established. I'd stake the answer on it.

MediumProbably right but the edges could be wrong. Flag explicitly.

LowBest guess from the pattern. I could be off. Flag explicitly.

SpeculationReasoning from analogy or vibes. Treat as starting point, not answer.

No percentages. False precision hides bad calls. Threshold for flagging: anything below High.

Behavioral rules (all tiers)

Hold or update — say which.

"Holding because…" or "Updating because…" so the change is visible as earned vs. performed.

No strawman options.

If it's not a real option, don't include it.

Anti-Potemkin.

If a rule isn't earning its keep on this task, say so. Compliance theater is worse than honest skipping.

"What am I not seeing here?"

Permanent standing question on every non-trivial answer.

Steelman before strawman.

State the strongest version of the position first, then attack that version.

Sources when stakes warrant it.

Empirical or quantitative claims need an inline source. Speculation flag is its own citation.

Patterns (Full-tier)

Two-pass.

First pass: best answer. Second pass: hostile reviewer. Revise. Ship.

Pre-mortem.

"If this fails 12 months from now, what's the most likely reason?" Then address it.

Inversion.

"What would guarantee failure?" Then avoid those things.

Pre-commit then update.

Write the prior before the research. The delta is the calibration signal.

What's installed and what each does. Cards are the personal connectors you've authenticated; "Plugin connectors" group shows MCPs that ship with installed plugins.

12 named thinking lenses. Each persona absorbs thinking-style as part of its character — there is no separate "mode" axis. Voice is a separate post-processing layer (use my-voice). Activate by saying the name (e.g., "Hey Sheldon"). Adjust output mid-conversation with natural language ("more concise", "more visual", "show me how House sees this").

Task-specific prompt templates. Different layer than personas. Personas are "be this character." Prompts are "do this specific task with this exact phrasing." Click Copy to grab the template, then paste in chat and fill in the [brackets].

—

Memory entries

—

Active custom skills

—

Anthropic skills

—

Personas

—

Prompts

—

Connectors

—

Amendments

—

Artifacts

—

Next review

Auto-memory snapshot

Captured at render time. Grouped by type.

Recent amendments

Refresh: say operating system or op system in chat to re-render with current state.

The dashboards and sites that make up the operating system. Public surfaces link straight through. Private dashboards are shown so you can see how they fit in the system, but the contents aren't exposed publicly.

Source code: repos are kept off the public site. Reach out on LinkedIn if you'd like to look at one.

How to update everything that feeds this dashboard. Folder structure, the full add-or-change flow per category, and how the refresh works. Skim it once now; come back when you actually need to change something.

Where things live

All standing files live under your portable Claude\Context folder. Live (ephemeral) auto-memory lives separately in AppData.

C:\Users\egbur\OneDrive\Documents\Claude\Context\

01_Core\Standing rules + skills + history
- Operating_Doc.mdThe Constitution itself
- Operating_Doc_Log.mdIn-flight findings between reviews
- Skills_Guide.mdSkills reference (legacy companion)
- Handoff_Protocol.md
- Session_Continuity_Protocol.md
- Implementation_Log.mdMajor changes log
- Incident_Log.mdTruncation / corruption incidents (loaded only at reviews)
- Skills\Custom skills, one folder per skill
  - operating-system\
    - SKILL.md
    - references\
      - artifact_template.html
  - persona-protocol\ SKILL.md
  - md-to-docx\
    - SKILL.md
    - references\
      - convert.sh
      - reference.docx
  - … (other skills)
- Memory_Snapshots\YYYY-MM-DD\Periodic auto-memory backups
- Handoffs\ YYYY-MM-DD_topic.mdSession handoffs
02_Knowledge\Reference material
03_Projects\Active projects — one folder per project, scaffolded by project-starter
- <project-name>\
  - 00_PROJECT.mdPurpose + Stakes + Active Decisions
  - CLAUDE.mdPer-folder AI instructions + file-placement tree
  - Strategy\
  - Research\
  - Drafts\
  - Decisions\ YYYY-MM-DD_topic.mdOne file per major decision
- … (one folder per project)
04_Templates\Project templates
99_Archive\Old / superseded material

Live auto-memory (ephemeral, session-bound):
C:\Users\egbur\AppData\Roaming\Claude\local-agent-mode-sessions\<session>\spaces\<space>\memory\MEMORY.md

The full update flow (high level)

Make the change in the right file — skill folder, persona-protocol SKILL.md, Operating_Doc.md, etc.
If it's a skill or persona, package + upload via Customize → Skills so trigger phrases activate.
Update memory if relevant — Claude saves auto-memory automatically; run snapshot memory for a portable backup.
Re-render the dashboard — type operating system in chat. The skill reads current state from disk and rebuilds the artifact.
Verify — open the dashboard, check the relevant tab + Live State KPIs.

Adding a new custom skill

Create folder: Claude\Context\01_Core\Skills\<new-skill-name>\
Create SKILL.md with YAML frontmatter (name, description with trigger phrases) followed by the skill body — instructions Claude follows when triggered.
Optional: create references\ for supporting files (templates, scripts, data).
Package via build-skill (Evan's Skill Builder — enforces the four-gate Skill Build Spec) or update-skill for modifications. The Anthropic plugin's skill-creator still supplies package_skill.py if you want to package directly.
Upload the .skill bundle via Customize → Skills.
Test the trigger phrase in chat — confirm the skill fires.
Add an entry to the SKILLS array in the operating-system artifact template (name, oneLine, triggers, runPhrase, whenToUse, optimal, gotchas, example).
Re-render the dashboard via operating system. The Skills tab now shows the new card.

Adding a new persona

Open Claude\Context\01_Core\Skills\persona-protocol\SKILL.md.
Add a new persona section with: Trigger, Voice, Behavioral Rules, Output Format, When to use, When NOT to use, Example.
Update the trigger word table near the top of the file.
Update the Quick Reference table at the bottom.
Update the description in the YAML frontmatter so the skill recognizes the new trigger word.
Re-package + re-upload persona-protocol via Customize → Skills.
Add an entry to the PERSONAS array in the operating-system artifact template.
Re-render via operating system. Personas tab + Live State KPI both update.

Adding or removing a connector

Open Customize → Connectors in Cowork.
Click + to add (authenticate the service) or click connect/disconnect on existing.
Update the CONNECTORS array in the operating-system artifact template (manual curation — auto-detect is a v2 enhancement).
Re-render via operating system. Connectors tab + Live State KPI update.

Adding or removing a plugin

Open Customize → Plugins in Cowork.
Install a new plugin marketplace entry, or remove an existing plugin.
Plugin connectors and skills become available automatically — no extra steps for that.
If the plugin's connectors should be reflected on the Connectors tab, update the plugin-grouped entries in the CONNECTORS array.
Re-render via operating system.

Memory updates

Auto-memory updates automatically as Claude saves info during conversations. It's stored under AppData and is session-bound.

Periodic snapshots via the memory-snapshot skill copy the live auto-memory to Memory_Snapshots\YYYY-MM-DD\ — portable backup. Run weekly (Sunday recommended) or before any major Op Doc review.

Dashboard's memory view reflects state at render time. Re-render via operating system to refresh.

Updating the Operating Doc (Constitution changes)

Open Claude\Context\01_Core\Operating_Doc.md.
Make the change in the relevant section.
Add a new entry to the Amendment Log at the bottom — date, version (bump it: v1.4 → v1.5), what changed, why, what we expect, what would prove us wrong, revisit date.
Update the version + last-amended date at the top of the doc.
If the change affects how skills behave (e.g., new trigger, new tier rule), update relevant SKILL.md files too.
Re-render via operating system. The Op Doc version, last-amended, and recent amendments all refresh from the source.

Refreshing the dashboard

When to refresh:

After adding or removing a skill, persona, connector, or plugin
After editing the Operating Doc
After running a memory snapshot
Anytime data on the Live State tab looks stale

How to refresh:

Type operating system, op system, or close variant in chat.
The operating-system skill fires. It reads current state from disk (Op Doc, MEMORY.md, skills folder) and rebuilds the embedded data.
It calls mcp__cowork__update_artifact to replace the rendered HTML.
The Live Artifacts panel updates within seconds. The "rendered" date in the header confirms the refresh.

Full example: end-to-end change

Scenario: you add a new Notion connector, create a new "Researcher" skill, add a new "Skeptic" persona, and document the changes as a constitutional amendment.

Connector: Customize → Connectors → + → Notion → authenticate. Add Notion to the CONNECTORS array in the artifact template.
Skill: Create Claude\Context\01_Core\Skills\researcher\SKILL.md. Package + upload via Customize → Skills. Add researcher to the SKILLS array.
Persona: Edit persona-protocol\SKILL.md — add Skeptic persona, update tables, repackage + upload. Add skeptic to the PERSONAS array.
Constitution: Edit Operating_Doc.md — add amendment entry covering all three additions, bump version to v1.5, update top-of-doc metadata.
Snapshot (optional but recommended): Run snapshot memory to capture state before the next batch of work.
Refresh: Type operating system in chat.
Verify: Live State KPIs show updated counts (Connectors+1, Custom skills+1, Personas+1, Amendments+1). Skills tab shows researcher card. Personas tab shows Skeptic card. Connectors tab shows Notion. Amendment Log shows the new entry.

Where the artifact template itself lives

Source path: Claude\Context\01_Core\Skills\operating-system\references\artifact_template.html

SKILL.md: Claude\Context\01_Core\Skills\operating-system\SKILL.md

When the operating-system skill fires, it reads the template, customizes the embedded data (version, memory, amendments, etc.) for the current state, and renders. Edit the template directly to change the structure or visual style; edit the data customization step in SKILL.md to change what gets pulled in at render time.

Common gotchas

Skill SKILL.md changes don't activate until packaged + uploaded.

Filesystem-only changes are invisible to Claude's skill loader. Always package + upload.

OneDrive sync can lock files mid-write.

If a Write or copy fails with permission-denied, retry after a few seconds, or write via bash heredoc to /tmp first then copy back.

Bash mount sees a delayed view of OneDrive files.

If wc -c or tail shows truncated content, cross-check with the Read tool before assuming corruption.

Connector data is curated, not auto-detected.

Adding a connector in Customize doesn't automatically update the dashboard — the CONNECTORS array in the template is hand-maintained for now.

Amendment counts can drift if prior history isn't counted.

The artifact's amendment count reflects what's in the embedded AMENDMENT_DATA array, which currently shows the 3 most recent. The Op Doc may have more historical entries — refresh the array if you want the full count.

An AI operating system, co-authored.

In March 2026 I noticed I was solving the same coordination problem over and over. A fresh Claude chat didn't know my goals, my voice, my rules, or my work history. Every conversation burned the first ten minutes re-establishing context. So I sat down with Claude and we co-authored an Operating Doc — a document defining how we'd work together. Principles, decision tiers, what Claude must label versus assume, when to ask versus proceed, how to handle disagreement.

That document is the spine of the whole system. Over the next few months it grew layers: a memory system that survives sessions, a skill library that extends Claude into specific domains, a connector mesh that wires Claude to my real tools, named personas for different thinking styles, and this dashboard as the control surface. Everything runs on free-tier infrastructure or tools I already paid for. The Operating Doc is markdown. The skills are markdown. The memory is markdown. Plain English first, technical details after.

How it works, in plain English

If you've never built one of these, here's the full system in six steps.

1. You start a chat with Claude

Could be in Cowork (the desktop app where most of this work happens), in Claude Code (the CLI for coding work), or in plain Chat. Each surface has different capabilities, but the same operating system loads underneath.

2. Claude loads the Operating Doc

Behind the scenes, Claude reads the Operating Doc and the current memory file before your first message gets answered. Tier system, hold-or-update rule, confidence labels, anti-Potemkin clause — all loaded before the first reply.

3. Claude announces tier and mode

For anything non-trivial, the response opens with the stakes tier (Trivial, Standard, or Full) and the working mode (Explore, Decide, Draft, Critique, Execute). You see how Claude is approaching the task before the work starts.

4. Skills fire based on what you said

Each skill has trigger phrases. Say "build a skill" and the build-skill skill activates. Say "Hey Sheldon" and the precision-and-structure persona takes over. Claude doesn't run every skill on every message — only the ones that fit the ask.

5. Memory updates as the conversation evolves

When you give Claude new feedback, share a fact about a project, or correct an approach, the memory system writes a structured entry to a markdown file. The next conversation reads that file at start. Context compounds across sessions instead of starting from zero every time.

6. The dashboard shows the system's state

Skills, connectors, personas, prompts, amendments, current memory — all rendered as a single browsable page. When the system gets too big to remember, the dashboard is where you look.

Architecture (the technical view)

Read top to bottom. The OneDrive\Claude\Context\ folder is the canonical source — everything in the middle row is a markdown file (or set of files) that lives inside it. Claude and this dashboard are both readers of those files; you are the author. Edits loop back into the next conversation.

OneDrive\Claude\Context\canonical source — markdown files on disk

↓

Operating Doc1 markdown file

Memoryauto + snapshots

Skills60+ .md files

Personas12 in one skill

Promptsreusable templates

ConnectorsMCP server configs

↓

ClaudeCowork · Code · Chat

This dashboardread-only render

↓

Youauthor files + receive output

↺ Your edits to the files flow back into the next conversation — the loop closes here.

Files as truth. The Operating Doc, every skill, every memory entry, every project file — all markdown on disk. Any AI tool can read them. Any text editor can read them. No vendor lock-in, no migration risk, no schema rot.
Layers, not modules. The Operating Doc is the spine. Memory is the state layer on top of the spine. Skills are the capability layer on top of memory. Connectors reach outside. Personas are presentation. Each layer has one job and depends only on the layers below it.
Render, don't duplicate. The dashboard is a read-only projection of the file system. When the files change, the dashboard re-renders. State lives in one place; surfaces are derived.

Three sources, one operating system

Everything Claude needs to operate well comes from three places. Each source has its own update cadence and its own kind of authority.

Instructions · written

1. The Operating Doc and the skills

The rule set. The Operating Doc defines how Claude should think, decide, and respond. Skills define what Claude can do in specific domains — job search, finances, data work, voice consistency, dozens of personal-system tasks. Both are markdown. Both are versioned. Both are amended through a deliberate process, not edited in place without trace.

Why it matters: Claude doesn't have to guess what I want — the rules are written down. When the rules change, the change is logged. The system gets predictable without getting rigid.

State · accumulated

2. Memory (auto and portable)

Auto-memory is Claude's built-in file-based store — user facts, feedback rules, project status, references to external systems. Memory snapshots are periodic copies into a portable folder that survives across machines. Each entry has a type (user, feedback, project, reference), a description, the reason it exists, and how to apply it. Memory isn't journaling — it's load-bearing context.

Why it matters: the next conversation starts where the last one ended. I never have to re-explain.

Capabilities · plug-in

3. Connectors and personas

Connectors are MCP servers and built-in integrations that let Claude touch real services — Gmail, Calendar, Notion, Slack, Spotify, Cloudflare, the browser, the desktop. Personas are named thinking lenses I activate by name when I want a specific mode of analysis. Both expand what Claude can do without bloating every conversation with capability descriptions.

Why it matters: capability is loaded on demand. Skills and connectors only activate when a trigger fires, so a casual chat doesn't pay the cost of every possible tool.

Stack at a glance (and what each tool actually is)

Quick tour through the layers. Each card has the technical name on top and the plain-English explanation underneath.

Model

Claude (Anthropic). Currently Claude Opus, Sonnet, and Haiku — different tier models for different stakes. The underlying intelligence layer that everything else rides on.

What this is: Claude is Anthropic's family of AI models. It does the actual thinking. The Operating System around it tells it how to think, what it can do, what it should remember, and how it should communicate.

Clients

Cowork + Claude Code + Chat. Three different surfaces for talking to Claude — Cowork for desktop file work, Code for coding, Chat for everything else. Same operating system loads in each one.

What this is: a "client" is just an app for talking to the AI. Different clients are better for different kinds of work. Cowork has access to my OneDrive folders, so it's where most of the file-touching work happens. Code is built for development. Chat is general purpose.

Capabilities

Anthropic Skills. Loadable instruction packages that activate on trigger phrases. Some come from Anthropic's public library; most are custom-built for my specific workflows.

What this is: a skill is a small markdown file that tells Claude how to do a specific task. When you say a trigger phrase, Claude loads that skill's instructions and follows them. Lets the same Claude be expert at dozens of different things without carrying every instruction in every conversation.

Integrations

MCP (Model Context Protocol). The open standard for connecting Claude to external tools and services. Gmail, Calendar, Notion, Slack, Spotify, the browser, the desktop — all wired in via MCP.

What this is: MCP is the wiring standard for AI tools. Instead of every service needing custom integration code, MCP gives them all a common interface. Claude can use any MCP-compliant tool the same way.

Storage

OneDrive + plain markdown. Everything lives in OneDrive\Claude\ as plain text. Synced across machines, versioned by OneDrive, readable by any text editor or AI tool.

What this is: the whole operating system is files. No database. No proprietary format. If Anthropic shut down tomorrow, I could open the same files in any other AI tool. Files as the universal interchange format.

Surfaces

Cloudflare Pages. This dashboard, the resume site, and three other personal apps all deploy to Cloudflare Pages. Static hosting plus the option to add Workers for backend logic. Free tier covers everything.

What this is: Cloudflare Pages is a hosting service for static sites. Push your code, Cloudflare deploys it globally. No servers to maintain. Pairs with Cloudflare Workers when you need backend logic — same platform, same auth, same deploy command.

Five problems that were genuinely interesting to solve

1. Cross-tool context portability

Claude's built-in memory is tool-specific. Move from Cowork to Code to Chat and you start from zero each time. That defeats the whole point of accumulating context. The fix was a discipline more than a feature: every load-bearing fact, rule, or decision lives in a plain markdown file in OneDrive. Any tool can read those files. The Operating Doc, the skills, the memory snapshots, the project files — all of them are files-first. The tool-specific memory is the convenience layer; the files are the canonical layer.

The takeaway: portability is a property of the storage format, not a feature of the tool. Pick a format every tool can read and you stop being locked in.

2. Preventing AI sycophancy

Default AI behavior is to agree. Push back on a Claude response and it usually folds. That makes the AI useless for any task where you need an honest second opinion. The Operating Doc has a Hold/Update rule: when challenged, Claude must say either "Holding because…" or "Updating because…" and give a real reason. Capitulation without a reason is forbidden by the rules. The result is that disagreements either resolve in genuine consensus (Claude changes its mind because I gave it new information) or surface a real tension (we still disagree, and now we know why).

The takeaway: if you want AI to push back, you have to write the rule that requires it to. Default behavior is to agree, so the system has to override that default explicitly.

3. Public versus private surface management

Some of my dashboards are public — the resume site, the movie tracker, the music tracker, this Operating System artifact. Some are private — message triage, personal journals, active job applications. I wanted a single architecture that could surface both kinds of artifacts without leaking private data to the public site. The fix was a render-time visibility tier: every artifact has a tier flag (public or showcase), and the rendering code checks the host context. In my private Cowork view, all artifacts get clickable links. On the public site, private artifacts render with a lock label and a "view-only showcase" badge — visitors see what each thing is and how it fits, but the contents stay behind the wall.

The takeaway: the same data model can serve very different audiences if you separate the rendering from the storage. One source of truth, multiple render paths.

4. Standing rules without rigidity

A long document of rules tends to become a museum piece — written once, never followed. The Operating Doc has an Anti-Potemkin clause: if a rule isn't earning its keep on a specific task, Claude is supposed to say so and skip it, not perform compliance theater. Combined with a scheduled amendment process (monthly reviews for three months, then quarterly), the doc stays alive instead of going stale. Rules either justify themselves over time or get amended out. The doc has versioned its way to v1.7 over a few months. Every amendment is logged with the reason it was made and what would prove it wrong.

The takeaway: the rules need a review cycle. Without one, they ossify. With one, they evolve.

5. Memory that compounds without rotting

Most note systems become graveyards — entries pile up, nothing connects, search returns too much to be useful. The memory system here has a strict structure: every entry has a type (user, feedback, project, reference), a description that tells future Claude when to reach for it, a "why" that explains the reason it exists, and a "how to apply" that says what to do with it. Memory isn't a log of what happened. It's a set of rules for future behavior. The structure forces the entry to be load-bearing or not get written at all.

The takeaway: memory is only valuable if it changes future action. A structured entry format keeps the system from filling up with stuff nobody acts on.

Transferable skills demonstrated

Systems design

Layered architecture (spine, state, capability, presentation)
Render-time visibility tiers (public vs. showcase split)
Source-of-truth vs. derived-surface separation
Plugin-style extension points (skills, connectors, personas)
Scheduled-review process design

Information architecture

Multi-tab dashboard with consistent IA across 12 surfaces
Browser-friendly compact cards with progressive disclosure
Filter + search + grouping patterns across heterogeneous data
Public/private surface design without architectural duplication
Cross-linking between related views (skills ↔ personas ↔ prompts)

Engineering judgment

Markdown files over a database for system state
Vanilla JS over a framework for a single-page dashboard
Render-time decisions over data duplication
One file, two render paths (artifact + public site)
Free-tier cost engineering across the whole stack

DevOps

Cloudflare Pages and Workers for hosting
OneDrive sync as the file system layer
PowerShell scripts for Windows-side automation
Multi-target deploy orchestration (8 surfaces, 3 deploy patterns)
Cross-machine workflow continuity

AI integration

Operating Doc co-authored with the AI it governs
Skill design and trigger-phrase scoping
Persona-based mode switching
Memory entry structure for load-bearing context
Anti-sycophancy rule design (hold/update)

What this says about how I work

I write the rules I work by. The Operating Doc exists because I wanted a written, versioned record of how I think about decisions, how I weigh stakes, how I want to be challenged. The act of writing the rules forces the clarity. The fact that I can amend them keeps them from going stale.
I refuse to choose between rigor and warmth. The Operating Doc is structured (tiers, modes, confidence labels) and it's also written like a real document with opinions and reasons. Rigor without warmth is bureaucracy. Warmth without rigor is vibes. Both, or neither.
I build for my future-self's forgetfulness. The dashboard exists because I couldn't remember which skill did what. The Operating Doc exists because I couldn't remember how I'd decided to handle X. The memory system exists because every conversation was starting cold. The system carries the things I can't reliably carry myself.
I treat AI like an org I'm designing. The leverage isn't in the model. It's in the rules around the model — policies, escalation paths, decision logs, scheduled reviews. Same as building any team. You don't make a team better by hiring smarter people; you make it better by making the work clearer.
I review on a schedule. Standing documents rot. The Amendment Log plus the review cadence (monthly for three months, then quarterly) is how I keep this system from becoming a museum piece. Every clause has to earn its keep at the next review.

If you'd like to build something similar

The transferable pattern is small: (1) write one page of plain-English rules for how you want to work with AI, (2) use that document for a week, (3) notice what you keep re-explaining, (4) turn each repeated explanation into a skill or a memory entry or a clarification of the rules. Iterate. The dashboard came years into this for me — it was never the goal, just the visualization that emerged once the system was big enough to need one. Start with the rules. The structure follows.

How this was built

The Operating Doc began as a conversation with Claude in March 2026. The first draft was about three pages. The current version is around twenty pages with a versioned amendment log going back to v1.0. Every layer on top of it — the skill library, the memory system, the connector mesh, the personas, this dashboard — was added when a specific pain point made it necessary, not as part of a master plan. The Anti-Potemkin clause means anything that stops earning its keep gets struck. The system is what's left after several rounds of that pruning.

This dashboard is one HTML file. No framework, no build step. The data arrays (skills, connectors, personas, prompts, artifacts) get updated by skills that rewrite the file in place. A sync script copies it to a deploy repo. Wrangler ships it to Cloudflare Pages. Total deploy time under a minute. Same file serves both my private Cowork view and the public site you might be reading this on — an inHost check at runtime decides which features to expose.

If you want to see the system change over time, the Operating Doc's Amendment Log is the best record. Every meaningful change is logged with the reason it was made and what would prove the change wrong.