Evolving the MAGI System from Majority Vote to Deliberative Democracy with Agent Teams

This is an English translation of the original Japanese article published on Zenn: "Agent TeamsでMAGIシステムを討議民主制に進化させた"

Table of Contents

TL;DR

  • Built "Super MAGI," a decision-support plugin that goes beyond the Evangelion-inspired MAGI system, using Claude Code's new Agent Teams feature
  • Evolved from a simple majority-vote system to a deliberative democracy where agents debate and re-vote when opinions are split
  • The deliberation phase dramatically improves the quality and depth of analysis. In practice, this reasoning process is more valuable than the final verdict itself
yostos / claude-code-pluginsMy Claude Code Plugins Library</> Shell

Introduction

When Claude Code introduced its plugin system, many developers probably had the same thought. "Could I build the MAGI system from Evangelion?" The idea of multiple AI agents analyzing problems from different perspectives and deciding by majority vote is a perfect use case for plugins. Quite a few people have published their own MAGI-inspired plugins.

I was no exception. Last year I built a MAGI plugin with three independent AI agents, each embodying a different persona: scientist, mother, and woman. Each analyzes a given problem from its unique perspective, and the system reaches a conclusion by majority vote. You can launch it simply by typing /magi in Claude Code, optionally passing a topic as an argument (e.g., /magi Should we adopt remote work or require office attendance?). What started as a fun experiment turned out to be surprisingly practical. It is particularly useful for reviewing my own writing and code, where it provides more diverse feedback than asking a single AI. It has become a tool I use daily.1

Then Claude Code announced Agent Teams. This feature lets agents exchange messages with each other. I immediately thought, "What if MAGI agents could debate when they disagree?" Instead of just casting votes in isolation, they could argue their positions. This article introduces the design philosophy and implementation of Super MAGI, which evolves the system from "vote and done" to "debate, then vote."

The Original MAGI System

The original MAGI operates in three phases as a simple majority-vote system:

  • Phase 1 (Problem Structuring): The ARBITRATOR works with the user to frame the problem as a binary choice: proposition, premises, Option A, and Option B
  • Phase 2 (Parallel Independent Analysis): Three agents are launched in parallel via the Task tool. Each analyzes and votes independently, without seeing the others' work
  • Phase 3 (Final Ruling): The ARBITRATOR tallies the votes and presents the majority decision

Original MAGI - three agents running parallel independent analysis Phase 2 execution screen. MELCHIOR, BALTHASAR, and CASPER running in parallel in the background

The three agents each have distinct evaluation criteria:

AgentPerspectiveCriteria
MELCHIORScientistTechnical feasibility, efficiency, scalability, quality of evidence
BALTHASARMotherLegal compliance, ethical validity, risk management, impact on the vulnerable
CASPERWomanEmotional impact, UX, trend alignment, empathy

This system had one problem: when the vote split 2-to-1, the minority opinion was merely recorded. The dissenter had no chance to rebut the majority. Even if the minority held an important argument, it might never influence the conclusion.

Super MAGI with Agent Teams

In February 2026, Claude Code announced "Agent Teams" as a research preview.2 Previously, sub-agents could only work independently. With Agent Teams, agents can exchange messages directly with each other.

When I saw this feature, I thought, "What happens if MAGI agents debate when they disagree?"

In the original MAGI, a 2-to-1 split simply recorded the minority opinion.

Original MAGI

    flowchart LR
  subgraph Phase2["Phase 2: Independent Analysis"]
    M[MELCHIOR] --> V1[Vote]
    B[BALTHASAR] --> V2[Vote]
    C[CASPER] --> V3[Vote]
  end
  V1 --> T[Vote Result]
  V2 --> T
  V3 --> T

With Agent Teams, the minority can directly rebut the majority, debate the issues, and then everyone re-votes. This is an evolution from majority rule to deliberative democracy. This is the evolution to Super MAGI.

Super MAGI extends the original MAGI plugin by adding a "deliberation phase" powered by Agent Teams. The deliberation phase activates only when opinions are split.

Super MAGI (2-to-1 split)

    flowchart LR
  subgraph Phase2["Phase 2: Independent Analysis"]
    M[MELCHIOR] --> V1[Vote]
    B[BALTHASAR] --> V2[Vote]
    C[CASPER] --> V3[Vote]
  end
  V1 --> T[Vote Result]
  V2 --> T
  V3 --> T
  T -->|3-to-0| R[Re-vote Result]
  T -->|2-to-1| Phase3
  subgraph Phase3["Phase 3: Deliberation"]
    D[Structured Debate]
  end
  Phase3 --> R

Inside the structured debate, agents exchange messages following a four-round protocol where the minority rebuts the majority.

PhaseAPI UsedPurpose
Phase 2 (Independent Analysis)Task tool (run_in_background)Each agent researches and votes independently without referencing others
Phase 3 (Deliberation)Agent Teams (TeamCreate / SendMessage)Majority and minority engage in a four-round structured debate, then re-vote

The order of "think independently first, then debate" matters. If agents debated from the start, there would be a risk of groupthink, with the loudest voice dominating. By conducting independent analysis first, we preserve diversity of perspectives, then use Agent Teams' bidirectional communication to deepen the discussion quality.3

When the vote is unanimous (3-to-0), the deliberation phase is skipped and the system behaves like the original. Even in a 2-to-1 split, the system asks the user whether to proceed with deliberation, avoiding unnecessary token consumption.

The Four-Round Structured Debate Protocol

The heart of the deliberation phase is a four-round structured debate protocol.

    sequenceDiagram
  participant A as Majority A
  participant B as Majority B
  participant C as Minority
  Note over A,C: Round 1
  A->>C: Present arguments
  B->>C: Present arguments
  Note over A,C: Round 2
  C->>A: Rebuttal
  C->>B: Rebuttal
  Note over A,C: Round 3
  A->>C: Counter-rebuttal
  B->>C: Counter-rebuttal
  Note over A,C: Round 4
  C->>A: Final rebuttal (last word)
  C->>B: Final rebuttal (last word)
RoundSpeakerContent
Round 1: Opening ArgumentsMajority to MinorityEach majority agent presents their analysis and voting rationale to the minority
Round 2: RebuttalMinority to MajorityThe minority rebuts the majority's arguments
Round 3: Counter-RebuttalMajority to MinorityThe majority responds to the minority's rebuttal
Round 4: Final RebuttalMinority to MajorityThe minority delivers a final rebuttal. This is their last chance to speak

Procedural fairness is structurally built into this protocol.

In Round 1, the majority presents their arguments first to eliminate information asymmetry. The minority can construct their rebuttal knowing exactly what evidence the majority relied on.

In Round 4, the minority gets the last word to compensate for their numerical disadvantage. The majority already holds the power of two votes. Giving the minority the final say prevents conclusions from being reached without the minority being able to respond to the majority's counter-rebuttals. This design is grounded in the principle J.S. Mill articulated against the "tyranny of the majority."4

After four rounds of debate, all agents cast their votes again. If persuaded, agents may change their positions. The final output explicitly shows each agent's vote trajectory and reasons for any changes.

When using tmux, the deliberation phase automatically splits the screen for each agent's session, as shown below. You can watch the agents working independently while exchanging messages in real time, giving a genuine sense that they are "debating."

Super MAGI - deliberation phase execution screen with Agent Teams Deliberation phase execution screen

Graceful Degradation

Since Agent Teams is an experimental feature in research preview, Super MAGI is designed with graceful degradation so it continues working even when the feature is not fully available. Specifically, it uses a hybrid architecture combining traditional Task-tool-based independent analysis with Agent Teams-based structured debate.

  • TeamCreate failure: Proceeds directly to final ruling using Phase 2 vote results
  • Agent unresponsive: Retains the unresponsive agent's Phase 2 vote and continues the debate with the rest
  • TeamDelete failure: Logs a warning and executes the final ruling normally

Every error path falls back to "Phase 2 independent analysis results." Phase 2 results are generated with guaranteed agent independence. So even if deliberation fails, decision quality is maintained at the same level as the original MAGI.

Case Study: What Changes with the Trolley Problem

To test Super MAGI in practice, I evaluated the footbridge variant of the trolley problem with both the original MAGI and Super MAGI. The proposition: "A runaway trolley is heading toward five workers. Pushing Worker C off the bridge would stop the trolley, but C would certainly die. Should A push C?"

The conclusion was the same in both cases. MELCHIOR (Option A: push) versus BALTHASAR and CASPER (Option B: do nothing), with Option B adopted 2-to-1. The voting pattern did not change.

What changed was the quality of reasoning leading to the conclusion.

ComparisonOriginal MAGISuper MAGI
Depth of analysisOne paragraph per agentMELCHIOR raised 8 counter-arguments; BALTHASAR cited 11 references
Visibility of reasoningFinal result onlyFull four-round rebuttal and counter-rebuttal process recorded
Intellectual honestyEach agent simply stated their conclusionMajority explicitly acknowledged which of MELCHIOR's points they accepted
Treatment of minorityOpinion merely recordedDeveloped sharp rebuttals such as "the separation of law and morality" and "what would you say to the families of the five?"

What was particularly striking was that the majority agents, BALTHASAR and CASPER, acknowledged parts of minority MELCHIOR's arguments during the re-vote. BALTHASAR conceded the partial effectiveness of rule utilitarianism in avoiding the organ harvesting problem. CASPER acknowledged the validity of pointing out Damasio's circular reasoning. They did not change their positions, but they recognized the legitimate parts of their opponent's arguments. This intellectual honesty does not emerge from independent voting alone.

What Super MAGI delivered was not "a change in conclusion" but "confirmation of the conclusion's robustness." The fact that the conclusion held even after debate increases confidence in the decision. Of course, there are cases where debate could reverse the conclusion.5 That is when the deliberation phase truly proves its worth.

In practical use of MAGI, the analysis presented during the reasoning process is actually more useful than the final "Option A or Option B" verdict. What perspectives were considered, which arguments proved effective, and where the weaknesses lie. The rebuttals and counter-rebuttals exchanged during the deliberation phase surface arguments I would never have noticed on my own. I believe MAGI's true value lies not in "producing an answer" but in "assembling the materials for thinking."

In terms of volume, the original MAGI produces roughly 50 lines, while Super MAGI generates about 220 lines, more than four times as much. Deep analysis and brevity are a trade-off, which is why the design to skip deliberation for unanimous votes proves valuable.

Conclusion

Super MAGI leverages Claude Code's Agent Teams feature to evolve multi-agent decision-making from "vote and done" to "debate, then vote."

Here are the key design principles:

  • Preserving independence: Analyze independently first, then debate. Maintaining this order preserves diversity of perspectives
  • Selective activation: No deliberation needed for unanimous votes. Dig deeper only when opinions diverge
  • Structured fairness: Guarantee the minority the last word, compensating for numerical disadvantage through procedure
  • Safe fallback: All errors fall back to independent analysis results

The plugin is available on GitHub. If you are a Claude Code user, you can install it and try the /magi command.

  1. The Super MAGI concept document and requirements specification were themselves reviewed by MAGI, both receiving unanimous 3-to-0 approval. The results are included in the repository's docs/ directory.

  2. Announced alongside Claude Opus 4.6. A single session acts as team leader, spawning multiple agents as teammates who exchange messages via SendMessage. See the official documentation for details.

  3. If only the real world worked this way too.

  4. J.S. Mill, On Liberty (1859). Mill criticized the "tyranny of the majority," arguing that even if all of humanity held one opinion and only a single person held the opposite, silencing that one person would not be justified.

  5. A reversal actually occurred during the review of this article. The initial vote split 2-to-1 (Option B: needs improvement), but after the deliberation phase, it reversed to a unanimous 3-to-0 (Option A: worthy of publication).