Idea 06

Agora

Multi-agent debate system with org-chart personas and Jury Learning philosophy

Multi-Agent Simulation Decision Making

Core Problem


Solution — Model × Role Matrix

Each cell represents one debate participant. Assigning the same role on different models reveals model bias.

            | Claude | Gemini | Codex | GPT |
------------|--------|--------|-------|-----|
CTO         |   ✓    |        |       |     |
Junior Dev  |        |   ✓    |       |     |
PM          |        |        |  ✓    |     |
Designer    |   ✓    |        |       |     |
Executive   |        |   ✓    |       |     |

Key Features

1. Moderator-Secretary Separation

The Moderator manages debate flow, while the Secretary records proceedings in real-time with minority opinion highlights. Separating these roles ensures that minority viewpoints are explicitly captured rather than silently merged into consensus.

2. Jury Learning (Stanford CHI 2022)

3. 3-Tier Graceful Degradation

  1. Model fallback: Gemini Pro rate limit → Flash
  2. CLI fallback: Codex 402 → skip panelist
  3. Inline persona fallback: all external CLIs fail → Claude simulates personas directly
Principle: "Always produce a result with the best available resources"

User Scenarios

Feature Launch Decision

CTO says ready, Junior Dev worries about 70% test coverage, PM pushes for competitive timing. The final report includes majority opinion alongside a Dissenting Report preserving the Junior Dev's coverage concern.

Tech Stack Selection

The same role (Senior Dev) is assigned to both Claude and Gemini. Comparing their arguments reveals where model bias diverges from genuine technical reasoning.

Graceful Degradation in Action

5 panelists configured. Codex returns 402, Gemini hits a rate limit. The system auto-falls back — Gemini Pro → Flash, Codex panelist skipped — and the debate continues with 4 participants, producing a complete report.


Differentiation vs Existing Approaches

Existing MAD / LLM Council Agora
Goal Convergence toward "correct answer" (math, QA) Decision support for judgment calls (no single right answer)
Aggregation Majority voting / consensus Jury Learning — structurally surfaces minority opinions