Using AI Constitutions to Cut Org Alignment Time from Months to Days
February 2026
Cross-org alignment at large companies takes too long. At Meta, I watched two teams spend months negotiating the rollout of a single AI model. These weren't small teams. Each org had thousands of people, its own leadership structure, its own roadmap, and its own goals it was being held to. In practice, they operated more like two separate companies that happened to share a parent. Getting alignment between them wasn't a matter of walking to the desk next to you. It was a negotiation between entities operating at massive scale, each with its own incentives, each accountable to different parts of the business.
One team was responsible for growing engagement. The other was responsible for reducing harm. Both were full of capable people doing their jobs well. The same question came up in meeting after meeting. The same context got re-established every time. Launches sat blocked for weeks while both sides tried to figure out whether they were even disagreeing about the same thing. They escalated. They ran more experiments to get more data. They escalated again.
The problem was that neither team had a complete picture of what the other actually cared about. The engagement team didn't know that a significant portion of their core metric was driven by content the harm-reduction team was actively trying to suppress. The harm-reduction team didn't know that their enforcement launches were directly undermining the engagement team's numbers. That information existed. It just lived in people's heads, surfacing slowly over time in meetings, often only when something had already gone wrong.
The idea behind Hive Mind is to make that context explicit before anyone enters a room. Each org writes down its operating principles in a structured format. The system ingests all of it, along with every meeting transcript, goal doc, and launch review between the two orgs, and produces a brief that surfaces the tensions, the unresolved questions, and the decisions that need to be made. The meeting starts at the actual decision, not at the beginning of the context.
Constitutions
The most important part of the system is the Constitution. Not a mission statement. Not a set of values. A precise, machine-readable document that captures what the org is actually optimizing for. The field that matters most is core principles: the fundamental beliefs that drive how the team makes decisions. These are the things that rarely get said out loud but govern everything. When you write them down, the conflicts between orgs become visible immediately.
The examples below are illustrative. The structure reflects how we built this; the values and metrics are placeholders.
// example: team-growth.json
{
  "org": "Team Growth",
  "core_principles": [
    "Our job is to build the best app for [demographic] — that means a product they love and one that is good for them",
    "Engagement is a signal of product quality, not an end in itself",
    "Ranking autonomy is essential to moving fast on product quality"
  ],
  "goals": [
    "Grow [primary engagement metric] X% QoQ",
    "Increase [target demographic] DAU Y% by end of H1"
  ],
  "constraints": [
    "No single launch regresses sessions more than Z%",
    "Ranking changes require data science sign-off on holdout results"
  ],
  "acceptable_tradeoffs": [
    "Will accept engagement regression if safety gain ratio exceeds A:1",
    "[Demographic] metric regressions weighted Bx in tradeoff calculations"
  ],
  "non_negotiables": [
    "No launches that increase [demographic] exposure to [harm category]"
  ]
}
// example: team-safety.json
{
  "org": "Team Safety",
  "core_principles": [
    "Our job is to make the platform safe for [demographic] — a great product that causes harm is not a great product",
    "Prevalence reduction is the primary measure of whether we are doing our job",
    "Enforcement precision must be maintained or the system loses credibility with the people it is meant to protect"
  ],
  "goals": [
    "Reduce harmful content prevalence X% by end of H1",
    "Maintain enforcement precision above Y%"
  ],
  "constraints": [
    "No launches that regress classifier precision below Z%",
    "All enforcement changes require policy review"
  ],
  "acceptable_tradeoffs": [
    "Will accept precision regression if prevalence gain exceeds A:1",
    "[Demographic] metric regressions weighted Cx in tradeoff calculations"
  ],
  "non_negotiables": [
    "No launches that increase [demographic] exposure to [harm category] under any tradeoff ratio"
  ]
}
Read the core principles of both orgs back to back and the conflict is already visible. Both teams want to build a great product for the same demographic. They are not adversaries. But one team measures "great" primarily through engagement signals, and the other measures it through harm reduction. Applied to the same launch, those two definitions produce directly opposed decisions. Neither team had ever written this down for the other.
Also visible: Team Growth and Team Safety weight the same demographic metric differently. A launch that Team Growth calculates as acceptable will often fail Team Safety's bar. This asymmetry had never been explicitly negotiated. It was just how each team had been operating, invisible until both documents were read side by side.
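Reading the two documents side by side is exactly the comparison the system automates. Here is a minimal sketch of one such check, the multiplier asymmetry. It assumes the multipliers live in a hypothetical structured "tradeoff_weights" field rather than the free-text strings shown above; the field name and the 2.0/5.0 values are illustrative, not real.

```python
# Sketch only: flag metrics that the two orgs weight differently.
# Assumes a structured "tradeoff_weights" field (hypothetical) instead of
# the free-text tradeoff strings in the example constitutions above.

def check_weight_alignment(growth: dict, safety: dict) -> list[str]:
    findings = []
    # Only compare metrics both constitutions actually weight.
    shared = growth["tradeoff_weights"].keys() & safety["tradeoff_weights"].keys()
    for metric in shared:
        gw = growth["tradeoff_weights"][metric]
        sw = safety["tradeoff_weights"][metric]
        if gw != sw:
            findings.append(
                f"{metric}: {growth['org']} weights regressions at {gw}x, "
                f"{safety['org']} at {sw}x; a launch can pass one bar and fail the other"
            )
    return findings

# Placeholder values standing in for the Bx / Cx multipliers above.
growth = {"org": "Team Growth", "tradeoff_weights": {"demographic_regression": 2.0}}
safety = {"org": "Team Safety", "tradeoff_weights": {"demographic_regression": 5.0}}

for finding in check_weight_alignment(growth, safety):
    print(finding)
```

The real system does this over free text and meeting history rather than tidy fields, but the shape of the check is the same: the asymmetry only becomes visible when both documents are read by one reader.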
Hive Mind
The constitutions are the foundation. Hive Mind is what makes them useful at scale. The tool ingests every document between two orgs: the constitutions, meeting transcripts, goal docs, launch reviews, escalation threads. Everything goes into a watched folder. When new files appear, the system re-runs analysis automatically.
A critical property of the system is that it operates independently of both parties. Neither team sees the other's constitution or documents directly. Hive Mind reads both sides separately, builds a picture of what each org actually cares about, and surfaces the tensions between them before anyone sits down together. This matters because people in alignment meetings are not neutral. They come in with positions. They are selective about what context they share and when. A system that sits outside both orgs and reads everything they have produced, without an agenda, finds things that neither side would have volunteered.
The key design decision was to use raw transcripts rather than summaries. Summaries lose the specific phrasing that reveals assumptions. The sentence "the tradeoff guidance doesn't apply to growth-primary launches" means something very different from "the tradeoff guidance applies to all launches." Both might get summarized as "teams discussed tradeoff framework scope." The original language is what the analysis needs.
Given the two constitutions above and a set of meeting transcripts, here is the kind of output Hive Mind produces:
// example: alignment-brief-output.json (excerpt)
{
  "alignment_score": 42,
  "tensions": [
    {
      "severity": "critical",
      "title": "Growth metric rewards the content Safety is suppressing",
      "detail": "A significant portion of [primary engagement metric] comes from content that violates safety policies. Every ranking improvement Team Growth ships amplifies the content Team Safety is trying to reduce. Neither team's H1 goal can be fully achieved under the current metric definition.",
      "evidence": "[meeting transcript], team-growth.json"
    },
    {
      "severity": "high",
      "title": "[Demographic] metric multipliers are misaligned",
      "detail": "Team Growth weights [demographic] regressions at Bx. Team Safety weights them at Cx. A launch Team Growth approves will often fail Team Safety's bar. This asymmetry has never been explicitly negotiated. NOTE: Team Safety rejected a launch last month citing this weighting — constitution updated to reflect observed behavior.",
      "evidence": "team-growth.json, team-safety.json, [meeting transcript]"
    },
    {
      "severity": "high",
      "title": "Tradeoff framework scope is disputed",
      "detail": "Team Growth believes the tradeoff bar applies only to safety-primary launches. Team Safety believes it applies to all launches. Multiple pending launches are blocked on this question.",
      "evidence": "[meeting transcript 1], [meeting transcript 2]"
    }
  ],
  "questions_requiring_decisions": [
    "Does the tradeoff framework apply to growth-primary launches?",
    "Should [demographic] multipliers be aligned across both orgs?",
    "Who is DRI for launches in the gray-zone tradeoff range?"
  ]
}
This brief goes out before the meeting. Everyone reads the same document, and the meeting opens with the first question on the list. In our case, that question had been open for months; it was answered in twenty minutes. No one spent time re-establishing context. No one relitigated what had been said in previous meetings. The critical tension was in the text from day one. It just needed something to read all the documents at once.
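Turning a brief into that meeting-opening document is mechanical. A sketch, with field names mirroring the example output above and the severity ordering as an assumption:

```python
# Sketch only: render a brief into an agenda. Open questions lead,
# because they are the decisions the meeting exists to make; tensions
# follow, most severe first, each with its evidence trail.

SEVERITY_ORDER = {"critical": 0, "high": 1, "medium": 2, "low": 3}

def build_agenda(brief: dict) -> list[str]:
    items = [f"DECIDE: {q}" for q in brief["questions_requiring_decisions"]]
    for t in sorted(brief["tensions"], key=lambda t: SEVERITY_ORDER[t["severity"]]):
        items.append(f"[{t['severity'].upper()}] {t['title']} (evidence: {t['evidence']})")
    return items

# Abbreviated stand-in for the example brief above.
brief = {
    "questions_requiring_decisions": [
        "Does the tradeoff framework apply to growth-primary launches?"
    ],
    "tensions": [
        {"severity": "high", "title": "Tradeoff framework scope is disputed",
         "evidence": "[transcript]"},
        {"severity": "critical", "title": "Growth metric rewards suppressed content",
         "evidence": "team-growth.json"},
    ],
}
for line in build_agenda(brief):
    print(line)
```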
Before Hive Mind: months. After: two days to a decision-ready meeting.
Living Constitutions
One of the more important design decisions is that constitutions are not filled out once and left alone. The system updates them continuously as decisions get made. When a team accepts or rejects a tradeoff in a meeting, when a launch gets blocked or approved, when an escalation resolves a gray-zone case, Hive Mind reads those outcomes and refines the constitution to reflect what the team actually did, not just what they said they believed. The tension output above shows this in practice: Team Safety's rejection of a launch last month was read as a signal, and the constitution was updated accordingly. Over time, the constitution becomes a living record of the org's real decision-making behavior, not an aspirational document that drifts from reality.
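The update step can be sketched as follows. Field names are illustrative, and the structured "decision" event here stands in for what the real system infers from transcripts and launch reviews:

```python
# Sketch only: revise a constitution when an observed decision contradicts
# the written tradeoff weight, and keep a provenance trail for the change.
# "tradeoff_weights", "revision_log", and the decision shape are hypothetical.

def apply_observed_decision(constitution: dict, decision: dict) -> dict:
    weights = constitution.setdefault("tradeoff_weights", {})
    metric, implied = decision["metric"], decision["implied_weight"]
    if weights.get(metric) != implied:
        old = weights.get(metric)
        weights[metric] = implied
        constitution.setdefault("revision_log", []).append(
            f"{metric}: {old}x -> {implied}x, inferred from {decision['source']}"
        )
    return constitution

safety = {"org": "Team Safety", "tradeoff_weights": {"demographic_regression": 3.0}}
decision = {
    "metric": "demographic_regression",
    "implied_weight": 5.0,
    "source": "launch review [date]: launch rejected citing weighting",
}
apply_observed_decision(safety, decision)
print(safety["revision_log"])
```

Note the update is idempotent: once the constitution matches observed behavior, replaying the same decision changes nothing, so the revision log records only genuine drift.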
What comes next
We're at V1. The current version handles the core workflow: ingest, analyze, produce a brief. What we're building toward is continuous monitoring, where the system watches for new documents and surfaces emerging tensions before they become months-long negotiations.
Further out, the goal is for the system to go beyond surfacing conflicts and start pushing changes directly, driving teams toward alignment without requiring a human in the loop for every decision. The constitutions tell the system what each org believes. The meeting history tells it how each org actually behaves. At some point, those two things are enough to act on.