TAG ARCHIVE
agentic-company
62 MARIA OS blog articles tagged agentic-company. Research on human-agent organizations, delegation boundaries, role topology, and governed autonomy. This canonical topic archive supports search engines and LLM retrieval.
Judgment OS / Decision Intelligence OS
Core MARIA OS research on turning organizational judgment into executable decision systems.
Agentic Company Architecture
Research on human-agent organizations, delegation boundaries, role topology, and governed autonomy.
Responsibility Gates and AI Governance
Safety, accountability, fail-closed gates, auditability, and human-in-the-loop control for AI agents.
Multi-Agent Mathematics
Formal models for convergence, stability, game theory, graph dynamics, and multi-agent evaluation.
Evidence, RAG, and Knowledge Governance
Evidence bundles, retrieval architecture, Graph RAG, knowledge trust, and auditable reasoning pipelines.
Agentic R&D and Judgment Science
Research operations, simulation labs, judgment science, recursive improvement, and experimental AI governance.
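Turning the Founder's Mind into a Staircase Others Can See
A MARIA OS bridge essay on translating highly abstract ideas into an intermediate language that enterprise customers, technical leads, investors, and hiring candidates can climb
Concepts such as MARIA OS, Decision OS, CEO Clone, Agent Company, harness, envelope, and reflex sound impressive on their own, but depending on the listener they can easily leave no foothold for understanding. This article sets out how to externalize the layers of abstraction in a founder's head as a staircase of principles, bodily analogies, concrete examples, and implementation evidence, rather than lowering the level of abstraction. The goal is to give customers, CTOs, investors, and engineering candidates each their own entry point without diluting the ideas.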
Dynamic Harness and Phase-Space Control: From virtual-talent to MARIA OS
Reframing runtime episodes, failure taxonomies, dynamic scorecards, repair proposals, and controlled self-healing as phase control for agentic society
The central question for agentic systems is shifting from model intelligence to runtime phase control. This article defines the Dynamic Harness as a Runtime Governance Layer that observes, evaluates, and controls the phase space of an agent runtime, connecting MARIA OS research with implementation lessons from bonginkan/virtual-talent.
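Dynamic Harness and Phase-Space Control: From virtual-talent to MARIA OS
Redefining runtime episode, failure taxonomy, dynamic scorecard, repair proposal, and controlled self-healing as phase control for the Agentic Society Runtime
The essential question in the age of AI agents is not how smart the model is, but which phase the intelligence enters and from which phase it can no longer return. Building on the Dynamic Harness implementation under way in the Producer AI of bonginkan/virtual-talent, this article defines the harness in MARIA OS as a Runtime Governance Layer and, further, as the layer that controls the phase space of the agent runtime. Organized around runtime episode, failure taxonomy, dynamic scorecard, repair proposal, and controlled self-healing, it lays out design principles for moving from static testing to dynamic control and identifies research questions for extending the approach to the enterprise OS and the Agentic Society.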
From AI Office to Agent HR OS: The Operating Stack for Human + AI Organizations
Why AI Office, AI Office Building, and Agent HR OS should be understood as one connected system for operating AI employees, not just using AI tools
Enterprise AI is moving from isolated assistants to managed AI labor. This article explains how AI Office provides the workplace layer, AI Office Building provides organizational topology, and Agent HR OS provides the HR and governance layer for recruiting, evaluating, promoting, and operating AI employees inside a Human + AI Organization.
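From AI Office to Agent HR OS: A New OS for Operating the Human + AI Organization
Reframing AI Office, AI Office Building, and Agent HR OS as a single stack for operating AI employees rather than a collection of AI tools
Enterprise AI is moving from isolated assistive tools toward managed AI labor. This article lays out the overall picture, in which AI Office provides the workplace, AI Office Building provides the organizational topology, and Agent HR OS provides the HR layer for recruiting, evaluation, promotion, and governance, and explains it as the operating stack of the Human + AI Organization.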
How Agent Office Replaces White-Collar Execution: Workflow Transfer, Organizational Redesign, and a Staged Change Roadmap
Why the real shift is not job-title extinction but the transfer of drafting, coordination, reporting, and repeatable execution into an agent operating layer
Agent Office does not first replace white-collar employees as a category. It first replaces the hidden execution layer inside white-collar work: drafting, routing, follow-up, reconciliation, reporting, and first-pass judgment. This article uses current evidence from OpenAI, OECD, ILO, Anthropic, WEF, and NIST to model which workflows move first, how fast the shift can happen, and what a practical change-management roadmap looks like.
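How Agent Office Replaces White-Collar Work: Execution-Layer Transfer, Organizational Redesign, and a Staged Roadmap
Not the extinction of job titles: the execution layer of drafting, coordination, reporting, follow-up, and first-pass judgment moves to Agent Office. Drawing on public research, this piece lays out the sequence and the change management involved
What Agent Office replaces first is not white-collar workers themselves but the execution layer inside white-collar work. This is a practitioner-oriented blog article, written in Japanese, that draws on findings from OpenAI, OECD, ILO, Anthropic, WEF, and NIST to lay out which workflows move first and how organizations change in stages.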
Command-less AI Architecture: Goal-Driven Agents That Generate Their Own Tools Without Pre-Defined Commands
Eliminating the command registry in favor of goal decomposition, plan generation, and dynamic tool synthesis
Traditional agent architectures bind agents to pre-defined command sets — fixed APIs, registered tools, and enumerated actions. This paper presents the MARIA OS command-less architecture, where agents receive goals rather than commands, decompose them into hierarchical plans, detect capability gaps, and synthesize whatever tools are needed for execution. We formalize the morphisms between Goal space G, Plan space P, and Tool space T, prove convergence of the tool space under recursive planning, and demonstrate that command-less agents achieve 3.2x higher task completion rates on novel problem classes compared to command-bound architectures.
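Command-less AI Architecture: An OS Design in Which Goal-Driven Agents Execute Autonomously Without Pre-Definition
Eliminating the command registry and achieving autonomous agent execution through Goal decomposition, Plan generation, and dynamic Tool synthesis
Conventional agent architectures are bound to pre-defined command sets. This paper presents the MARIA OS command-less architecture. Agents receive Goals rather than commands, decompose them into hierarchical Plans, detect capability gaps, and dynamically synthesize the Tools they need for execution. We formalize the morphisms between Goal space G, Plan space P, and Tool space T, and prove that the Tool space converges under recursive planning.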
Capability Gap Detection: The Metacognitive Layer That Enables Self-Extending Agents
How agents recognize what they cannot do and trigger autonomous self-extension through formal gap analysis
Self-extending agents require a prerequisite that most architectures ignore: the ability to know what they do not know. This paper formalizes capability gap detection as a metacognitive layer that compares required capabilities against the agent's capability model, classifies detected gaps, prioritizes them by urgency and impact, and decides whether to synthesize, request, delegate, or escalate. We introduce the capability coverage metric, gap entropy measure, and multi-agent gap negotiation protocol. Experimental results show that agents with formal gap detection achieve 4.1x fewer silent failures and 2.8x faster self-extension compared to agents relying on runtime error detection.
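Capability Gap Detection: A Metacognitive Architecture for Agents to Recognize Their Own Capability Shortfalls
How agents recognize what they cannot do and trigger autonomous self-extension through formal gap analysis
Self-extending agents have a prerequisite that most architectures ignore: the ability to know what they cannot do. This paper formalizes Capability Gap Detection as a metacognitive layer. It compares required capabilities against the agent's capability model, classifies the detected gaps, prioritizes them by urgency and impact, and decides whether to synthesize, request, delegate, or escalate. We introduce a capability coverage metric, a gap entropy measure, and a multi-agent gap negotiation protocol.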
Self-Modifying Agent Systems: Architecture for Agents That Rewrite Their Own Tools, Commands, and Workflows
Beyond tool creation — a formal framework for bounded self-modification with stability guarantees and immutable audit trails
Agents that merely create new tools hit a ceiling. Real operational autonomy requires agents that can modify existing tools, rewrite commands, and restructure workflows based on performance feedback. We present a formal architecture for bounded self-modification with Lyapunov stability analysis, halting guarantees, and responsibility-gated audit trails.
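Self-Rewriting Agent Systems: An Architecture for Autonomously Evolving Tools, Commands, and Workflows
Beyond tool generation: a formal framework for bounded self-modification with stability guarantees and immutable audit trails
Agents that only generate new tools hit a limit. True operational autonomy requires the ability to rewrite existing tools, commands, and workflows based on performance feedback. This article presents SMAS, a bounded self-modification architecture with Lyapunov stability analysis, halting guarantees, and responsibility-gated audit trails.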
Agent Tool Compiler: From Natural Language Intent to Executable Tool Code via Compilation Pipeline
Agents as compilers — a formal framework mapping NL intent through intermediate representation to optimized, type-safe runtime tools
Tool-generating agents are ad-hoc code producers. We reframe tool synthesis as a compilation problem: natural language intent is parsed into an Intent AST, lowered to a Tool IR (intermediate representation), optimized through security hardening and dead code elimination passes, and emitted as type-safe executable code that hot-loads into the agent runtime. This paper presents the Agent Tool Compiler architecture with formal language theory foundations.
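Agent Tool Compiler: A Compilation Pipeline from Natural Language to API Design, Code Generation, and Execution
Agents as compilers: a formal framework that transforms NL intent, via an intermediate representation, into optimized, type-safe runtime tools
Tool-generating agents are ad-hoc code producers. This article redefines tool synthesis as a compilation problem. Natural language intent is parsed into an Intent AST (an abstract syntax tree of the intent), lowered to a Tool IR (intermediate representation), run through optimization passes such as security hardening and dead code elimination, and hot-loaded into the agent runtime as type-safe executable code. We present the Agent Tool Compiler architecture, grounded in formal language theory.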
Self-Extending Agent Architecture: Capability Gap Detection, Tool Synthesis, and Autonomous Evolution Under Governance Constraints
Agents that recognize their own limitations and autonomously build the tools they need — within the safety boundaries of an operating system
Traditional AI agents are bounded by the tools humans provide. When an agent encounters a task outside its toolset, it halts and waits. This paper introduces the Self-Extending Agent Architecture (SEAA), where agents detect their own capability gaps, synthesize new tools through code generation, validate those tools in sandboxed environments, and register them into the OS runtime — all under human-governed safety constraints. We formalize the agent state model X_t = (C, T, M, R), derive the self-extension equation X_{t+1} = E_t ∘ G_t ∘ J_t(X_t), prove Capability Monotonicity under validation gates, and demonstrate the architecture within MARIA OS's hierarchical coordinate system.
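Self-Extending Agent Architecture: An OS Design in Which Agents Recognize Their Own Capability Shortfalls and Autonomously Generate Tools
Agents detect their own limits, synthesize tools through code generation, validate them in a sandbox, and register them in the OS runtime, all under governance constraints
Conventional AI agents are bound to the toolset humans provide. When they encounter an unsupported task, they halt and wait for human intervention. This paper proposes the Self-Extending Agent Architecture (SEAA), a framework in which agents autonomously detect capability gaps, synthesize tools through structured code generation, validate them in a sandboxed environment, and register them in the OS runtime. We formalize the agent state model X_t = (C, T, M, R), derive the self-extension equation X_{t+1} = E_t ∘ G_t ∘ J_t(X_t), and prove a Capability Monotonicity theorem under validation gates. We also show a concrete implementation within the MARIA OS hierarchical coordinate system.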
Agents That Write Their Own Tools: A 4-Phase Architecture for Tool Discovery, Synthesis, Validation, and Registration in Autonomous Systems
From static tool chains to self-extending capability — how MARIA OS agents create the tools they need at runtime
Normal agents wait for humans to build tools. MARIA OS agents create their own. This paper details the 4-phase tool lifecycle — Discovery, Synthesis, Validation, Registration — that enables agents to identify missing capabilities, generate tool implementations, verify correctness and safety in sandboxed environments, and hot-load new tools into the OS runtime. We formalize tool generation rate, quality convergence, and multi-agent tool sharing, and present a case study of an Audit agent creating an OCR extraction tool at runtime.
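Agents That Write Their Own Tools: A 4-Phase Design of Tool Discovery, Synthesis, Validation, and Registration
From static tool chains to self-extending capability: how MARIA OS agents generate the tools they need at runtime
Ordinary agents wait for humans to build tools. MARIA OS agents build their own. This paper details a 4-phase architecture (Discovery, Synthesis, Validation, Registration) in which agents identify missing capabilities, generate tool implementations, verify correctness and safety in a sandboxed environment, and hot-load new tools into the OS runtime. We formalize tool generation rate, quality convergence, and multi-agent tool sharing, and present a case study in which an audit agent generated an OCR extraction tool at runtime.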
Agent Capability OS: Command Registry, Tool Registry, and Capability Graph as the Three Pillars of Self-Extending Agent Architecture
Why individual agents cannot manage organizational capability — and how an OS-level abstraction solves the coordination problem
As agentic organizations scale beyond dozens of agents, managing capabilities becomes a systems-level challenge that no single agent can solve. This paper introduces the Agent Capability OS — an operating system abstraction that governs how capabilities are registered, discovered, allocated, and evolved across an agent population. We formalize three core registries (Command, Tool, Capability Graph) and prove that OS-level capability management achieves O(log N) discovery latency versus O(N^2) in decentralized approaches. A case study of a 54-agent audit office demonstrates how the Capability OS manages 200+ tools across 6 organizational floors while maintaining zero capability conflicts.
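Agent Capability OS: An OS Design That Manages Capabilities with a Command Registry, Tool Registry, and Capability Graph
Why individual agents cannot manage capabilities at the organizational level, and the solution an OS-level abstraction provides
Once an agent organization grows to dozens of agents, capability management becomes a systems-level challenge that no single agent can solve. This article proposes the Agent Capability OS, an OS abstraction that governs the registration, discovery, allocation, and evolution of capabilities across the whole agent population. We formalize three core registries (Command Registry, Tool Registry, Capability Graph) and prove that OS-level capability management achieves O(log N) discovery latency. A case study of a 54-agent audit office shows more than 200 tools managed across 6 floors with zero capability conflicts.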
Tool Genesis Under Governance: How to Safely Turn Generated Code into New Commands
A formal framework for sandbox verification, permission escalation, audit trails, and rollback mechanisms that enable self-extending agent systems without sacrificing safety
When an AI agent generates code that could become a new command in a production system, every line of that code becomes an attack surface. Without governance gates between generation and registration, a self-extending agent is indistinguishable from a self-propagating vulnerability. This paper presents the MARIA OS Tool Genesis Framework: a 7-stage pipeline that transforms generated code into governed commands through sandbox verification, formal safety proofs, permission escalation models, immutable audit trails, and automatic rollback mechanisms. We formalize tool safety as a decidable property under bounded execution, derive permission escalation bounds using lattice theory, introduce the Tool Safety Index (TSI) as a composite metric, and demonstrate that governed tool genesis achieves 99.7% safety compliance with only 12% latency overhead compared to ungoverned registration. The central thesis: self-extension is not dangerous — ungoverned self-extension is.
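Tool Genesis Under Governance: How to Safely Turn Generated Code into Commands
A safety framework for self-extending agent systems built on sandbox verification, permission escalation models, audit trails, and rollback mechanisms
When code generated by an AI agent can become a new command in a production system, every line of that code becomes an attack surface. Without governance gates between generation and registry registration, a self-extending agent is indistinguishable from a self-propagating vulnerability. This paper presents the MARIA OS Tool Genesis Framework: a 7-stage pipeline that transforms generated code into governed commands, comprising sandbox verification, formal safety proofs, a lattice-theoretic permission escalation model, tamper-proof audit trails, and automatic rollback mechanisms. We prove that tool safety is decidable in polynomial time under a bounded-execution assumption, and show in a benchmark spanning 10,000 tool generation events that the framework achieves 99.7% safety compliance at 12% latency overhead. The central thesis: self-extension is not dangerous. Ungoverned self-extension is.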
MARIA OS Evaluation Harness: A Standard Testing Infrastructure for Measuring Agent Quality
Formal test categories, composite scoring, and continuous evaluation pipelines that transform agent quality from subjective assessment into reproducible engineering measurement
Agent quality cannot be managed if it cannot be measured. Traditional software testing verifies deterministic input-output mappings, but AI agents operate in stochastic, multi-step decision spaces where correctness is contextual, safety is probabilistic, and governance compliance is structural. This paper introduces the MARIA OS Evaluation Harness — a standardized testing infrastructure that defines four test categories (correctness, safety, performance, governance compliance), four primary metrics (decision accuracy, gate compliance rate, evidence quality score, latency under load), and a formal composite scoring framework. We present the harness architecture comprising a test runner, scenario generator, oracle comparator, and regression detector, all scoped through MARIA coordinates for hierarchical test targeting. We prove that the composite agent score is monotonically responsive to genuine quality improvements and demonstrate that continuous evaluation pipelines catch 94.7% of quality regressions before production deployment.
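MARIA OS Evaluation Harness: A Standard Testing Infrastructure for Measuring Agent Quality
Transforming agent quality from subjective assessment into reproducible engineering measurement through formal test categories, composite scoring, and continuous evaluation pipelines
Agent quality cannot be managed if it cannot be measured. Traditional software testing verifies deterministic input-output mappings, but AI agents operate in stochastic, multi-step decision spaces where correctness is context-dependent, safety is probabilistic, and governance compliance is structural. This paper introduces the MARIA OS Evaluation Harness, a standardized testing infrastructure that defines four test categories (correctness, safety, performance, governance compliance), four primary metrics (decision accuracy, gate compliance rate, evidence quality score, latency under load), and a formal composite scoring framework. We present a harness architecture consisting of a test runner, scenario generator, oracle comparator, and regression detector, with every component scoped through the MARIA coordinate system. We prove that the composite agent score responds monotonically to genuine quality improvements and demonstrate that continuous evaluation pipelines detect 94.7% of quality regressions before production deployment.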
Governance Load Testing: Where Does Governance Break in the 1000-Agent Era?
Stress-testing decision pipelines, approval queues, gate evaluation, and conflict detection under extreme agent concurrency to identify governance breaking points and mitigation architectures
Governance architectures designed for 10-agent teams do not survive contact with 1000 concurrent agents. Decision pipeline throughput saturates, approval queues grow unbounded, gate evaluation latency exceeds SLA windows, and conflict detection explodes as O(n^2) pairwise comparisons overwhelm detection infrastructure. This paper presents a rigorous load-testing methodology for AI governance systems, identifies precise breaking points across the MARIA OS decision pipeline, models governance bottlenecks using formal queueing theory (M/M/c and M/G/1 models), and proposes mitigation strategies including hierarchical delegation, batch approval, predictive gating, and zone-scoped conflict partitioning. We report benchmark results at 10, 100, 1000, and 10000 agent scales, demonstrating that naive governance collapses at approximately 340 concurrent agents under default configuration, while the optimized architecture sustains governance integrity up to 12000 agents with sub-second gate latency.
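As a rough illustration of the M/M/c modeling mentioned above, the sketch below computes the standard Erlang C waiting probability and mean queueing delay for an approval queue. The function name, parameter names, and the numbers in the usage note are hypothetical and are not taken from the article; the article's own model and parameters may differ.

```python
from math import factorial


def erlang_c_wait(arrival_rate, service_rate, reviewers):
    """Return (P(wait), mean queue wait) for an M/M/c approval queue.

    arrival_rate: approval requests per unit time (lambda)
    service_rate: requests one reviewer clears per unit time (mu)
    reviewers:    number of parallel reviewers or gate workers (c)

    Returns None when utilization >= 1, i.e. the queue is unstable
    and the backlog grows without bound.
    """
    offered_load = arrival_rate / service_rate   # a = lambda / mu
    utilization = offered_load / reviewers       # rho = a / c
    if utilization >= 1.0:
        return None
    top = offered_load ** reviewers / factorial(reviewers)
    below = sum(offered_load ** k / factorial(k) for k in range(reviewers))
    p_wait = top / ((1.0 - utilization) * below + top)   # Erlang C formula
    mean_wait = p_wait / (reviewers * service_rate - arrival_rate)
    return p_wait, mean_wait
```

For example, `erlang_c_wait(1.0, 2.0, 1)` reduces to the M/M/1 case, while a call such as `erlang_c_wait(10.0, 1.0, 5)` returns `None` because requests arrive faster than five reviewers can clear them, which is the unbounded-queue regime the abstract describes.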
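Governance Load Testing: Where Does Governance Break in the 1000-Agent Era?
Identifying governance breaking points and proposing mitigation architectures by stress-testing decision pipelines, approval queues, gate evaluation, and conflict detection under extreme agent concurrency
Governance architectures designed for 10 agents cannot withstand 1000 agents running concurrently. Decision pipeline throughput saturates, approval queues grow without bound, gate evaluation latency exceeds SLAs, and conflict detection overwhelms the infrastructure with O(n^2) pairwise comparisons. This paper presents a systematic load-testing methodology for AI governance systems and identifies precise breaking points in the MARIA OS decision pipeline. It models governance bottlenecks with queueing theory (M/M/c and M/G/1 models), proposes four mitigation strategies (hierarchical delegation, batch approval, predictive gating, and zone-scoped conflict partitioning), and demonstrates an expansion of governance capacity from approximately 340 agents under the default configuration to 12,000 agents under the optimized configuration. We report benchmark results at four scale points: 10, 100, 1000, and 10000 agents.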
AI Office Operating Model: Design Principles for a Virtual Office Where 10 Teams Work as a Unified Organizational OS
Formalizing the virtual office as a graph-theoretic operating system with inter-team protocols, shared resource management, and graduated autonomy boundaries
This paper presents a comprehensive architecture for a virtual AI office where 10 specialized teams — Sales, Audit, Dev, HR, Legal, Finance, Strategy, Support, QA, and R&D — operate as a unified organizational OS. We formalize inter-team communication protocols as message-passing on a directed graph, define shared resource management through capacity allocation tensors, establish team autonomy boundaries via responsibility cones, and map the entire office to the MARIA coordinate system. The model introduces meeting scheduling agents, knowledge sharing infrastructure, team performance metrics, and conflict resolution mechanisms grounded in organizational graph theory. We prove that office-level governance and team-level autonomy can coexist under a hierarchical gate structure, achieving 89% autonomous operation while preserving 100% accountability traceability.
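AI Office Operating Model: Design Principles for a Virtual Office Where 10 Teams Function as a Unified Organizational OS
Formalizing the virtual office as a graph-theoretic operating system with inter-team protocols, shared resource management, and graduated autonomy boundaries
This paper presents a comprehensive architecture for a virtual AI office in which 10 specialized teams (Sales, Audit, Dev, HR, Legal, Finance, Strategy, Support, QA, and R&D) operate as a unified organizational OS. We formalize inter-team communication protocols as message passing on a directed graph, define shared resource management through capacity allocation tensors, establish team autonomy boundaries as responsibility cones in decision space, and map the entire office to the MARIA coordinate system. The model introduces meeting scheduling agents, a knowledge sharing infrastructure, team performance metrics, and conflict resolution mechanisms grounded in organizational graph theory. Through simulation, we verify that the architecture achieves 89.3% autonomous operation while maintaining 100% accountability traceability, with inter-team decision latency under 340 ms and conflict resolution converging in fewer than 3 rounds.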
CEO Clone as Decision Interface: Persona Layer Design for Delegating Executive Judgment
A formal architecture for encoding executive cognition into an auditable, drift-resistant persona layer that delegates judgment while preserving principal authority
Executive judgment is the highest-leverage bottleneck in any organization. Every strategic decision that waits for the CEO creates queue delay across the entire enterprise. Yet delegation through human hierarchies introduces information loss, preference distortion, and accountability diffusion. This paper presents the CEO Clone — not a chatbot that mimics speech patterns, but a computational decision interface that encodes the CEO's values, risk tolerance, decision patterns, and communication style into a formally verifiable persona layer. We model judgment delegation as a principal-agent problem with information asymmetry, introduce decision fidelity metrics with drift detection, and design calibration loops that maintain clone-principal alignment over time. The architecture operates within MARIA OS governance infrastructure, ensuring every delegated decision produces an immutable audit trail with full traceability to the encoded persona parameters that produced it.
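The Decision Interface as CEO Clone: Persona Layer Design for Delegating Executive Judgment
A formal architecture that encodes executive cognition as an auditable, drift-resistant persona layer and delegates judgment while preserving the principal's authority
Executive judgment is the highest-leverage bottleneck in any organization. Every strategic decision that waits on the CEO's judgment creates queue delay across the entire enterprise. Yet delegation through human hierarchies causes information loss, preference distortion, and diffusion of responsibility. This paper presents the CEO Clone: not a chatbot that mimics the CEO's speech patterns, but a computational decision interface that encodes the CEO's values, risk tolerance, decision patterns, and communication style as a formally verifiable persona layer. We model judgment delegation as a principal-agent problem under information asymmetry, introduce decision fidelity metrics with drift detection, and design calibration loops that maintain alignment between clone and principal over the long term. The architecture operates under the MARIA OS governance infrastructure, and every delegated decision produces an immutable audit trail that is fully traceable to the persona parameters that generated it.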
Audit Universe Runtime: Agent Design for Executing Audit Procedures as Runtime Operations
Transforming ISA/JICPA standards into executable agent specifications — from sampling strategies to substantive testing, within a MARIA OS governance architecture
Traditional audit procedures are encoded in prose-based standards that resist automation. This paper presents the Audit Universe Runtime — a multi-agent execution environment within MARIA OS that compiles audit standards (ISA, JICPA) into executable agent task specifications. We formalize audit procedures as state machines, design sampling strategy agents with statistical rigor, implement real-time anomaly detection during substantive testing, and prove audit completeness through a formal coverage model. The architecture maps MARIA coordinates to engagement structures, enabling continuous auditing with immutable audit trails and human-agent collaboration gates at every materiality threshold.
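Audit Universe Runtime: An Agent Architecture for Executing Audit Procedures as Runtime Operations
Converting ISA/JICPA standards into agent execution specifications, from sampling strategies to substantive testing, within the MARIA OS governance architecture
Traditional audit procedures are written in prose-based standards that resist automation. This paper presents the Audit Universe Runtime, a multi-agent execution environment within MARIA OS. It compiles ISA and JICPA standards into executable agent task specifications, designs sampling strategy agents with statistical rigor, implements real-time anomaly detection during substantive testing, and proves audit completeness through a formal coverage model. The architecture maps MARIA coordinates to engagement structures, enabling continuous auditing with human-agent collaboration gates at every materiality threshold and immutable audit trails.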
MARIA OS Appliance Reference Architecture: Standard Configuration for On-Premise AI Governance Infrastructure
A complete hardware and software blueprint for deploying MARIA OS as a self-contained appliance — covering GPU/CPU sizing, network topology, security hardening, HA clustering, disaster recovery, and TCO analysis for regulated enterprises
Cloud-native AI platforms dominate the conversation, but regulated industries — finance, healthcare, defense, critical infrastructure — face a hard constraint: sensitive decision data cannot leave the building. This reference architecture defines the MARIA OS Appliance: a rack-mountable, air-gap-capable governance platform that runs the full multi-agent decision pipeline on-premise. We specify hardware tiers from single-node evaluation units to multi-site federated clusters, detail the software stack from OS kernel to agent runtime, prove that governance guarantees hold under network partition, and provide a TCO framework that quantifies the break-even point against cloud deployment. The result is a turnkey AI governance infrastructure that preserves data sovereignty without sacrificing capability.
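MARIA OS Appliance Reference Architecture: Standard Configuration for On-Premise AI Governance Infrastructure
A complete hardware and software blueprint for deploying MARIA OS as a self-contained appliance, covering GPU/CPU sizing, network topology, security hardening, HA clustering, disaster recovery, and TCO analysis
Cloud-native AI platforms are the mainstream, but regulated industries (finance, healthcare, defense, critical infrastructure) face a hard constraint: highly sensitive decision data cannot leave the company. This reference architecture defines the MARIA OS Appliance: a rack-mountable, air-gap-capable governance platform that runs the entire multi-agent decision pipeline on-premise. It specifies hardware tiers from single-node evaluation units to multi-site federated clusters, details the software stack from OS kernel to agent runtime, proves that governance guarantees hold under network partition, and provides a TCO framework that quantifies the break-even point against cloud deployment.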
Agentic Ethics Lab: Designing a Corporate Research Institute for Structural Ethics in AI Governance
A four-division, gate-governed research architecture that transforms ethics from philosophical declaration into executable, auditable, and evolvable system infrastructure
Ethics declarations without structural enforcement are organizational theater. This paper presents the Agentic Ethics Lab — a corporate research institute embedded within the MARIA OS governance architecture, operating as a first-class Universe with four specialized divisions: Ethics Formalization, Ethical Learning, Agentic Company Design, and Governance & Adoption. Each division runs agent-human hybrid teams under fail-closed research gates. We formalize the lab's architecture using decision graph theory, prove that self-referential governance research preserves safety invariants, and demonstrate that a corporate research institute with no revenue targets but strategic alignment outperforms both pure academic and pure product research in responsible AI advancement.
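Agentic Ethics Lab: Designing a Corporate Research Institute for Structural Ethics in AI Governance
A four-division, gate-governed research architecture that transforms ethics from philosophical declaration into executable, auditable, and evolvable system infrastructure
Ethics declarations without structural enforcement are nothing more than organizational theater. This paper introduces the Agentic Ethics Lab, a corporate research institute embedded within the MARIA OS governance architecture. The lab operates as a first-class Universe with four specialized divisions (Ethics Formalization, Ethical Learning, Agentic Company Design, and Governance & Adoption). Each division runs agent-human hybrid teams under fail-closed research gates. We formalize the lab's architecture using decision graph theory, prove that self-referential governance research preserves safety invariants, and demonstrate that a corporate research institute with no revenue targets but with strategic alignment outperforms both pure academic research and pure product research in advancing responsible AI.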
Cross-Domain Research Governance: A 12-Month Integrated Research Plan for Capital, Operational, and Physical AI Systems
Orchestrating four parallel research streams across capital decision engines, operational agentic companies, robot judgment systems, and holding integration under unified gate governance
Research programs that operate in isolation produce findings that cannot be integrated. Capital decision engines optimized without operational context misallocate resources. Operational agentic companies designed without capital awareness cannot sustain themselves. Robot judgment systems built without holding-level governance create liability gaps. This paper presents a 12-month cross-domain research plan for an Autonomous Industrial Holding that integrates four parallel streams — Capital Decision Engine (Stream A), Operational Agentic Company (Stream B), Robot Judgment OS (Stream C), and Holding Integration (Stream D) — under unified research gate governance. We formalize stream dependency graphs, derive milestone probability models using PERT/CPM analysis, introduce cross-stream conflict detection metrics, model research velocity and throughput, express gate passage probability as a function of research maturity, and quantify integration risk propagation across streams. The plan covers 20 research themes (4 streams x 5 themes each) with detailed experiment designs, statistical methodology, and KPI specifications. Research gates RG0-RG3 govern all outputs with fail-closed semantics. The central thesis: cross-domain research governance is not project management — it is a decision architecture problem that requires the same structural rigor as the systems it studies.
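To make the PERT-based milestone probabilities mentioned above more concrete, here is a minimal sketch using the textbook three-point PERT estimate and a normal approximation along a single chain of tasks. The function names and the sample durations are hypothetical, and the article's own probability models may be more elaborate.

```python
from math import erf, sqrt


def pert_estimate(optimistic, most_likely, pessimistic):
    """Classic PERT three-point estimate: (expected duration, std deviation)."""
    mean = (optimistic + 4 * most_likely + pessimistic) / 6
    std_dev = (pessimistic - optimistic) / 6
    return mean, std_dev


def on_time_probability(tasks, deadline):
    """Probability that a sequential chain of tasks finishes by `deadline`.

    tasks: list of (optimistic, most_likely, pessimistic) tuples.
    Assumes independent tasks and a normal approximation of the path total;
    at least one task must have pessimistic > optimistic so that the
    variance is nonzero.
    """
    total_mean = sum(pert_estimate(*t)[0] for t in tasks)
    total_var = sum(pert_estimate(*t)[1] ** 2 for t in tasks)
    z = (deadline - total_mean) / sqrt(total_var)
    return 0.5 * (1 + erf(z / sqrt(2)))   # standard normal CDF at z
```

For instance, `pert_estimate(2, 4, 12)` gives an expected duration of 5.0, and `on_time_probability([(2, 4, 12)], 5.0)` gives 0.5, since the deadline sits exactly at the expected duration.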
Decision Civilization Infrastructure: From Ethics-as-Architecture to the Universal Responsibility Operating System
The capstone synthesis — why the AGI era demands not smarter AI but better responsibility structures, and how MARIA OS unifies capital, physical, ethical, and organizational decisions under a single governance topology
Every decision an organization makes — from board strategy to robot arm trajectory, from capital allocation to ethical constraint evaluation — flows through an implicit responsibility structure. In most organizations, that structure is invisible, informal, and fragile. This paper presents the Decision Civilization Infrastructure: a unified mathematical framework that formalizes the entire decision space as a product manifold D = D_capital x D_physical x D_ethical x D_organizational, proves that responsibility is a conserved quantity under decision composition, derives scaling theorems for governance preservation as systems grow, and demonstrates that all prior MARIA OS research programs — ethics formalization, ethical learning, agentic company design, investment engines, robot judgment, responsibility decomposition, gate control theory, and quality convergence — are projections of a single underlying architecture. We introduce a category-theoretic view of decision composition across domains, establish information-theoretic bounds on decision quality, and prove convergence of all subsystems toward a stable governance attractor. The competitive moat is not AI capability but structural responsibility: mathematics, reproducibility, and fail-closed architecture that compounds over time.
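Decision Civilization Infrastructure: From Ethics-as-Architecture to the Universal Responsibility Operating System
The capstone synthesis paper: what the AGI era demands is not smarter AI but better responsibility structures, and how MARIA OS unifies capital, physical, ethical, and organizational decisions under a single governance topology
Every decision an organization makes, from board strategy to robot arm trajectories and from capital allocation to the evaluation of ethical constraints, flows through an implicit responsibility structure. In most organizations that structure is invisible, informal, and fragile. This paper presents the Decision Civilization Infrastructure: a unified mathematical framework that formalizes the entire decision space as a product manifold D = D_capital x D_physical x D_ethical x D_organizational, proves that responsibility is a conserved quantity under decision composition, derives scaling theorems for governance preservation as systems grow, and demonstrates that all prior MARIA OS research programs (ethics formalization, ethical learning, agentic company design, investment engines, robot judgment, responsibility decomposition, gate control theory, and quality convergence) are projections of a single underlying architecture. We introduce a category-theoretic view of decision composition, establish information-theoretic bounds on decision quality, and prove that all subsystems converge to a stable governance attractor. The competitive moat lies not in AI capability but in structural responsibility: mathematics, reproducibility, and fail-closed architecture that compound over time.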
Mission-Constrained Optimization in Agentic Companies
A Mathematical Framework for Value-Preserving Goal Execution
Local goal optimization often conflicts with organizational Mission. We formalize this conflict as a constrained optimization problem over a 7-dimensional Mission Value Vector, derive the alignment score and penalty-based objective, and present a three-stage decision gate architecture that prevents value erosion while preserving goal-seeking performance.
Survival Optimization and Mission Constraint Theory
Does Evolutionary Pressure Reduce Organizations to Pure Survival Machines? A Mathematical Analysis of Directed vs. Undirected Evolution
When organizations are modeled as evolutionary subjects, does the theoretical limit reduce to survival-probability maximization? This paper examines two regimes — unconstrained local optimization (λ→0) where ethics and culture are mere byproducts, and Mission-constrained optimization where evolution gains direction. We derive the survival-alignment tradeoff curve S = S₀·exp(−αD), prove Lyapunov stability of Mission erosion dynamics under dual-variable feedback control, present 7-dimensional phase diagrams for operational monitoring, and demonstrate a civilization-type phase transition where accumulated institutional improvements qualitatively change the system's risk profile.
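The tradeoff curve S = S₀·exp(−αD) can be evaluated directly. The sketch below is only an illustration: the abstract does not define the symbols, so reading S₀ as a baseline survival level, α as a sensitivity constant, and D as a distance from the Mission is an assumption, as are the function name and the sample values.

```python
from math import exp


def survival(s0, alpha, d):
    """Evaluate S = S0 * exp(-alpha * D).

    s0:    survival level at D = 0 (assumed interpretation)
    alpha: decay sensitivity (assumed interpretation)
    d:     deviation D, assumed non-negative
    """
    return s0 * exp(-alpha * d)


# Tabulate the curve for a few hypothetical deviation values.
for d in (0.0, 0.5, 1.0, 2.0):
    print(f"D={d:.1f}  S={survival(1.0, 0.8, d):.3f}")
```

Under this form, S equals S₀ at D = 0 and decays exponentially as D grows, so each additional unit of D multiplies S by the same factor exp(−α).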
Metacognition in Agentic Companies: Why AI Systems Must Know What They Don't Know
Latent governance density, observable metacognitive coverage, and the stability bounds of self-governing enterprises
We formalize an agentic company as a graph-augmented constrained Markov decision process G_t = (A_t, E_t, S_t, Pi_t, R_t, D_t), distinguish latent governance density D_t from observable constrained-candidate coverage D_hat_t on router-generated Top-K actions, and define damping via kappa_t = kappa(D_hat_t). The exact local contraction condition is (1 - kappa_t) lambda_max(W_t) < 1, while the buffered operating envelope lambda_max(W_t) < 1 - kappa_t preserves adaptation headroom. Governance constraints thereby function as organizational metacognition: each constraint is a point where the system observes its own behavior. Planet-100 simulations validate that buffered role specialization emerges in the intermediate governance regime.
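The two stability conditions stated above can be checked side by side. This is a minimal sketch that takes lambda_max(W_t) and kappa_t as plain numbers; the function names and sample values are hypothetical, and how the article computes W_t or kappa(D_hat_t) is not specified here.

```python
def exact_contraction(lambda_max, kappa):
    """Exact local contraction condition: (1 - kappa) * lambda_max < 1."""
    return (1.0 - kappa) * lambda_max < 1.0


def buffered_envelope(lambda_max, kappa):
    """Buffered operating envelope: lambda_max < 1 - kappa."""
    return lambda_max < 1.0 - kappa


# Hypothetical operating points: (lambda_max, kappa).
for lam, kap in [(0.7, 0.2), (0.9, 0.2), (1.2, 0.2), (1.5, 0.2)]:
    print(lam, kap, exact_contraction(lam, kap), buffered_envelope(lam, kap))
```

For kappa between 0 and 1 and a non-negative lambda_max, any point inside the buffered envelope also satisfies the exact condition, so the gap between the two is the adaptation headroom the abstract refers to. In the sample points, (0.9, 0.2) and (1.2, 0.2) contract locally but sit outside the buffered envelope.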
Institutional Design for Agentic Societies: Meta-Governance Theory and AI Constitutional Frameworks
From Enterprise Governance to AI Constitutions: How Institutional Economics and Meta-Governance Theory Stabilize Multi-Agent Societies
Multi-agent AI societies require more than individual metacognition: they also require institutional design. This article formalizes agentic-company governance, derives social objective functions for AI-human ecosystems, establishes the Speed Alignment Principle as a stability condition, and presents an AI-constitution model with revision rules. In simulations across 600 runs, adaptive institutional frameworks reduced spectral radius from 1.14 to 0.82 while maintaining audit scores above 0.85.
Meta-Insight Under Distribution Shift: Change-Point Governance Loops for Enterprise Agentic Systems
An operational architecture for detecting non-stationarity, throttling unsafe adaptation, and restoring decision quality under drift
This article outlines change-point detection, bounded policy updates, and fail-closed escalation for distribution-shift governance.
Counterfactual Escalation Policy: Meta-Insight Routing for High-Impact Human Review
Estimate intervention value before handoff to reduce unsafe approvals and unnecessary escalations
Escalation is triggered when the estimated causal benefit of human review exceeds the review cost, rather than by model confidence alone.
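A minimal sketch of the escalation rule summarized above, assuming the benefit of review can be approximated as the expected loss it avoids. The `ReviewEstimate` fields and the `intervention_value` formula are illustrative assumptions, not the article's estimator.

```python
from dataclasses import dataclass


@dataclass
class ReviewEstimate:
    """Hypothetical inputs for one pending decision (all names are illustrative)."""
    p_error_without_review: float  # estimated chance the agent's action is wrong
    p_error_with_review: float     # estimated chance it is still wrong after review
    loss_if_wrong: float           # cost of a wrong action, same unit as review_cost
    review_cost: float             # cost of one human review


def intervention_value(e: ReviewEstimate) -> float:
    """Expected loss avoided by handing the decision to a human reviewer."""
    return (e.p_error_without_review - e.p_error_with_review) * e.loss_if_wrong


def should_escalate(e: ReviewEstimate) -> bool:
    """Escalate only when the estimated benefit of review exceeds its cost."""
    return intervention_value(e) > e.review_cost


# Low confidence but low impact: review costs more than it is expected to save.
low_impact = ReviewEstimate(0.40, 0.10, loss_if_wrong=10.0, review_cost=5.0)
# High confidence but high impact: a small error rate still justifies review.
high_impact = ReviewEstimate(0.05, 0.01, loss_if_wrong=1000.0, review_cost=5.0)

print(should_escalate(low_impact), should_escalate(high_impact))
```

Under these assumed numbers, the low-confidence case is not escalated while the high-confidence, high-impact case is, which is the behavior a confidence-only threshold would get backwards.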
Memory Stratification for AI Governance: A Rate-Distortion Framework for Retention Decisions
Use information theory to decide what enterprise AI systems should remember, summarize, or discard
Rate-distortion memory policy retains high-utility context while limiting latency, privacy risk, and contradiction noise.
Securing Recursive AI Feedback Loops: Adversarial Reflexivity Hardening for Meta-Insight Systems
Defense framework for prompt injection, feedback poisoning, and policy-hijack attacks in self-improving loops
Layered provenance checks, anomaly scoring, and quarantine rules harden adaptive loops while preserving auditability.
Causal Analysis of Organizational Learning Rate: OLR Decomposition for Intervention Attribution
From correlation-heavy dashboards to intervention-level attribution in meta-insight governance systems
Causal OLR decomposition attributes observed learning-rate gains to specific interventions, improving budget and policy allocation decisions.
Causal-Temporal Knowledge Graph for AI Governance: Path-Specific Responsibility Attribution
A deep research framework for path-specific accountability, time-aware causality, and audit-grade explanation in enterprise AI
A temporal responsibility graph enables path-level causal attribution and faster, more reproducible root-cause analysis.
Governing Emergent Role Specialization: Stability Laws for Agentic Companies Under Constraint Density
A mathematical framework for calibrating governance in self-organizing enterprises
We distinguish the exact contraction condition `(1 - D) · λ_max(A) < 1` from the conservative operating envelope `λ_max(A) < 1 - D`, giving enterprise architects a rigorous way to tune governance density in agentic organizations.
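The two conditions in this abstract can be compared directly. The sketch below assumes `density` (D) lies in [0, 1] and that `lambda_max` is a non-negative spectral radius supplied by the caller; the function name and the regime labels are hypothetical.

```python
def governance_regime(lambda_max: float, density: float) -> str:
    """Classify an operating point against the two stability conditions.

    Assumes 0 <= density <= 1 and lambda_max >= 0 (both are assumptions of
    this sketch, not statements from the article).
    """
    if not 0.0 <= density <= 1.0:
        raise ValueError("density is assumed to lie in [0, 1]")
    if lambda_max < 0.0:
        raise ValueError("lambda_max is assumed to be non-negative")

    # Conservative operating envelope: lambda_max(A) < 1 - D.
    if lambda_max < 1.0 - density:
        return "inside_envelope"
    # Exact contraction condition: (1 - D) * lambda_max(A) < 1.
    if (1.0 - density) * lambda_max < 1.0:
        return "contractive_no_buffer"
    return "not_contractive"


for lam in (0.5, 0.9, 1.6):
    print(lam, governance_regime(lam, density=0.3))
```

With D in [0, 1], any point inside the envelope also satisfies the exact condition, since `lambda_max < 1 - D` implies `(1 - D) * lambda_max < (1 - D)**2 <= 1`. The middle label marks points that still contract but have no headroom left.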
The Algorithm Stack for Agentic Organizations: 10 Essential Algorithms Mapped to a 7-Layer Architecture
Beyond generative AI: a practical computational substrate for self-governing enterprises
An agentic company is not built on generative AI alone. We present 10 core algorithms across language, tabular prediction, state-transition control, graph structure, and anomaly detection, organized into a 7-layer architecture for enterprise governance workloads.
Transformer Architecture for Agentic Language Intelligence: Self-Attention as the Cognitive Layer of Enterprise Decision Systems
How self-attention enables multi-agent context fusion, decision-log comprehension, and hierarchical organizational reasoning
Transformer architectures are central to enterprise language understanding, but production decision systems require additional design constraints. This paper formalizes transformers as the Cognition Layer (Layer 1) of the agentic company stack, introduces cross-agent attention for organizational context fusion, adapts positional encoding to hierarchical coordinates, and outlines training objectives for decision logs, contracts, meeting notes, and specification documents. In evaluated MARIA OS workloads, coordinate-aware attention reduced cross-agent context fusion error by 34% versus standard multi-head attention, and hierarchical positional encoding improved organizational structure extraction F1 by 28%.
Gradient Boosting for Enterprise Decision Prediction: XGBoost and LightGBM as the Decision Layer of Agentic Companies
Why enterprise data is often tabular and how gradient boosting ensembles support approval prediction, risk scoring, and outcome estimation
While deep learning dominates many unstructured tasks, enterprise decision data is frequently tabular: structured features describing decisions, agents, contexts, and outcomes. This paper formalizes gradient boosting (XGBoost/LightGBM) as the Decision Layer (Layer 2) of the agentic company stack, details feature-engineering patterns for enterprise decision tables, and introduces SHAP-based explainability workflows for governance audits. Across evaluated datasets, the approach achieved 91.3% approval-prediction accuracy, 0.94 AUC on risk scoring, and full SHAP traceability integrated with MARIA OS responsibility gates.
Random Forest for Interpretable Organizational Decision Trees: Extracting Governance Logic from Ensemble Structure
How bagging-based tree ensembles reveal decision-branch structure, critical governance variables, and auditable policy trees
While gradient boosting often targets predictive accuracy, random forests provide a complementary strength: structural interpretability. This paper positions random forests as an interpretability engine within the Decision Layer (Layer 2), showing how ensemble structure surfaces governance logic, highlights key variables through permutation/impurity importance, and yields auditable policy trees. In evaluated workloads, random-forest feature importance reached 0.93 rank correlation with domain-expert rankings, extracted trees matched 89% of documented governance policies, and out-of-bag error supported validation in data-constrained settings.
Markov Decision Processes for Business Workflow State Control: Formalizing the Agentic Company as a State Transition System
How MDPs, Bellman equations, and policy optimization support workflow control, responsibility decomposition, and gate-constrained automation
The agentic company can be modeled as a state-transition system. Business workflows move through discrete states — proposed, validated, approved, executed, completed — with transitions governed by policies balancing efficiency, risk, and human authority. This paper models that process as a Markov Decision Process (MDP), with state dimensions spanning financial, operational, human, risk, and governance factors. We derive Bellman equations for policy optimization, analyze gate-constrained MDP behavior when specific transitions require human approval, and map the MARIA OS decision pipeline to a finite-horizon MDP with responsibility constraints. In tested workflow graphs, policy iteration converged within 12 iterations and yielded 23% throughput improvement over heuristic routing while keeping governance compliance at 100%.
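As a toy illustration of a gate-constrained MDP over the workflow states named above, the sketch below runs value iteration (a Bellman-backup method, swapped in here for the policy iteration the abstract reports) on a deterministic chain. The action names, rewards, and the `GATED` set are invented for illustration only.

```python
# state -> {action: (next_state, reward)}; all numbers are hypothetical.
TRANSITIONS = {
    "proposed": {"advance": ("validated", -1.0)},
    "validated": {
        "auto_approve": ("approved", -1.0),      # cheaper, but bypasses the human
        "request_approval": ("approved", -3.0),  # slower, human-approved
    },
    "approved": {"advance": ("executed", -1.0)},
    "executed": {"advance": ("completed", 10.0)},
    "completed": {},  # terminal state
}

# Gate: (state, action) pairs the agent may not take on its own.
GATED = {("validated", "auto_approve")}


def value_iteration(transitions, gated, gamma=1.0, tol=1e-9, max_sweeps=100):
    """Bellman optimality backups restricted to gate-permitted actions."""
    values = {state: 0.0 for state in transitions}
    policy = {}
    for _ in range(max_sweeps):
        delta = 0.0
        for state, actions in transitions.items():
            allowed = {a: t for a, t in actions.items() if (state, a) not in gated}
            if not allowed:
                continue  # terminal state, value stays 0
            best_action, best_value = max(
                ((a, r + gamma * values[nxt]) for a, (nxt, r) in allowed.items()),
                key=lambda pair: pair[1],
            )
            delta = max(delta, abs(best_value - values[state]))
            values[state] = best_value
            policy[state] = best_action
        if delta < tol:
            break
    return values, policy


gated_values, gated_policy = value_iteration(TRANSITIONS, GATED)
free_values, free_policy = value_iteration(TRANSITIONS, set())
print(gated_policy["validated"], gated_values["proposed"])
print(free_policy["validated"], free_values["proposed"])
```

The gate is modeled as a mask on the action set, so the optimizer never sees the bypassing action. The gap between the two values at `proposed` is the price of the approval requirement in this toy setup.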
Actor-Critic Reinforcement Learning for Gated Autonomy: PPO-Based Policy Optimization Under Responsibility Constraints
How Proximal Policy Optimization enables medium-risk task automation while respecting human approval gates
Gated autonomy requires reinforcement learning that respects responsibility boundaries. This paper positions actor-critic methods — specifically PPO — as a core algorithm in the Control Layer, showing how the actor learns policies, the critic estimates state value, and responsibility gates constrain the action space dynamically. We derive a gate-constrained policy-gradient formulation, analyze PPO clipping behavior under trust-region constraints, and model human-in-the-loop approval as part of environment dynamics.
Multi-Armed Bandits for Enterprise Strategy Optimization: Thompson Sampling, UCB, and Contextual Bandits in Agentic Organizations
How exploration-exploitation algorithms form the fifth layer of the agentic company architecture
Enterprises continually face the exploration-exploitation dilemma: whether to exploit known strategies or test potentially better alternatives. This paper formalizes multi-armed bandits as the Exploration Layer (Layer 5), covering Thompson sampling with Beta priors, UCB confidence bounds, contextual bandits for personalized decisions, and Bayesian optimization for business hyperparameter tuning. We provide enterprise-oriented regret analysis and describe integration with the MARIA OS strategy engine.
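For readers unfamiliar with Thompson sampling with Beta priors, the following is a minimal Bernoulli-bandit sketch using only the Python standard library. The arm success rates, round count, and function name are made up for illustration and do not come from the paper.

```python
import random


def thompson_sampling(true_rates, rounds, seed=0):
    """Run Beta-Bernoulli Thompson sampling and return pull counts per arm.

    true_rates are hidden success probabilities used only to simulate rewards.
    """
    rng = random.Random(seed)
    n_arms = len(true_rates)
    successes = [0] * n_arms
    failures = [0] * n_arms
    pulls = [0] * n_arms

    for _ in range(rounds):
        # Draw one plausible success rate per arm from its Beta posterior,
        # starting from a uniform Beta(1, 1) prior.
        samples = [
            rng.betavariate(1 + successes[i], 1 + failures[i])
            for i in range(n_arms)
        ]
        arm = max(range(n_arms), key=samples.__getitem__)

        # Simulate a Bernoulli reward and update that arm's posterior counts.
        if rng.random() < true_rates[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1
        pulls[arm] += 1

    return pulls


pulls = thompson_sampling([0.2, 0.5, 0.8], rounds=2000)
print(pulls)
```

Exploration happens implicitly: arms with few observations have wide posteriors and occasionally produce the highest sample, while pulls concentrate on the best arm as evidence accumulates.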
Graph Neural Networks for Organizational Network Dynamics: Message-Passing, Spectral Convolutions, and Influence Propagation in Agentic Hierarchies
How GNNs form the Structure Layer that models agent dependencies, information flow, and hierarchical topology in self-governing enterprises
Agentic companies can be modeled as graph structures, where agents connect through dependencies, information channels, and approval chains. This paper formalizes Graph Neural Networks as the Structure Layer (Layer 3), covering message-passing networks for organizational flow, spectral convolutions for hierarchy discovery, graph attention for dynamic topology, and link prediction for emerging dependencies. We also analyze influence-propagation matrices and spectral-radius indicators for governance stability, and describe integration with the MARIA OS Universe visualization.
Clustering Algorithms for Emergent Agent Role Specialization
How k-means, DBSCAN, and hierarchical clustering form the computational mechanism of organizational role formation
Role specialization in agentic companies can be analyzed as a clustering phenomenon. We show how k-means supports initial role assignment, DBSCAN discovers natural clusters without fixed role counts, and hierarchical clustering models nested organizational structure. We derive a role-specialization equation and describe how MARIA OS applies dynamic re-clustering for organizational adaptation.
Anomaly Detection for Agentic System Safety and Deviation Control
Isolation Forest and Autoencoder reconstruction error as the computational safety layer for self-governing enterprises
Agentic systems can produce operational deviations that require early detection and controlled response. This paper combines Isolation Forest anomaly scoring with Autoencoder reconstruction error to build a layered safety monitor. We define an anomaly-throttle-freeze response cascade and show how the MARIA OS stability guard applies the spectral-radius condition `spectral_radius < 1 - governance_density` in runtime governance.
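One way to picture the response cascade and the stability check is sketched below. The thresholds, the use of `max` to combine the two detector scores, and the rule that a stability violation forces at least a throttle are all assumptions of this sketch; the abstract does not specify them.

```python
def combined_score(isolation_score: float, reconstruction_score: float) -> float:
    """Combine two detector scores, each assumed to be normalized to [0, 1]."""
    return max(isolation_score, reconstruction_score)


def guard_response(
    anomaly_score: float,
    spectral_radius: float,
    governance_density: float,
    flag_at: float = 0.5,      # hypothetical threshold
    throttle_at: float = 0.7,  # hypothetical threshold
    freeze_at: float = 0.9,    # hypothetical threshold
) -> str:
    """Map an anomaly score and the stability condition to a response stage."""
    # Stability condition quoted in the abstract.
    stable = spectral_radius < 1.0 - governance_density

    if anomaly_score >= freeze_at:
        return "freeze"
    if anomaly_score >= throttle_at or not stable:
        return "throttle"
    if anomaly_score >= flag_at:
        return "flag"
    return "normal"


score = combined_score(isolation_score=0.3, reconstruction_score=0.75)
print(guard_response(score, spectral_radius=0.5, governance_density=0.3))
```

Taking the maximum of the two scores means either detector alone can raise the response level, which is a conservative choice for a safety monitor.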
Agentic Company Structural Design: Responsibility Topology, Conflict-Driven Learning, and Self-Evolving Governance for Human-Agent Organizations
Modeling the enterprise as a responsibility topology across human-agent decision nodes
This paper explores corporate design where the primary unit is the decision node and its responsibility allocation, not only role or department labels. It introduces five linked research programs that model the enterprise as a weighted directed responsibility graph whose topology evolves through conflict-driven learning. We formalize human-agent responsibility matrices, derive scalable topology conditions, define health metrics for hybrid organizations, and model governance as a self-evolving decision graph with gate-managed policy transitions.