EngineeringJune 1, 202619 min read

AIエージェントが業務で失敗する理由は、LLMではなくハーネス不足である

PoCでは動くのに本番化できない原因を、目的・権限・記憶・停止条件・復旧経路・監査証跡の設計から捉える

企業AIエージェントが失敗する主因は、モデル性能だけではない。目的、権限、記憶、品質、停止条件、復旧経路、監査証跡を囲うハーネスがないまま、AIに行動させようとしていることが本質である。

AI-agentDynamic-Harnessenterprise-AIHITLMARIA-OSjapanese
Safety & GovernanceMay 30, 202638 min read

Operational AI Governance as a Technical Moat: A Realistic Assessment of MARIA OS

Why internal auto-recovery, external HITL, responsibility envelopes, and fail-closed gates matter more than another agent demo

The next credible enterprise AI advantage will not come from claiming full autonomy. It will come from knowing where autonomy must stop, how recovery paths are tested, and how human accountability survives at production speed. This article gives a realistic assessment of Bonginkan's MARIA OS architecture and the operational evidence required to turn that architecture into a durable technical moat.

MARIA-OStechnical-moatagent-governanceHITLfail-closedoperational-ai
Safety & GovernanceMay 30, 202640 min read

運用されるAIガバナンスは技術的優位性になるか:MARIA OSの現実的評価

内部では自動復旧を攻め、外部ではHITLを厚くする。責任契約・fail-closed・回復経路を実装レイヤーで見る

企業AIの次の優位性は、完全自律を主張することではなく、どこで止めるか、どう復旧するか、人間の責任をどう残すかを本番運用で証明することから生まれる。本稿では、ボンギンカンのMARIA OSが持ちうる技術的優位性と、グローバル・日本市場での現実的な位置づけを、過剰な断定を避けて評価する。

MARIA-OStechnical-moatagent-governanceHITLfail-closedoperational-aijapanese
Safety & GovernanceFebruary 14, 202644 min read

Recursive Self-Improvement Under Governance Constraints: Governed Recursion via Contraction Mapping and Lyapunov Stability

How MARIA OS's Meta-Insight turns unbounded recursive self-improvement into convergent self-correction while preserving governance constraints

Recursive self-improvement (RSI) — an AI system improving its own capabilities — is both promising and risky. Unbounded RSI raises intelligence-explosion concerns: a system improving faster than human operators can evaluate or constrain. This paper presents governed recursion, a Meta-Insight framework in MARIA OS for bounded RSI with explicit convergence guarantees. We show that the composition operator M_{t+1} = R_sys ∘ R_team ∘ R_self(M_t, E_t) implements recursive improvement in meta-cognitive quality, while a contraction condition (gamma < 1) yields convergence to a fixed point instead of divergence. We also provide a Lyapunov-style stability analysis where Human-in-the-Loop gates define safe boundaries in state space. The multiplicative SRI form, SRI = product_{l=1..3} (1 - BS_l) * (1 - CCE_l), adds damping: degradation in any one layer lowers overall autonomy readiness. Across simulation and governance scenarios, governed recursion retained 89% of the unconstrained improvement rate while preserving measured alignment stability.

meta-insightrecursive-self-improvementAI-safetyLyapunov-stabilitycontraction-mappinggoverned-recursionHITLalignmentMARIA-OSgovernance
Safety & GovernanceFebruary 12, 202642 min read

Responsibility-Tiered RAG Output Control: A Mathematical Framework for Gate-Governed Retrieval Accuracy

Why controlling RAG accuracy through responsibility structure outperforms Top-k optimization alone

Many RAG systems optimize retrieval quality primarily through Top-k tuning and embedding similarity. This paper adds a governance-oriented approach: responsibility-tiered gates that adjust validation intensity by risk classification. The framework reports an 82% hallucination-rate reduction on enterprise document corpora while maintaining sub-second response times for low-risk queries.

RAGresponsibility-gatesrisk-tiershallucination-reductionHITLmathematical-models
Safety & GovernanceFebruary 12, 202644 min read

Fail-Closed Gate Design for Agent Governance: Responsibility Decomposition and Optimal Human Escalation

Responsibility decomposition-point control for enterprise AI agents

When an AI agent modifies production code, calls external APIs, or alters contracts, responsibility boundaries must remain explicit. This paper formalizes fail-closed gates as a core architectural primitive for responsibility decomposition in multi-agent systems. We derive gate configurations via constrained optimization and use internal simulations to illustrate how a 30/70 human-agent ratio can preserve responsibility coverage while reducing decision latency versus full human review.

fail-closedagent-governanceresponsibility-gatesrisk-scoringHITLoptimization