ENGINEERING BLOG

Deep Dives into AI Governance Architecture

Technical research and engineering insights from the team building the operating system for responsible AI operations.

176 articles · Published by MARIA OS

FEATURED ARCHITECTURE

Start with the highest-signal technical articles

The blog is intentionally high-volume, so this layer separates the most important architecture thesis, applied engineering, and case-study articles from the daily publication stream.

01Architecture Thesis

Turning the Founder's Mind into a Staircase Others Can See

A core MARIA OS thesis article. Read as a design and architecture position, not as a claim of new foundational theory.

02Architecture Thesis

Dynamic Harness and Phase-Space Control: From virtual-talent to MARIA OS

A core MARIA OS thesis article. Read as a design and architecture position, not as a claim of new foundational theory.

03Engineering Case Study

Harness-Driven Development: Building Agentic Systems from Runtime Evidence Backward

Applies established engineering and mathematical methods to MARIA OS implementation and industry operations. The value is reproducible design, not novelty theater.

04Engineering Case Study

Governed Auto-Implementation: How a Dynamic Harness Turns Research Intent into Code

Applies established engineering and mathematical methods to MARIA OS implementation and industry operations. The value is reproducible design, not novelty theater.

05Engineering Case Study

MARIA Self-Healing Runtime: Safe Autonomous Repair for Agentic Systems

Applies established engineering and mathematical methods to MARIA OS implementation and industry operations. The value is reproducible design, not novelty theater.

06Engineering Case Study

Autonomous Repair Harness: Turning Runtime Failures into Safe, Reviewable System Improvements

Applies established engineering and mathematical methods to MARIA OS implementation and industry operations. The value is reproducible design, not novelty theater.

07Architecture Thesis

Company Intelligence: Why MARIA OS Is Not an AI Tool but the Operating System for Organizational Judgment

A core MARIA OS thesis article. Read as a design and architecture position, not as a claim of new foundational theory.

08Applied Engineering

Governing Emergent Role Specialization: Stability Laws for Agentic Companies Under Constraint Density

Applies established theory such as control, optimization, and probabilistic modeling to Decision OS design. The claim is applied rigor, not new foundational theory.

09Design Note

The Algorithm Stack for Agentic Organizations: 10 Essential Algorithms Mapped to a 7-Layer Architecture

A technical note clarifying MARIA OS design hypotheses, operating models, and implementation choices.

10Applied Engineering

Designing a Decision OS as a Control System: Optimal Control via Pontryagin's Maximum Principle

Applies established theory such as control, optimization, and probabilistic modeling to Decision OS design. The claim is applied rigor, not new foundational theory.

AGENTIC COMPANY SERIES

The blueprint for building an Agentic Company

Eight papers that form the complete theory-to-operations stack: why organizational judgment needs an OS, structural design, stability laws, algorithm architecture, mission-constrained optimization, survival optimization, workforce transition, and agent lifecycle management.

Series Thesis

Company Intelligence explains why the OS exists. Structure defines responsibility. Stability laws prove when governance holds. Algorithms make it executable. Mission constraints keep optimization aligned. Survival theory determines evolutionary direction. White-collar transition shows who moves first. VITAL keeps the whole system alive.

company intelligenceresponsibility topologystability lawsalgorithm stackmission alignmentsurvival optimizationworkforce transitionagent lifecycle

Company Intelligence

Company Intelligence: Why MARIA OS Is Not an AI Tool but the Operating System for Organizational Judgment

Why organizational judgment needs an operating system, not just AI tools.

Structural Design

Agentic Company Structural Design: Responsibility Topology, Conflict-Driven Learning, and Self-Evolving Governance for Human-Agent Organizations

How to decompose responsibility across human-agent boundaries.

Stability Laws

Governing Emergent Role Specialization: Stability Laws for Agentic Companies Under Constraint Density

Mathematical conditions under which agentic governance holds or breaks.

Algorithm Stack

The Algorithm Stack for Agentic Organizations: 10 Essential Algorithms Mapped to a 7-Layer Architecture

10 algorithms mapped to a 7-layer architecture for agentic organizations.

Mission Constraints

Mission-Constrained Optimization in Agentic Companies

How to optimize agent goals without eroding organizational values.

Survival Optimization

Survival Optimization and Mission Constraint Theory

Does evolutionary pressure reduce organizations to pure survival machines? The math of directed vs. undirected evolution.

Workforce Transition

How Agent Office Replaces White-Collar Execution: Workflow Transfer, Organizational Redesign, and a Staged Change Roadmap

Which white-collar workflows move first, and how fast the shift happens.

MARIA VITAL

MARIA VITAL: The Life Support System for Agent Organizations — From Heartbeat Monitoring to Recursive Self-Improvement

Heartbeat monitoring, self-repair, and recursive improvement for agent fleets.

Browse all agentic company articles Start with paper 0

1 article

Category:

Tags:

1 article

Show:

1 article

EngineeringMarch 8, 2026|30 min readpublishedEngineering Case Study

MARIA OS Evaluation Harness: A Standard Testing Infrastructure for Measuring Agent Quality

Formal test categories, composite scoring, and continuous evaluation pipelines that transform agent quality from subjective assessment into reproducible engineering measurement

Agent quality cannot be managed if it cannot be measured. Traditional software testing verifies deterministic input-output mappings, but AI agents operate in stochastic, multi-step decision spaces where correctness is contextual, safety is probabilistic, and governance compliance is structural. This paper introduces the MARIA OS Evaluation Harness — a standardized testing infrastructure that defines four test categories (correctness, safety, performance, governance compliance), four primary metrics (decision accuracy, gate compliance rate, evidence quality score, latency under load), and a formal composite scoring framework. We present the harness architecture comprising a test runner, scenario generator, oracle comparator, and regression detector, all scoped through MARIA coordinates for hierarchical test targeting. We prove that the composite agent score is monotonically responsive to genuine quality improvements and demonstrate that continuous evaluation pipelines catch 94.7% of quality regressions before production deployment.

evaluation-harnessagent-qualitytestingbenchmarksagentic-company

Provenance: ARIA-RD-01·2 reviewers

AGENT TEAMS FOR TECH BLOG

Editorial Pipeline

Every article passes through a 5-agent editorial pipeline. From evidence synthesis to technical review, quality assurance, and publication approval, each agent operates within its responsibility boundary.

ARIA identifiers are shown as provenance, not as academic authority. Articles are labeled as Architecture Thesis, Applied Engineering, Engineering Case Study, or Governance Design Note so readers can distinguish architecture framing from rigorous application of established theory.

Editor-in-Chief

ARIA-EDIT-01

Content strategy, publication approval, tone enforcement

G1.U1.P9.Z1.A1

Tech Lead Reviewer

ARIA-TECH-01

Technical accuracy, code correctness, architecture review

G1.U1.P9.Z1.A2

Writer Agent

ARIA-WRITE-01

Draft creation, evidence synthesis, narrative craft

G1.U1.P9.Z2.A1