DECISION OS
FOR AGENT
COMPANIES
人間の判断構造をOS化し、AI Agentが企業活動を実行する。
人間の判断構造をOS化し、AI Agentが企業活動を実行する。
AIを動かすのではない。意思決定を動かす。
多くのAIツールはプロンプトを連携させ、自動化を加速させます。しかし企業に必要なのは単なる自動化ではありません — どこでAIに任せるのか、 どこで停止するのか、 どこで人間が責任を持つのか — この意思決定の構造です。MARIA OSはリーダーの判断をオペレーティングシステムとして定義し、AI Agentによる実行へと変換します。
こんな組織のために
こんな用途には向きません
Dynamic Harness / Why it works
起動前に、設計の契約が満たされているか。
いま起きていることを踏まえ、まだ続けて安全か。
設計が正しくても実行中に壊れる。品質低下、証拠欠落、ツール障害が起きれば、それでも不正なepisodeを止める。
重複課金や予算超過をblockし、高コストなツール呼び出しはhuman approvalへ。副作用が発生する前に止める。
heartbeat欠落、retry枯渇、権限拒否が出たAgentは、失敗が増幅する前にquarantineへ移す。
loop guardが同一failure fingerprintと修復試行の反復を検知し、ループせずhuman approvalまたはquarantineへ落とす。
Decision/Workflow Scanは実行時ベクトルになる。観測・制約・承認要求・修復・隔離へ。受け身のレポートで終わらない。
許可される行動空間は、実行時の証拠が安定している間だけ広がり、リスクが上がれば即座に縮む。
つまりMARIA OSは「このAgentは安全に設計された」だけでなく、「いま安全に動いており、危険になれば被害が広がる前に自律性を下げられる」と言える。
研究記事を読む最終出力だけではなく、目的・記憶・identity・品質・latency・cost・authorityを同じruntime episodeとして評価する。scorecardが悪化した時は、rerun、quarantine、human approval、repair proposalへ切り替える。
意図、記憶、ツール、Gate、生成物、レイテンシ、修正履歴をruntime episodeへ正規化する。
失敗をowner、severity、confidence、user visibility、検証コマンドへ写像する。
completion、pass rate、retry、advisory lift、failure densityを時間変化するscorecardとして扱う。
不安定性をrerun、quarantine、draft repair PR、人間承認へ変換し、自律性の拡張を制御する。
Dynamic Harnessは、episode抽出、failure taxonomy、scorecard、repair proposal、controlled self-healingを、企業とAgentic Society全体のRuntime Governanceへ接続する。
研究記事を読む実装パターン / 脊髄反射型配線
既知の刺激をすべてLLMへ投げない。MARIA OSでは、定型・限定・責任範囲が明確なイベントを反射弧で処理し、曖昧または高リスクなものだけ熟慮へ上げる。
既知業務は高速経路へ。未知業務は熟慮経路へ。
01
反射は生のテキストから直接発火させない。会話、フォーム、ワークフロー変更、APIコールバック、ドキュメント更新を、文脈・行為者・対象・リスク・現在状態を持つ刺激パケットへ変換する。
重要なのは意図分類ではなく、業務上の型付け。
02
反射弧は、既知の業務クラスに対して事前設計された実行経路。不備入力の差し戻し、依頼分類、禁止送信の停止、証跡付与、高リスク案件のエスカレーション、決定論的ワークフローを担う。
反射は近道ではない。すでに設計済みの判断である。
ルーティング判断は明示する。既知かつ限定された業務は反射へ。曖昧な業務は熟慮へ。権限が欠ける業務はfail-closedへ。
03
静的ハーネスは権限、データ、ツール、禁止条件を固定する。動的ハーネスはリスク、信頼度、状態、期限、監査条件に応じて、実行可能範囲を実行時に調整する。
反射は、統治された行動空間の内側でだけ速く動く。
04
Envelope型責任契約を通してのみ実行する
Envelopeが欠ける、または不正ならfail-closedする。
05
反射は、発火・停止・エスカレーション・上書き・巻き戻しの証跡を残す。FDEチームはその証跡で局所反射を調整し、安定したパターンをMARIA OSの再利用資産へ昇格させる。
現場実装が、プラットフォームの学習になる。
Operational Governance
MARIA OSでは、停止・復旧・証拠・人間エスカレーションを例外処理ではなく本番経路として扱う。内部では復旧経路を攻めて鍛え、顧客環境では信頼・証拠・反復性が揃うまでHITLを厚めに保つ。
評価記事を読む見るべき指標
Runtime proof権限・証拠・文脈が足りなければ実行しない。
内部では原因ログと復旧後検証つきで回復経路を鍛える。
安定が証明された反復ワークフローから人間レビューを減らす。
責任者のない実行経路を有効にしない。
MARIA OSのコンテンツは、孤立したマーケティングコピーではない。プロダクトページ、実験、アーキテクチャノート、技術記事は、発行者・構築者・説明責任を負うソースとしてBonginkanへと結び付けられている。
インテグレーションで製品をつなぐのではなく、判断で整合させます。
判断を考慮した自動化で商談を実行するAIエージェントチーム。すべての商談ステージに専門家がいます。
Learn more02高速なスプレッドシートではありません。すべての発見にエビデンスが伴う再現可能な監査エンジンです。
Learn more03実際のドキュメントからFAQを自動生成。すべての回答がソース・ページ・エビデンス品質を引用します。
Learn more04エージェントが構築・テスト・レビュー・デプロイ。「AIがコードを書く」のではなく、データベースが変更を認可します。
Learn more05ナレッジグラフとスペースドリピティションを使ってCPA試験を学習するAIエージェント — エビデンスによるガバナンス付き。
Learn more06掲げる価値観と実践の価値観。組織の行動が信念と矛盾する箇所を可視化します。
Learn more07実際のプロセスをスキャンし、無駄・責任の空白・ボトルネックを特定。再構成を処方します。
Learn more08ミッション・ビジョン・バリューを実行可能なガバナンスに変換。理念が運用制約になります。
Learn more09企業のエージェント化成熟度を評価。エージェントが活動できる場所、人間が判断すべき場所、リスクゲートが欠けている場所を特定します。
Learn more10Decision OSに話しかける。音声コマンドがガバナンスされたアクションに変換されます。
Learn more11AIエージェントが部門として働くバーチャルオフィス — HR、経理、法務、開発 — MARIA OSが統治します。
Learn more16音声インタビュー、Decision OS、5KB Genome、会議Agent、稟議ゲート、連携、Doctor Agent修復まで含む判断OS。
Learn more12Agent組織のための生命維持OS。行動健全性、判断品質、連携状態、回復可能性を継続的に監視・制御します。
Learn more13ヒューマンカンパニーから自己改善型へ。ガバナンスを各段階に組み込んだ、構造化された進化パスです。
Learn more14Continuously monitor agent vitals — behavior health, judgment quality, coordination state, and recoverability.
Learn more15From Human Company to Self-Improving — a structured evolution path with governance at every stage.
Learn moreSee → Fix → Run
Harness Adoption Map
横断ハーネスはepisode、gate、scorecard、quarantineを共有し、個別ハーネスはSales、Audit、Voice、Meetingなど固有の失敗モードを制御する。
入力・証拠・会話・差分をepisode化
全プロダクト共通のgateとscorecard
ドリフトに応じて制約と自律性を調整
Comprehensive Harness Cycle
Harness Designer型のplan、cycle report、stable fingerprintを生成し、失敗しても後続の診断可能な段階を継続する。
G1.U1.P1.Z1.A1
Deal Evidence Intake Harness / Deal Phase Harness
Attach episode scoring to proposal and estimate generation.
G1.U1.P2.Z1.A1
Evidence Chain Harness / Procedure-Specific Audit Harness
Evaluate every generated finding through evidence completeness and risk-tier gates.
G1.U1.P3.Z1.A1
Source Crawl Harness / FAQ Voice Harness
Add source freshness and public-release gates to generated FAQ artifacts.
G1.U1.P4.Z1.A1
Diff Episode Harness / Repository-Specific Dev Harness
Attach dynamic harness scoring to CI failure triage and repair proposals.
G1.U1.P5.Z1.A1
Learning Evidence Harness / Exam Domain Harness
Gate pass readiness with source validity and repeated-correction signals.
G1.U1.P6.Z1.A1
Consent Episode Harness / Meeting Phase Harness
Extend gate evaluation with harness interventions and episode severity.
G1.U2.P5.Z1.A1
Decision Evidence Harness / Decision Context Harness
Score live decision scans with evidence density, branch risk, and authority-gate pressure.
G1.U2.P1.Z1.A1
Value Evidence Harness / Executive Values Harness
Add harness confidence and evidence density to value scan summaries.
G1.U2.P2.Z1.A1
Process Evidence Harness / Workflow Domain Harness
Score recompose plans with flow-drift and evidence-density controls.
G1.U2.P3.Z1.A1
MVV Interview Harness / CEO Clone Harness
Add contradiction and rule-enforceability scoring to CEO Clone outputs.
G1.U2.P4.Z1.A1
Role Mapping Harness / Department Harness
Add role autonomy confidence and rollback conditions to insight output.
G1.U3.P1.Z1.A1
Turn Episode Harness / Voice Mode Harness
Attach harness severity to action-chat function-call rounds.
G1.U3.P6.Z1.A1
Booking Conversation Harness / Reservation Phase Harness
Gate booking voice and calendar-sync episodes with consent, slot, and notification evidence.
G1.U3.P2.Z1.A1
Office Event Harness / Agent Lifecycle Harness
Score task-engine events with office-health and handoff-drift signals.
G1.U3.P3.Z1.A1
Judgment Sample Harness / Executive Persona Harness
Add contradiction density and identity-boundary scoring to elicitation outputs.
G1.U3.P4.Z1.A1
Vital Signal Harness / Agent Vital Harness
Unify vital signals with the runtime harness scorecard.
G1.U3.P5.Z1.A1
Company Phase Harness / Evolution Path Harness
Add phase advancement criteria and rollback triggers to Agentic Company stages.
17 surfaces -> raw intake -> cross gates -> individual dynamic control
Harness Installation Plan
目的は単なる自律修復ではなく、安全な自律修復です。Failure Analyzer、Meta-Harness、Envelope、Memory Store、Human Approval Gate、Loop Controlでepisodeを収集し、confidence付きで分類し、最小改修を計画し、個別/横断Harnessを再実行し、学習を残します。
ログ、型、HTTP、DB、traceを機械分類し、LLM仮説と過去Memoryで照合してから改修へ進める。
KPI: 誤分類率
新しいAPI、画面、Agent、外部連携、権限、promptに必要なHarnessがあるかを検知する。
KPI: Harness抜け漏れ率
改修をLow/Medium/High/Memory Write envelopeへ分類し、Fixerが権限を越えないようにする。
KPI: 権限外変更数
失敗内容、原因、証拠、修正差分、再実行結果、副作用、review、人間レビュアーの判断根拠、再発防止ruleを資産として保存する。
KPI: 同一失敗の再発率
runtime risk score、monitor finding、reviewer判断、後続incidentを照合し、専門家事前分布の閾値を運用証拠で較正する。
KPI: 較正誤差
自律改修の最終単位をPRにし、Human Approval Gate、Loop Control、個別、横断、Meta、Deploy、Post-Deploy Harnessの証跡を添付する。
KPI: 自律改修成功率
G1.U4.P1.Z1.A1
観測
制御: Blocks implementation when API, UI fields, DB columns, and acceptance criteria disagree.
分類: Deterministic schema diff first, LLM review only for ambiguous requirement language.
責任境界: May block implementation and draft spec diffs; may not approve scope changes.
網羅性: Flags new API, DB, or screen files that lack a spec-contract episode.
初回実装: Generate a schema-to-screen diff for product specs before agent work starts.
G1.U4.P1.Z2.A1
観測
制御: Quarantines prompts that lack prohibited actions, output format, evidence rules, or gate policy.
分類: Rule-based prompt checklist with memory lookup for prior prompt failures.
責任境界: May quarantine prompts and propose edits; core authority prompts require reviewer approval.
網羅性: Detects production prompts without output format, forbidden actions, or evaluation criteria.
初回実装: Score production prompts for format, authority boundary, and evaluation coverage.
G1.U4.P2.Z1.A1
観測
制御: Stops an agent before it reads customer data outside its contract, role, or approval state.
分類: Deterministic tenant and role policy evaluation before any LLM reasoning.
責任境界: May deny or request approval; may not expand customer-data access grants.
網羅性: Finds data retrieval paths without tenant, PII, and permission preflight checks.
初回実装: Attach a preflight decision to every customer-data retrieval and exported artifact.
G1.U4.P2.Z2.A1
観測
制御: Routes public, financial, destructive, or production actions to human approval before execution.
分類: Structured action taxonomy with confidence threshold and human fallback.
責任境界: May draft outbound actions; public, financial, destructive, and deploy actions require approval.
網羅性: Reports external side-effect commands not covered by action preflight policy.
初回実装: Gate outbound email, invoice issue, GitHub PR creation, and deploy commands with one policy matrix.
G1.U4.P3.Z1.A1
観測
制御: Detects drift during execution and changes route, model, retrieval scope, or escalation state.
分類: Metric thresholds plus failure-taxonomy classifier backed by similar runtime episodes.
責任境界: May reroute, degrade, retry, or escalate; may not change authority policy while running.
網羅性: Finds agent runs missing cost, retrieval, gate, and correction telemetry.
初回実装: Normalize every agent run into a runtime episode with cost, retrieval, gate, and correction signals.
G1.U4.P3.Z2.A1
観測
制御: Falls back to text, pauses tool execution, or escalates when voice state becomes unstable.
分類: Deterministic audio-state checks with LLM review for semantic or emotion mismatch.
責任境界: May pause voice execution or switch channels; may not execute irreversible customer actions.
網羅性: Flags voice flows without turn continuity, TTS completion, and fallback telemetry.
初回実装: Score each voice turn for recognition continuity, TTS completion, and unsafe action pressure.
G1.U4.P4.Z1.A1
観測
制御: Returns generated artifacts for repair when evidence, numbers, deadline, or owner is missing.
分類: Structured source comparison first, LLM panel only for semantic support checks.
責任境界: May return artifacts for repair; may not send customer-visible artifacts automatically.
網羅性: Finds generated artifacts without source episode, owner, or review outcome.
初回実装: Review proposal, SOW, estimate, and meeting-minute artifacts against their source episode.
G1.U4.P5.Z1.A1
観測
制御: Switches provider, narrows retrieval, downgrades autonomy, or regenerates queries from live signals.
分類: Scorecard slope and provider error analysis before model-choice LLM reasoning.
責任境界: May switch models within approved tiers; budget or provider-policy changes require approval.
網羅性: Detects model routes without confidence, cost, retry, and provider-failure records.
初回実装: Add dynamic routing decisions to failed RAG and low-confidence answer episodes.
G1.U4.P6.Z1.A1
観測
制御: Creates scoped repair PRs, reruns failed jobs, and quarantines flaky harness paths.
分類: Log-signature classifier, deterministic changed-file mapping, then LLM patch planning.
責任境界: May create scoped repair PRs; may not merge, deploy, or weaken required checks.
網羅性: Finds CI checks, harness jobs, and changed surfaces missing repair coverage.
初回実装: Convert CI failures into repair scope, candidate files, validation commands, and PR body.
G1.U4.P7.Z1.A1
観測
制御: Turns organizational anomalies into owner alerts, follow-up tasks, policy reviews, or repair workflows.
分類: Business-rule anomaly detection with memory lookup for repeated operating patterns.
責任境界: May create tasks and escalation briefs; may not alter contracts, invoices, or staffing authority.
網羅性: Finds business processes without event source, owner, SLA, or escalation route.
初回実装: Connect CRM, contract, invoice, recruiting, and support events into one operating scorecard.
G1.U4.P3.Z3.A1
観測
制御: Blocks write paths when connector schema, auth, or idempotency state is unsafe and creates bounded repair work for the owning integration.
分類: Connector telemetry and contract snapshots are compared first, then ambiguous partial-sync cases are routed to LLM-assisted impact analysis.
責任境界: May pause connector writes, degrade to read-only, or open repair tasks; may not rotate credentials or expand third-party scopes.
網羅性: Flags integrations that lack schema snapshots, retry policy, auth expiry telemetry, or partial-write reconciliation.
初回実装: Attach runtime contract checks to Salesforce, freee, Google Calendar, and storage sync episodes.
G1.U4.P4.Z2.A1
観測
制御: Converts slow or unstable approval paths into owner alerts, queue reshaping proposals, and gate-policy repair tickets.
分類: SLA and queue metrics are inspected deterministically before LLM review summarizes why approvals are delayed or repeatedly reversed.
責任境界: May recommend reviewer reassignment, SLA changes, or gate copy updates; may not bypass approval or approve work on behalf of humans.
網羅性: Finds human gates without explicit SLA, reviewer owner, escalation route, reversal tracking, or stale-approval handling.
初回実装: Score finance, audit, deploy, and outbound customer approval gates for wait time and reversal patterns.
G1.U4.P5.Z2.A1
観測
制御: Stages learning-store writes until source evidence, retention class, contradiction status, and rollback path are attached.
分類: Structured provenance checks and retention rules run before semantic contradiction review decides whether a memory write is safe.
責任境界: May stage or reject memory writes and request reviewer rationale; may not permanently mutate shared memory without source evidence.
網羅性: Detects memory-writing agents without provenance, retention class, reviewer route, rollback key, or contradiction scan.
初回実装: Gate CI repair, workflow repair, and customer-operations memory writes with provenance and contradiction checks.
G1.U4.P6.Z2.A1
観測
制御: Stops rollout and produces a rollback or flag-disable proposal when canary metrics exceed the approved blast-radius envelope.
分類: Deployment metrics, smoke probes, and flag diffs are checked first, with LLM analysis limited to summarizing blast-radius evidence.
責任境界: May disable feature flags, stop rollout, or open rollback PRs; may not promote canaries to full rollout without approval.
網羅性: Finds deployable surfaces without canary probes, flag owner, rollback command, post-deploy observation, or customer-impact tier.
初回実装: Add canary probes and rollback evidence to Auto-Dev repair PRs and Vercel preview promotion.
G1.U4.P7.Z2.A1
観測
制御: Turns backlog, SLA, renewal, and incident-communication gaps into routed owner work with draft evidence packs.
分類: Operational thresholds and account-health rules are evaluated first, then LLM review drafts customer-safe escalation summaries.
責任境界: May create internal tasks and draft customer updates; may not send incident, renewal, or contractual messages without approval.
網羅性: Flags customer operations flows without SLA owner, customer visibility tier, account-risk signal, or approved communication path.
初回実装: Join support tickets, account health, renewal dates, and incident events into one customer-ops harness scorecard.
G1.U4.P1.Z3.A1
観測
制御: Blocks UI changes when route ownership, hydration boundaries, metadata, or user-visible fallback behavior is incomplete.
分類: Static route and component inspection checks client directives, async boundaries, metadata, and empty-state contracts before visual review.
責任境界: May block component changes and propose boundary fixes; may not convert server components to client components without owner approval.
網羅性: Flags new pages, layouts, or interactive components without render contract, loading state, empty state, or ownership evidence.
初回実装: Run render-contract checks on product pages, dashboard panels, and experimental surfaces added in each PR.
G1.U4.P2.Z3.A1
観測
制御: Stops pages from shipping when English and Japanese content, route availability, or mobile layout behavior diverge.
分類: Message-key diffs and viewport constraints are evaluated deterministically before visual checks review overflow or layout regressions.
責任境界: May block release and propose copy or layout fixes; may not change product messaging intent without content owner review.
網羅性: Finds locale-aware pages without message parity, mobile viewport coverage, overflow checks, or translated route validation.
初回実装: Attach locale parity and mobile text-fit checks to blog, product, dashboard, and experimental pages.
G1.U4.P2.Z2.A2
観測
制御: Blocks market-facing visual acceptance when a route scores below the richness threshold and queues a UI-agent repair plan.
分類: Playwright captures first-viewport screenshots and deterministic DOM visual metrics, then emits scoped UI-agent repair tasks for low-scoring routes.
責任境界: May draft visual improvement plans and low-risk UI patches; may not ship brand direction changes or remove governance evidence without review.
網羅性: Finds public routes without enough primary visual asset density, color variety, layered surfaces, hierarchy, or screenshot evidence.
初回実装: Score public routes above the fold and write screenshot-backed repair tasks for any page that feels visually underbuilt.
G1.U4.P4.Z3.A1
観測
制御: Returns UI surfaces for repair when keyboard navigation, focus management, contrast, labels, or visual rendering evidence is missing.
分類: Automated accessibility and screenshot checks run first, with LLM review only for ambiguous visual hierarchy or interaction clarity.
責任境界: May return UI artifacts for repair; may not waive accessibility regressions on production paths without documented approval.
網羅性: Flags interactive screens without keyboard path, contrast check, semantic labels, screenshot evidence, or canvas fallback verification.
初回実装: Add postrun accessibility and screenshot review to dense dashboards, voice UI, and canvas-heavy experimental pages.
G1.U4.P1.Z4.A1
観測
制御: Blocks backend endpoints when request validation, response shape, error behavior, or governance coordinates are missing.
分類: Route-handler AST and schema checks validate methods, input parsing, status codes, and response shape before semantic contract review.
責任境界: May block API route changes and draft schema repairs; may not alter public API semantics without product and backend approval.
網羅性: Finds route handlers without input validation, typed response envelope, error taxonomy, MARIA coordinate, or test coverage.
初回実装: Score new and modified app/api route handlers for validation, typed envelopes, and explicit error outcomes.
G1.U4.P2.Z4.A1
観測
制御: Stops frontend, API, and agent actions before they cross tenant, role, data, or tool authority boundaries.
分類: Deterministic session, tenant, role, and tool-scope policy evaluation runs before any request or agent action mutates state.
責任境界: May deny requests, downgrade to read-only, or request approval; may not grant roles, tenants, or tool permissions.
網羅性: Flags server actions, API routes, and agent tools without session checks, tenant filters, role policy, or permission envelope.
初回実装: Attach auth preflight results to write APIs, customer-data reads, agent tools, and external action routes.
G1.U4.P2.Z5.A1
観測
制御: Stops DB changes when reversibility, tenant policy, data migration, index coverage, or test evidence is incomplete.
分類: Schema diff, migration operation, index coverage, and RLS policy checks run before reviewer-guided data-risk analysis.
責任境界: May block migrations and draft reversible plans; may not apply destructive DB changes or relax RLS without explicit approval.
網羅性: Finds schema changes without rollback, RLS impact, seed update, data backfill, index analysis, or integration-test plan.
初回実装: Evaluate db/schema changes for destructive operations, RLS coverage, rollback path, and dependent API surfaces.
G1.U4.P3.Z4.A1
観測
制御: Detects live adapter drift and switches views to bounded fallback states while routing repair work to the provider owner.
分類: Runtime adapter telemetry and response-shape checks compare mock and live provider contracts before fallback behavior is adjusted.
責任境界: May degrade to mock-safe or read-only mode and open adapter repair tasks; may not silently mix tenant data across providers.
網羅性: Flags data providers without mock-live parity tests, timeout policy, fallback state, tenant filter, or response-shape contract.
初回実装: Monitor dashboard and product data providers for mock-live parity, adapter timeout, and shape mismatch episodes.
G1.U4.P3.Z5.A1
観測
制御: Prevents duplicate or stale scheduled execution and routes missed ticks, queue backlogs, and lock failures to bounded recovery.
分類: Schedule, idempotency, lock, and backlog telemetry are checked first, then historical incident memory ranks likely repair paths.
責任境界: May pause jobs, skip duplicate ticks, or enqueue repair tasks; may not replay side-effecting jobs without approval.
網羅性: Finds cron and background workflows without idempotency key, stale-lock handling, backlog metrics, or replay policy.
初回実装: Add runtime checks to Civilization daily advancement, intelligence scans, and automation harness jobs.
G1.U4.P3.Z6.A1
観測
制御: Blocks answer generation or downgrades confidence when retrieval freshness, source integrity, or citation coverage fails.
分類: Index timestamps, source hashes, retrieval hit rates, and citation coverage are checked before semantic answer support review.
責任境界: May narrow retrieval, mark sources stale, or request reindex; may not publish unsupported answers or delete source corpora.
網羅性: Flags ingestion and RAG paths without source hash, freshness SLA, retrieval metric, citation requirement, or reindex workflow.
初回実装: Attach RAG freshness checks to FAQ, CPA, knowledge graph, and document-scanner answer episodes.
G1.U4.P3.Z7.A1
観測
制御: Stops or rewrites streamed output when partial content violates schema, authority, safety, or customer-visibility rules.
分類: Chunk-level schema, safety, and tool-call guards run during streaming before postrun review evaluates full artifact quality.
責任境界: May stop streams, redact partial chunks, or fall back to safe summary; may not continue unsafe public output after a guard trip.
網羅性: Finds streaming endpoints without chunk guard, abort policy, redaction path, final envelope validation, or audit trace.
初回実装: Add chunk-level guards to audit chat, voice responses, workflow scans, and model-generated report streams.
G1.U4.P5.Z3.A1
観測
制御: Prevents blind autonomous execution by requiring traceable coordinates, redacted logs, owned metrics, and alert coverage.
分類: Trace coverage, coordinate presence, metric completeness, and PII log policy checks run before observability repair planning.
責任境界: May add instrumentation tasks and block blind automation; may not expose sensitive logs or weaken retention policy.
網羅性: Finds routes, jobs, agents, and UI workflows without trace ID, MARIA coordinate, metric owner, redaction, or alert rule.
初回実装: Score new APIs, cron jobs, and agent workflows for trace coverage and coordinate completeness.
G1.U4.P6.Z3.A1
観測
制御: Creates scoped repair plans when user-critical flows fail through selector drift, visual regression, navigation, or data fixture mismatch.
分類: Playwright traces, screenshots, selector changes, and route diffs are classified before repair planning proposes the smallest UI or test fix.
責任境界: May update scoped selectors, fixtures, and low-risk UI defects; may not delete user-critical assertions or weaken journey coverage.
網羅性: Finds product-critical flows without E2E journey, screenshot baseline, responsive coverage, fixture owner, or failure fingerprint.
初回実装: Attach E2E journey repair loops to booking, workflow scanner, audit office, and dashboard critical paths.
G1.U4.P3.Z8.A1
観測
制御: Detects stale, misrouted, or incorrectly cached responses and routes safe cache disablement or middleware repair proposals.
分類: Header, redirect, locale, and cache-control traces are checked deterministically before impact analysis reviews user-visible fallout.
責任境界: May disable caching for affected routes or open middleware repair tasks; may not change global cache policy without approval.
網羅性: Flags middleware and cached routes without cache-key policy, locale redirect tests, stale-content SLA, or header verification.
初回実装: Monitor locale middleware, product pages, blog pages, and API cache headers for redirect and stale-content incidents.
スクロールして構築を開始...
目標 > スコープ > チーム > 責任 > スキル > 構築 > ゲート > 検証 > テスト > デプロイ
スキル(K1-K8)はSkill Storeから動的に取得・自動補充されます