Skip to main content
  1. The Orchestration Layer/

Executive Summary: How a Request Becomes an Action

·737 words·4 mins

BMT-02.04 Executive Summary
#

BlueMirror.tech | May 2026
#

Margaret Chen says twelve words at 3:14 in the afternoon: “I think my blood pressure medication is making me dizzy.” The system has roughly 500 milliseconds to produce a response that is medically responsible, emotionally appropriate, calibrated to Margaret’s communication preferences, and aware that her daughter Sarah is listed as the primary caregiver contact for health concerns. The clock starts.

The article traces what happens next, step by step.

Step 1 is intent classification. The Speech-to-Intent model, 50 million parameters in Zone 1, converts Margaret’s sentence to a structured intent vector in about 40 milliseconds. The Voice Tone Analyzer runs in parallel and confirms a conversational register rather than distress. Raw audio never leaves Zone 1. The Intent Classifier in Zone 2, a 150-million-parameter mixture-of-experts model, then categorizes: domain healthcare, sub-domain medication side effects, urgency moderate, confidence 0.92. When classifier confidence falls below 0.85, the architecture triggers multi-path processing to hedge the categorization. At 0.92, single-path proceeds. Total time elapsed: 50 milliseconds.

Step 2 is context routing. The MoC Router selects four layers in 22 milliseconds. Layer 0 (core identity) always loads. Layer 1 (session context) loads. Layer 2 (historical patterns, including a metformin timing question from the prior month) loads. Layer 3 (full medication list, six months of blood pressure readings, Dr. Patel as cardiologist, last cardiology visit March 4) loads. Layer 4, retrieval-augmented generation, does not activate. Total context package: approximately 800 tokens, against a naive full-context load of approximately 5,000 tokens. Token reduction: 84%. Cumulative time: 72 milliseconds.

Step 3 is H-layer delegation. The orchestrator receives the classified intent and context package and makes delegation decisions in roughly 45 milliseconds. Primary delegation goes to the Health Concierge and its Medication Manager. Supporting delegations fire in parallel: the Symptom Monitor checks for dizziness patterns in recent vital signs, the Cognitive State Assessor checks whether this is a cognitive concern presenting as a medication one. The H-layer also evaluates the Human Agency Scale: Margaret’s healthcare autonomy is 0.6, meaning observational responses can proceed autonomously but recommendations involving irreversible action require her approval. Cumulative time: 117 milliseconds.

Step 4 is infrastructure agent execution, three parallel paths. The Medication Advisor identifies amlodipine and furosemide as a documented cause of dizziness, confidence 0.88, latency 73 milliseconds. The Symptom Monitor finds that Margaret’s blood pressure has trended from 135/82 to 118/72 over seven days, consistent with the complaint, confidence 0.91, latency 58 milliseconds. The Cognitive State Assessor confirms normal orientation and articulate speech, ruling out a cognitive presentation, confidence 0.94, latency 51 milliseconds. All three complete within the ceiling set by the slowest path. Cumulative time: 190 milliseconds.

Step 5 is response synthesis. The Response Generator, a 400-million-parameter transformer, receives the three structured results plus Margaret’s P-RLHF preference profile: data first, recommendation second, direct language, no medical disclaimers that read as liability protection. The generated response names the medication interaction, cites the blood pressure trend with specific numbers, names Dr. Patel, offers to prepare a summary, and flags positional dizziness as warranting earlier escalation. The Safety Filter validates the response in 19 milliseconds at the Zone 1 boundary. The Empathy Responder confirms the response register matches Margaret’s concerned-but-not-distressed state. Cumulative time: 286 milliseconds.

Step 6 is delivery and learning. The response reaches Margaret in 47 more milliseconds. In parallel, the P-RLHF system logs the interaction for preference learning. The Audit Trail Logger records every component activation, model invocation, context layer loaded, and response generated, cryptographically signed, available for after-the-fact reconstruction. Total elapsed time: 333 milliseconds.

At Phase 1 launch, every step in this trace runs through Zone 3 for every subscriber. The orchestration decomposition is identical. The latency profile is slower because each inference step crosses the network rather than running locally or regionally. As Phase 2 and Phase 3 bring Zone 1 and Zone 2 online for relevant subscribers, those subscribers’ trace shifts toward the target above. For Zone 3-only subscribers, the trace remains Zone 3-anchored throughout, with Zone 3 as the permanent inference substrate under the healthcare data processing agreement.

The article also covers three failure modes: intent misclassification triggering multi-path processing, a stale medication database producing a flagged-uncertainty response rather than a confident wrong answer, and missing vital signs data producing a graceful degradation rather than a fabricated trend.

The full article, including complete failure mode enumeration and cascade rules when multiple steps degrade simultaneously, is at BlueMirror.tech.