Agent Workflow Diagram
Visual infrastructure and workflow overview for MUTA, based on the shared frontend and split-model execution path
This page provides a visual sketch of the current MUTA runtime and execution flow.
The diagram below reflects the currently preferred architecture view: users start and inspect runs in the shared frontend, execution happens on the T4 runner, and the split-model path delegates grounding and reasoning to separate hosts.

Split Responsibilities
The runner host keeps the user-facing control plane and the browser execution environment together in one place.
The split-model responsibilities are:
- Grounding / vision on A40: receives screenshots and visual context, then returns grounded visual information.
- Thinking / reasoning on L4: produces planning, judgement, and step-level reasoning.
- Runtime on T4: orchestrates the loop, drives the browser, and persists artifacts.
Notes
- The shared frontend is the central user entry point for MUTA.
- The runner on T4 is the execution host for the standard Surfer H path.
- The A40 host handles visual grounding.
- The L4 host handles reasoning.