Agent Workflow Diagram

Visual infrastructure and workflow overview for MUTA, based on the shared frontend and split-model execution path

This page provides a visual sketch of the current MUTA runtime and execution flow.

The diagram below reflects the currently preferred architecture view: users start and inspect runs in the shared frontend, execution happens on the T4 runner, and the split-model path delegates grounding and reasoning to separate hosts.

Surfer H server deployment overview

Split Responsibilities

The runner host keeps the user-facing control plane and the browser execution environment together in one place.

The split-model responsibilities are:

  • Grounding / vision on A40: receives screenshots and visual context, then returns grounded visual information.
  • Thinking / reasoning on L4: produces planning, judgement, and step-level reasoning.
  • Runtime on T4: orchestrates the loop, drives the browser, and persists artifacts.

Notes

  • The shared frontend is the central user entry point for MUTA.
  • The runner on T4 is the execution host for the standard Surfer H path.
  • The A40 host handles visual grounding.
  • The L4 host handles reasoning.