The 7 Fusion Operations
1. Navigate + Observe
Fuse navigation with observation in one server call instead of two. The server calls page.goto(), then runs handleAgentObserve in-process, returning both the page state and interactive elements in a single response.
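A minimal sketch of the fused flow, assuming a simplified page interface. `PageLike`, `Observation`, `observePage`, `navigateAndObserve`, and `FakePage` are hypothetical names for illustration; the real handleAgentObserve signature and response shape may differ.

```typescript
// Hypothetical shapes for illustration; the real server types may differ.
interface PageLike {
  goto(url: string): Promise<void>;
  url(): string;
}

interface Observation {
  url: string;
  elements: string[];
}

// Stand-in for handleAgentObserve: assumed to read its state from the page.
async function observePage(page: PageLike): Promise<Observation> {
  return { url: page.url(), elements: ["link:Home", "button:Sign in"] };
}

// Fused handler: navigation and observation run in one call, in-process,
// so the caller pays for a single roundtrip.
async function navigateAndObserve(page: PageLike, url: string): Promise<Observation> {
  await page.goto(url);
  return observePage(page);
}

// Fake page that records how many navigations happened.
class FakePage implements PageLike {
  gotoCalls = 0;
  private current = "about:blank";

  async goto(url: string): Promise<void> {
    this.gotoCalls += 1;
    this.current = url;
  }

  url(): string {
    return this.current;
  }
}

const fakePage = new FakePage();
navigateAndObserve(fakePage, "https://example.com/").then((obs) => {
  console.log(obs.url, fakePage.gotoCalls);
});
```

The caller makes one request and receives the observation for the page it just navigated to, rather than issuing a navigate call followed by a separate observe call.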
2. Act + Post-Observe
Get an observation after executing actions, without an extra roundtrip. In MCP, pass postObserve: true in the act tool parameters.
3. Act + Pre-Observe
Capture page state before the action executes. Useful for diffing before/after states. In MCP, pass preObserve: true in the act tool parameters. The server calls handleAgentObserve before the step loop executes.
Pre-observe runs BEFORE steps execute (fixed in v0.6.0). Earlier versions had a bug where
pre-observe ran after all steps completed.
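The ordering of pre-observe, the step loop, and post-observe can be sketched as follows. This is a simplified synchronous model, not the server's implementation: `actWithObserve`, `ActOptions`, and `ActResult` are hypothetical names, and the `observe` and `runStep` callbacks stand in for handleAgentObserve and the step executor.

```typescript
interface ActOptions {
  preObserve?: boolean;
  postObserve?: boolean;
}

interface ActResult {
  preObservation?: string;
  postObservation?: string;
  stepsRun: number;
}

// Simplified synchronous model of an act handler (assumption: the real one is async).
function actWithObserve(
  steps: string[],
  options: ActOptions,
  observe: () => string,
  runStep: (step: string) => void,
): ActResult {
  const result: ActResult = { stepsRun: 0 };

  // Pre-observe runs before the step loop, matching the v0.6.0 behavior.
  if (options.preObserve) result.preObservation = observe();

  for (const step of steps) {
    runStep(step);
    result.stepsRun += 1;
  }

  // Post-observe runs once, after every step has executed.
  if (options.postObserve) result.postObservation = observe();

  return result;
}

// Usage: the fake "page state" is just the number of steps applied so far.
const applied: string[] = [];
const outcome = actWithObserve(
  ["fill #email", "click #submit"],
  { preObserve: true, postObserve: true },
  () => `state after ${applied.length} steps`,
  (step) => {
    applied.push(step);
  },
);

console.log(outcome.preObservation); // state after 0 steps
console.log(outcome.postObservation); // state after 2 steps
```

Having both observations in one response is what makes before/after diffing possible without two extra observe calls.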
4. Incremental Observe
Only return changes since the last observation instead of the full element list. Pass incremental: true to the observe tool.
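One way to picture an incremental observation is as a diff between two element lists keyed by ref. The delta format is not specified here, so the added/removed/changed split below is an assumption, as are the names `ElementInfo`, `ObservationDelta`, and `diffObservations`.

```typescript
interface ElementInfo {
  ref: string;
  role: string;
  name: string;
}

// Hypothetical delta shape; the real incremental response may differ.
interface ObservationDelta {
  added: ElementInfo[];
  removed: string[]; // refs that are no longer present
  changed: ElementInfo[];
}

function diffObservations(previous: ElementInfo[], current: ElementInfo[]): ObservationDelta {
  const before = new Map<string, ElementInfo>();
  for (const el of previous) before.set(el.ref, el);

  const delta: ObservationDelta = { added: [], removed: [], changed: [] };
  const seen = new Set<string>();

  for (const el of current) {
    seen.add(el.ref);
    const old = before.get(el.ref);
    if (old === undefined) {
      delta.added.push(el);
    } else if (old.role !== el.role || old.name !== el.name) {
      delta.changed.push(el);
    }
  }

  for (const el of previous) {
    if (!seen.has(el.ref)) delta.removed.push(el.ref);
  }

  return delta;
}

const lastObservation: ElementInfo[] = [
  { ref: "e1", role: "button", name: "Submit" },
  { ref: "e2", role: "link", name: "Help" },
];
const newObservation: ElementInfo[] = [
  { ref: "e1", role: "button", name: "Submitting" },
  { ref: "e3", role: "textbox", name: "Email" },
];

const delta = diffObservations(lastObservation, newObservation);
console.log(delta.added.map((el) => el.ref)); // [ 'e3' ]
console.log(delta.changed.map((el) => el.ref)); // [ 'e1' ]
console.log(delta.removed); // [ 'e2' ]
```

On pages where only a few elements change between steps, a delta like this is much smaller than the full element list.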
5. Response Tiers
Control how much data the observe response includes:

| Tier | What is included | Use case |
|---|---|---|
| full | Interactive elements + accessibility tree + screenshot | First observation of a page |
| interactive | Interactive elements only (skips tree and screenshot) | Mid-workflow checks |
| minimal | Elements stripped to {ref, role, name, selector, tagName} | Tight token budgets |
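The tiers can be read as successive filters over a full observation. The sketch below assumes that the minimal tier also omits the tree and screenshot, which the table does not state explicitly. `applyTier`, `FullObservation`, and the extra `attributes` field are hypothetical.

```typescript
type Tier = "full" | "interactive" | "minimal";

interface ObservedElement {
  ref: string;
  role: string;
  name: string;
  selector: string;
  tagName: string;
  // Hypothetical extra field that the minimal tier would strip.
  attributes?: Record<string, string>;
}

interface FullObservation {
  elements: ObservedElement[];
  accessibilityTree: string;
  screenshot: string;
}

interface TieredObservation {
  elements: ObservedElement[];
  accessibilityTree?: string;
  screenshot?: string;
}

function applyTier(obs: FullObservation, tier: Tier): TieredObservation {
  switch (tier) {
    case "full":
      return obs;
    case "interactive":
      // Keep elements as-is, drop the tree and the screenshot.
      return { elements: obs.elements };
    case "minimal":
      // Keep only the five fields listed for the minimal tier.
      return {
        elements: obs.elements.map(({ ref, role, name, selector, tagName }) => ({
          ref,
          role,
          name,
          selector,
          tagName,
        })),
      };
  }
}

const sampleObservation: FullObservation = {
  elements: [
    {
      ref: "e1",
      role: "button",
      name: "Sign in",
      selector: "#sign-in",
      tagName: "button",
      attributes: { type: "submit" },
    },
  ],
  accessibilityTree: "(tree omitted)",
  screenshot: "(base64 omitted)",
};

console.log(Object.keys(applyTier(sampleObservation, "interactive"))); // [ 'elements' ]
```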
6. Selector Caching
Each ElementRegistryEntry stores a cachedCssSelector. When resolving a selector, the server tries the cached CSS path for a direct lookup before falling back to semantic resolution. This eliminates DOM traversal for repeat interactions.
Selector caching is automatic. No flags needed — it kicks in after the first successful
resolution of any element.
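The cache-then-fallback lookup can be sketched as below. Only ElementRegistryEntry and cachedCssSelector come from the text above; `DomLike`, `resolveElement`, `FakeDom`, and the choice to clear a stale cached selector are assumptions made for illustration.

```typescript
interface ElementRegistryEntry {
  ref: string;
  role: string;
  name: string;
  cachedCssSelector?: string;
}

// Hypothetical DOM access layer; element handles are modeled as string ids.
interface DomLike {
  queryCss(selector: string): string | null;
  resolveSemantic(role: string, name: string): { elementId: string; cssSelector: string } | null;
}

function resolveElement(entry: ElementRegistryEntry, dom: DomLike): string | null {
  // Fast path: direct lookup through the cached CSS selector.
  if (entry.cachedCssSelector !== undefined) {
    const hit = dom.queryCss(entry.cachedCssSelector);
    if (hit !== null) return hit;
    // Assumption: a selector that no longer matches is treated as stale.
    delete entry.cachedCssSelector;
  }

  // Slow path: semantic resolution, then remember the CSS path for next time.
  const resolved = dom.resolveSemantic(entry.role, entry.name);
  if (resolved === null) return null;
  entry.cachedCssSelector = resolved.cssSelector;
  return resolved.elementId;
}

// Fake DOM that counts how often the expensive semantic path is taken.
class FakeDom implements DomLike {
  semanticCalls = 0;

  queryCss(selector: string): string | null {
    return selector === "#login" ? "node-1" : null;
  }

  resolveSemantic(role: string, name: string): { elementId: string; cssSelector: string } | null {
    this.semanticCalls += 1;
    return role === "button" && name === "Log in"
      ? { elementId: "node-1", cssSelector: "#login" }
      : null;
  }
}

const dom = new FakeDom();
const loginEntry: ElementRegistryEntry = { ref: "e1", role: "button", name: "Log in" };
resolveElement(loginEntry, dom); // first call takes the semantic path and fills the cache
resolveElement(loginEntry, dom); // second call is served by the cached selector
console.log(dom.semanticCalls); // 1
```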
7. Speculative Prefetch
After handleAgentAct, if the last step was a click or navigation, the server fires off an observe call 200ms later. The result is cached in ClientState.speculativeObservation (URL-matched, 5-second TTL). If the agent requests an observation within that window, the cached result is returned instantly.
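The URL match and 5-second TTL check can be sketched as a small cache lookup. This sketch leaves out the 200ms scheduling and passes the clock in explicitly so that the logic is easy to test. Apart from ClientState.speculativeObservation, all names and field layouts here are hypothetical.

```typescript
const SPECULATIVE_TTL_MS = 5000;

// Hypothetical layout of the cached entry.
interface SpeculativeObservation {
  url: string;
  capturedAt: number; // milliseconds, same clock as `now` below
  observation: string;
}

interface ClientState {
  speculativeObservation?: SpeculativeObservation;
}

// Called when the prefetched observe call completes.
function storeSpeculative(state: ClientState, url: string, observation: string, now: number): void {
  state.speculativeObservation = { url, capturedAt: now, observation };
}

// Called when the agent requests an observation. Returns the cached result
// only if it is for the same URL and still within the TTL.
function takeSpeculative(state: ClientState, currentUrl: string, now: number): string | null {
  const cached = state.speculativeObservation;
  if (cached === undefined) return null;

  if (now - cached.capturedAt > SPECULATIVE_TTL_MS) {
    // Expired: drop it so a fresh observation is taken instead.
    delete state.speculativeObservation;
    return null;
  }

  return cached.url === currentUrl ? cached.observation : null;
}

const clientState: ClientState = {};
storeSpeculative(clientState, "https://example.com/cart", "cart page elements", 1000);

console.log(takeSpeculative(clientState, "https://example.com/cart", 3000)); // cart page elements
console.log(takeSpeculative(clientState, "https://example.com/home", 3000)); // null
console.log(takeSpeculative(clientState, "https://example.com/cart", 7000)); // null
```

In a real server the store step would be triggered by a timer after the act handler returns, and a cache miss would fall through to a normal observe call.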
Fusion in the TypeScript SDK
Fusion in MCP Tools
Impact
Fusion operations cut the number of server calls in multi-step workflows by 50% to 75% in the scenarios below:

| Scenario | Without fusion | With fusion | Reduction |
|---|---|---|---|
| Navigate + observe | 2 calls | 1 call | 50% |
| Login form (fill + fill + click + observe) | 4 calls | 1 call | 75% |
| Browse 5 pages with observation | 10 calls | 5 calls | 50% |
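The Reduction column follows from the two call counts. A small helper reproduces the table's arithmetic; `reductionPercent` is a hypothetical name used only here.

```typescript
// Percentage of server calls saved when going from `without` calls to `withFusion` calls.
function reductionPercent(without: number, withFusion: number): number {
  if (without <= 0) throw new Error("call count without fusion must be positive");
  return Math.round((1 - withFusion / without) * 100);
}

console.log(reductionPercent(2, 1)); // 50 (navigate + observe)
console.log(reductionPercent(4, 1)); // 75 (login form)
console.log(reductionPercent(10, 5)); // 50 (browse 5 pages)
```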