Treni

Routing Comparison

Internal vs external routing benchmark results from Plan v2 Track B.

Overview

Plan v2 Track B measures whether in-process routing (monolith runtime) is faster than external routing (controller + remote tool/model endpoints) on the same hardware.

Date: 2026-02-17 (UTC)
Hardware: AWS g5.xlarge (NVIDIA A10G)

Headline

| Metric | Internal | External | External/Internal |
|---|---|---|---|
| Mean latency | 94.849 ms | 97.927 ms | 1.032x |

Internal routing is faster by about 3.1 ms on mean latency (roughly 3.2%). The ratio is External/Internal, so a value above 1 means internal wins.
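As a quick check on the headline figures, the ratio and the gap can be recomputed from the two mean latencies reported above. This is a minimal sketch; the variable names are illustrative and are not taken from the benchmark harness.

```python
# Mean latencies from the headline row, in milliseconds.
internal_ms = 94.849
external_ms = 97.927

# External/Internal ratio: values above 1 mean internal routing is faster.
ratio = external_ms / internal_ms

# Absolute and relative gap between the two paths.
delta_ms = external_ms - internal_ms
pct_slower = (ratio - 1.0) * 100.0

print(f"External/Internal: {ratio:.3f}x")
print(f"Absolute gap: {delta_ms:.3f} ms")
print(f"External is {pct_slower:.1f}% slower than internal")
```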

Stage Timing

| Stage | Value |
|---|---|
| Internal route mean | 23.380 ms |
| Internal infer mean | 68.286 ms |
| Internal TTFT mean | 53.425 ms |
| External controller route mean | 0.003 ms |
| External tool hop mean | 2.206 ms |
| External model hop mean | 94.859 ms |
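One way to read the stage timings is to compare the sum of the per-stage means with the end-to-end mean for each path. The sketch below rests on two assumptions that this page does not state: that the listed stages are sequential and additive, and that TTFT falls inside the infer stage (so it is left out of the sum). Any remainder is time the listed stages do not account for under those assumptions.

```python
# End-to-end mean latencies from the headline, in milliseconds.
internal_total_ms = 94.849
external_total_ms = 97.927

# Per-stage means from the stage timing table.
# Assumption: stages are additive; TTFT is excluded because it is
# assumed to overlap with the infer stage rather than add to it.
internal_stages = {"route": 23.380, "infer": 68.286}
external_stages = {
    "controller_route": 0.003,
    "tool_hop": 2.206,
    "model_hop": 94.859,
}


def unattributed_ms(total_ms, stages):
    """Return the part of the total not covered by the listed stages."""
    return total_ms - sum(stages.values())


internal_rest = unattributed_ms(internal_total_ms, internal_stages)
external_rest = unattributed_ms(external_total_ms, external_stages)

print(f"Internal: stages sum to {sum(internal_stages.values()):.3f} ms, "
      f"{internal_rest:.3f} ms unattributed")
print(f"External: stages sum to {sum(external_stages.values()):.3f} ms, "
      f"{external_rest:.3f} ms unattributed")
```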

Per-Task Breakdown

| Task | Internal | External |
|---|---|---|
| general_short | 150.767 ms | 152.274 ms |
| receipt_extract | 80.732 ms | 81.270 ms |
| search_grounded | 46.945 ms | 57.237 ms |
| summarize_short | 100.950 ms | 100.928 ms |
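The per-task rows can be reduced to the same External/Internal ratio used in the headline. The sketch below also checks that the unweighted mean over the four tasks matches the headline means to within rounding. That is consistent with equal weighting per task, although this page does not state how the headline was aggregated. The dictionary layout and names are illustrative.

```python
# (internal_ms, external_ms) per task, from the per-task table.
tasks = {
    "general_short": (150.767, 152.274),
    "receipt_extract": (80.732, 81.270),
    "search_grounded": (46.945, 57.237),
    "summarize_short": (100.950, 100.928),
}

# Per-task External/Internal ratio and absolute gap.
ratios = {}
for name, (internal_ms, external_ms) in tasks.items():
    ratios[name] = external_ms / internal_ms
    print(f"{name}: {ratios[name]:.3f}x, "
          f"{external_ms - internal_ms:+.3f} ms")

# Unweighted mean across tasks, for comparison with the headline.
mean_internal = sum(i for i, _ in tasks.values()) / len(tasks)
mean_external = sum(e for _, e in tasks.values()) / len(tasks)

# The reported values are rounded to 3 decimals, so allow 0.001 ms slack.
matches_headline = (
    abs(mean_internal - 94.849) < 0.001
    and abs(mean_external - 97.927) < 0.001
)
print(f"Unweighted means: {mean_internal:.4f} ms vs {mean_external:.4f} ms, "
      f"matches headline: {matches_headline}")

# Task with the largest relative gap, and tasks where external was faster.
largest_gap_task = max(ratios, key=ratios.get)
external_faster = [name for name, r in ratios.items() if r < 1.0]
```

On these numbers, most of the gap comes from search_grounded, and summarize_short is the only task where external is marginally faster.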

Integrity

  • Errors: top-level 0, warmup 0, internal 0, external 0
  • Warmup ordering bias from an earlier run was corrected in this final comparison.

Raw Artifacts
