hackernews client

Micro-Agent: Beat Frontier Models with Collaboration Inside Model API

37 pointsposted 4 hours ago

11 Comments

kristjansson

an hour ago

> The phrase "frontier model" is starting to mean two things. One is a checkpoint. The other is a system boundary.

LLM-isms aside, I don't think we want this to be the case? An LLM, for all its complexity, is something that can be reasoned about. It's picking the next token, until it hits an EOS. The semantics imposed on those tokens (reasoning ,tool call, etc.) are up to the user('s harness) to decide and act on. The more that's pushed behind the facade, the harder it is achieve sufficient understanding of the model's behavior s.t. one can compose it into larger abstractions. Perhaps the performance (and the adherence to an interface/contract) compensate? But swapping from Opus or 5.5 to this or Fugu seems like a much bigger change than swapping between different 'base' models.

Xx_crazy420_xX

an hour ago

I might be wrong, but strongly suspect that Fable 5 is already something in this shape, considering long time to first token while having normal troughput.

meander_water

17 minutes ago

I thought all model providers are doing this under the hood anyway in their UI?

They certainly seem to when A/B testing different models, and Fable routes to Opus 4.8 when guardrails fail.

Also, openrouter recently released a fusion router - https://openrouter.ai/blog/announcements/fusion-beats-fronti...

getcrunk

35 minutes ago

Every one has been saying it’s all about the harness. This is an obvious result of that.

I think an optimal solution would be to have more seamless integration between harness and router roles. As each are only half the picture

Micro-Agent: Beat Frontier Models with Collaboration Inside Model API

11 Comments

kristjansson

Xx_crazy420_xX

meander_water

getcrunk

jerpint

droidjj

tensegrist

jghn

folkrav

Escapade5160

alchemist1e9