WorldVLA: Towards Autoregressive Action World Model

25 pointsposted 7 months ago
by chrsw

2 Comments

blovescoffee

7 months ago

More details on the actual arch would be nice. It seems like this autoregressive action world model space is a local max until the JEPA work takes over.

gunalx

7 months ago

Dont see how we could not hack on jepa instead of any other input encoders on a transformers llm .