DECODER GENERATES ROBOT ACTIONS
From Observation to Action Sequence
⚡ The Action Generation Loop
Encoder processes observation → Creates K, V memory

Decoder asks via Query: "What should I do?"

Cross-attention bridges observation → action!
One observation → 10 coordinated action steps