Leworldmodel
1 mentions across 1 person
Visit ↗All mentions
“In this work, we introduce LeWorldModel (LeWM), the first JEPA that trains stably end-to-end from raw pixels using only two loss terms: a next-embedding prediction loss and a regularizer enforcing Gaussian-distributed latent embeddings.”
Stabilizing Joint-Embedding Predictive Architectures via Gaussian Latent Regular ↗