Zane Chen
Zane Chen @chenzeling4 ·
World model for video-to-video inference. Spatio-World: 14B and 1.3B models. Transform video with AI: depth estimation, diffusion, and text-to-video capabilities. HuggingFace-ready. ⭐ 596 stars #AI #VideoGeneration #WorldModel
Quant Signals
Quant Signals @QuantSignalsXYZ ·
18 days. The World Model is coming. 5,000 AI agents simulating the social battlefield of trading. April 15, 2026. QS 1-year anniversary. Be ready. #QuantSignals #WorldModel
Yu-Cheng Chou
Yu-Cheng Chou @johnson111788 ·
CVPR 2026 🎥 We built a model that lets you fly through a generated video world. Not just generating frames, but maintaining a consistent 3D world under complex camera motion. Code, checkpoints, and even the data pipeline are all open-sourced ↓ #AI #worldmodel #videogen #cvpr #drone
PlanX
PlanX @PlanX_DEX ·
Deeper understanding leads to better execution. PlanX: Execution Beyond Human. #AI #LLM #WorldModel #Xgent
Lex
Lex @PlanX_Lex ·
LLMs vs World Models: what's the real difference?

Most people treat them as the same thing. They are not. They solve fundamentally different problems.

1. LLMs (Large Language Models)
Core function: pattern completion in token space.
(1) Learn the statistical relationship between a context and its next token
(2) Operate on language, not reality
They are excellent at:
(1) Natural language understanding & generation
(2) Code synthesis
(3) Structured output (JSON, workflows, APIs)
(4) Interfacing between humans and systems
LLMs are best understood as a universal interface layer for cognition.

2. World Models
Core function: modeling state transitions of a system.
(1) Learn how environments evolve over time
(2) Capture dynamics, causality, and feedback loops
(3) Operate on states, not tokens
They are used for:
(1) Simulation and planning
(2) Control systems and robotics
(3) Financial market dynamics
(4) Risk and environment modeling
World Models are predictive representations of how the world changes.

3. Key difference
LLMs answer: “Given this context, what should be said next?”
World Models answer: “Given this state, what will happen next?”

4. Where each fits
Use LLMs when:
(1) The problem is semantic
(2) You need interpretation, structuring, or communication
(3) Output is consumed by humans or systems
Use World Models when:
(1) The problem is dynamic
(2) You need prediction under uncertainty
(3) Decisions depend on system evolution

5. In advanced systems they are not competitors; they are complements. A powerful stack (see the sketch after this post) looks like:
(1) LLM → understands intent & constructs strategy
(2) World Model → simulates outcomes & evaluates risk
(3) Execution system → acts on validated decisions
Example (trading systems):
(1) LLM: translates intent → strategy structure
(2) World Model: evaluates regime, risk, and expected behavior
(3) Engine: executes under constraints

6. Takeaway
LLMs model language. World Models model reality. Confusing the two leads to fragile systems. Combining them correctly leads to intelligent ones. #LLM #worldmodel #AI #Xgent
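To make that division of labor concrete, here is a minimal Python sketch of the three-layer stack described in point 5. Every name here (the Protocol interfaces, the `expected_drawdown` key, the 10% risk threshold) is a hypothetical illustration, not PlanX's actual system:

```python
from dataclasses import dataclass
from typing import Optional, Protocol


class LLM(Protocol):
    """Token-space pattern completion: context in, text out."""
    def complete(self, context: str) -> str: ...


class WorldModel(Protocol):
    """State-transition model: state + action in, predicted next state out."""
    def predict(self, state: dict, action: str) -> dict: ...


@dataclass
class TradingStack:
    llm: LLM
    world_model: WorldModel

    def decide(self, intent: str, market_state: dict) -> Optional[str]:
        # 1. LLM layer (semantic): translate human intent into a strategy.
        strategy = self.llm.complete(f"Turn this intent into a strategy: {intent}")
        # 2. World-model layer (dynamic): simulate the strategy's outcome
        #    against the current market state.
        outcome = self.world_model.predict(market_state, strategy)
        # 3. Execution layer: act only if the simulated risk is acceptable.
        if outcome.get("expected_drawdown", 1.0) < 0.10:
            return strategy
        return None
```

The point of the structure: the LLM never touches risk numbers, and the world model never parses intent; each layer answers only the question it is built for.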
Quant Signals
Quant Signals @QuantSignalsXYZ ·
The future of trading isn't a signal. It's a simulation. 5000 AI agents modeling the entire market battlefield in real-time. QS V5 World Model. April 15. 🌍⚡ #QuantSignals #WorldModel
Hod Lipson
Hod Lipson @hodlipson ·
We found evidence of an emergent "Self" in a robot learning in a nonstationary environment. There is a lot of talk about robot #WorldModel learning, but the #SelfModel is where the magic really happens. See the paper here: lnkd.in/eycu5GEs
Repandre.com
Repandre.com @repandrecom ·
Interesting, @ylecun #worldmodel
Alex Ruben
Alex Ruben @rubenxela ·
Recent PNAS paper from Coimbra & Carnegie Mellon: the brain builds actions from recombinable kinematic synergies, like words from letters. Seems our supramarginal gyrus has been doing compositional world models all along. Heading in the right direction, @ylecun? pnas.org/doi/10.1073/pn…
Object-directed action representations are componentially built in parietal cortex | PNAS
The inferior parietal lobule supports action representations that are necessary to grasp and use objects in a functionally appropriate manner [S. H...
From pnas.org
Marouene Chaibi
Marouene Chaibi @Marouenechaibi ·
#AI #WorldModel
alphaXiv
alphaXiv @askalphaxiv ·
Yann LeCun and his team can't stop cooking. "LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels." One of the biggest bottlenecks of JEPAs is that they are hard to train, and this new research changes that. They propose LeWorldModel, which shows that a usable world model can be learned directly from raw pixels, end-to-end. At 15M parameters, it trains without heuristics or anti-collapse hacks while staying competitive and planning up to 48x faster, making JEPA-based modeling much more accessible, cheaper, and more stable.
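For context on what a JEPA objective looks like, here is a minimal PyTorch sketch of embedding-space prediction. The dimensions, architecture, and stop-gradient target are generic assumptions for illustration; notably, the paper claims to avoid exactly this kind of anti-collapse hack, so this shows the baseline JEPA idea, not LeWorldModel's recipe:

```python
import torch
import torch.nn as nn


class TinyJEPA(nn.Module):
    """Minimal joint-embedding predictive architecture (illustrative sizes)."""

    def __init__(self, obs_dim: int = 4096, latent_dim: int = 128, action_dim: int = 4):
        super().__init__()
        # Encoder maps raw pixels to a latent state; there is no pixel decoder.
        self.encoder = nn.Sequential(
            nn.Flatten(), nn.Linear(obs_dim, 256), nn.ReLU(), nn.Linear(256, latent_dim)
        )
        # Predictor rolls the latent state forward, conditioned on the action.
        self.predictor = nn.Sequential(
            nn.Linear(latent_dim + action_dim, 256), nn.ReLU(), nn.Linear(256, latent_dim)
        )

    def loss(self, obs_t, action_t, obs_next):
        z_t = self.encoder(obs_t)
        # Stop-gradient on the target embedding: a common anti-collapse trick
        # (the generic baseline; LeWorldModel reportedly trains without it).
        with torch.no_grad():
            z_target = self.encoder(obs_next)
        z_pred = self.predictor(torch.cat([z_t, action_t], dim=-1))
        # The core JEPA idea: predict in embedding space, not pixel space.
        return nn.functional.mse_loss(z_pred, z_target)
```

Prediction happens entirely in latent space, which is why JEPA-style world models can plan much faster than pixel-space generative models: there is no frame to render at each imagined step.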