Modular 26.3: Mojo 1.0 Beta, Max Video Gen, and More

tosh1 pts0 comments

Modular: Modular 26.3: Mojo 1.0 Beta, MAX Video Gen, and more

Hippocratic AI + Modular to power real-time patient conversations. Read More →

May 7, 2026

Modular 26.3: Mojo 1.0 Beta, MAX Video Gen, and more<br>Modular Team

Product

Surprise: Mojo 1.0 is officially in beta! Modular’s 26.3 release includes new features and modalities, but the headline is that we’ve officially hit beta for Mojo 1.0, with a clear plan to finalize Mojo 1.0 in the coming months. We share details below, alongside other key announcements in our 26.3 release including video generation in MAX with Wan 2.2 and MAX framework updates.<br>Mojo 1.0: now in beta! 🔥🔥🔥<br>Mojo is the foundation for everything we do at Modular, from pushing the state-of-the-art in kernel performance, to running on new and novel accelerator hardware. Back in December, we provided a roadmap to 1.0 for the Mojo language, and we’re now excited to announce that a beta for Mojo 1.0 is available today!<br>Mojo 1.0 will be finalized later this year, along with opening the compiler and providing language stability. This marks the start of a new era for the language. You can now build your projects against known versions of Mojo and they won’t break on you tomorrow. The beta provides what we believe is a “feature complete” Mojo 1.0 language, but there’s a lot to be polished before final release.<br>The 1.0 beta ships several features we've been working toward for a long time. Those include:<br>Safe closures with a new capturing syntax.<br>Conditional conformance to traits.<br>Major improvements to variadics.<br>We’re also introducing the successor to LayoutTensor, called TileTensor, that makes it even easier to write high-performance kernels. TileTensor makes memory layout a compile-time property of the tensor itself, so the swizzles, strides, and indexing that GPU kernels require are checked by the type system rather than maintained by hand. We’ve started a dedicated blog post series on this neat new type, and it underpins the new paradigm of structured kernels we write about in our ongoing series.<br>But that’s not all! We felt that with how far Mojo has come, it was time to give it a proper new home.<br>Introducing mojolang.org<br>That’s right, Mojo now has its own website at mojolang.org!<br>Along with the Mojo 1.0 beta release, launching this website represents a significant milestone; Mojo is nearly ready for widespread adoption. Mojolang.org is an important step towards opening up Mojo to the world with a full 1.0 release, which we aim to complete by the fall.<br>Whether you’re new to Mojo or an experienced contributor, it’s now more clear than ever where to get everything you need.<br>With all the Mojo documentation on its own site, docs.modular.com is now focused on what you need to build and serve models with MAX. You only need Mojo with MAX if you’re extending or building custom kernels, which is why the MAX AI kernels library is still at docs.modular.com. For a full look at all the new Mojo updates that accompany the beta, see the mojolang.org changelog.<br>Video Generation in MAX<br>It’s not just Mojo that has major updates. We’re adding a new modality to the unified Modular platform: video generation.<br>We started with text, expanded to audio, added image generation / editing, and with the addition of video generation, if your application needs to move from a static image to a living scene, you no longer need to step outside the Modular Platform to do it.<br>Today's release brings support for Wan 2.2, one of the leading open video generation models, with many more coming very soon, alongside multiple improvements. Video generation with MAX is available today and will be coming soon to Modular Cloud. If you're building video into your workflow and want to discuss what this means for your infrastructure, contact us.<br>MAX Framework<br>Unified, distributed-aware tensor<br>Real-world models now routinely span multiple GPUs. In 26.3 we've expanded multi-GPU support in max.experimental: a distributed-aware Tensor type, multi-device compilation, and the collective ops you need for tensor-parallel code.<br>PyTorch's DTensor established that placement metadata on a tensor is the right abstraction for reasoning about distribution. JAX's jax.Array showed how readable a single tensor type plus named mesh axes can be. MAX borrows from both and adds one thing neither has: the same .to(...) call accepts both a NamedMapping (JAX-style "this tensor axis maps to that mesh axis") and a PlacementMapping (DTensor-style Replicated / Sharded / Partial). You pick whichever fits the problem; both lower to the same representation.<br>The practical result: Tensor is the same type whether it lives on one device or is sharded across a mesh. Sharding is metadata, not a separate code path.<br>Additional highlights

MAX’s fast eager interpreter reaches 100% operator coverage for eager mode: until recently, graphs in MAX had to go through the full compiler before they could run, we’re fixing that with the MO graph interpreter in max.experimental a...

mojo modular beta video tensor generation

Related Articles