Nvidia CUDA 13.3 Rolls Out CUDA Python 1.0, CUDA Tile for C++

Bender1 pts0 comments

NVIDIA CUDA 13.3 Rolls Out CUDA Python 1.0, CUDA Tile For C++ - Phoronix

Articles & Reviews

News Archive

Forums

Premium Ad-Free<br>Contact

Popular Categories

Close

Articles & Reviews

News Archive

Forums

Premium

Contact

Categories

Computers Display Drivers Graphics Cards Linux Gaming Memory Motherboards Processors Software Storage Operating Systems Peripherals

NVIDIA CUDA 13.3 Rolls Out CUDA Python 1.0, CUDA Tile For C++

Written by Michael Larabel in NVIDIA on 27 May 2026 at 04:22 PM EDT. Add A Comment

NVIDIA on Tuesday released CUDA 13.3 as another significant advancement for their unified GPU programming stack for NVIDIA hardware.

For those wanting to tap the power of CUDA from the Python programming language, CUDA 13.3 marks the CUDA Python 1.0 milestone as a stable, supported means of being able to leverage CUDA in Python apps for AI, data science, scientific computing, and related uses.

For C++ fans, CUDA 13.3 brings CUDA Tile for C++ in bringing the CUDA Tile programming model to the C++ world.

In addition to these programming enhancements, CUDA 13.3 also introduces the CompileIQ compiler auto-tuning framework that can provide up to 15% speed-ups on kernels like GEMM and attention.

CUDA 13.3 also brings a Numba CUDA MLIR back-end , various math library updates, C++23 support in the NVCC and NVRTC code, mmap() support, and other improvements.

More details on this CUDA 13.3 feature update via the NVIDIA Developer Blog.

Add A Comment

Tweet

NVIDIA 610.43.02 Linux Driver Released With Vulkan Improvements, DRM Color Pipeline API<br>DXVK-NVAPI 0.9.2 Further Improves NVIDIA Integration For Steam Play Linux Gaming<br>NVIDIA-VAAPI-Driver 0.0.17 Fixes Support For GB10 Powered Systems<br>NVIDIA Releases CUDA-Oxide 0.1 For Experimental Rust-To-CUDA Compiler<br>NVIDIA Looking To Create New Tool For Generating AutoFDO Profiles For GCC<br>NVIDIA Ships Fixes For Descriptor Heaps, More Vulkan Performance Optimizations

Michael Larabel is the principal author of Phoronix.com and founded the site in 2004 with a focus on enriching the Linux hardware experience. Michael has written more than 20,000 articles covering the state of Linux hardware support, Linux performance, graphics drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and OpenBenchmarking.org automated benchmarking software. He can be followed via Twitter, LinkedIn, or contacted via MichaelLarabel.com.

FreeBSD Foundation Executive Director Tries Daily Driving FreeBSD On Laptop<br>Intel Introducing USB4STREAM Protocol For Linux - Opening Up Some Nifty Uses For USB4<br>AV2 Codec Looks Like It Will Be Officially Released Next Week<br>Linux Sound Subsystem Also Seeing Many Fixes Driven By AI/LLMs<br>California's Age Verification Law May End Up Exempting Most Linux Distributions<br>Today's Linux Networking Fixes: "Craziness Continues With No End In Sight"<br>GNOME Commander 2.0 Released Following Rewrite In Rust & GTK4<br>HP Now Sponsoring The Linux Vendor Firmware Service / Fwupd

NVIDIA CUDA 13.3 Rolls Out CUDA Python 1.0, CUDA Tile For C++

Ubuntu 26.10 Planning To Ship With The Linux 7.2 Kernel

VKD3D-Proton Merges Vulkan Descriptor Heap Support

Linux Developers Looking At Retiring The x32 ABI

Linux Driver To Expose Voltage Inputs For Raspberry Pi SBCs

Canonical Releases Workshop As New Way Of Launching Development Environments

ReactOS Now Running On ARM64 In Experimental Form

Google's ANGLE Merges Wayland Support, Unblocking Chromium Embedded Framework On Wayland

AlmaLinux 10.2 Released For Latest Community-Driven RHEL 10.2 Experience

Pavona Aims To Provide A Certification-Ready, Open-Source Silicon Ecosystem

Phoronix Premium allows ad-free access to the site, multi-page articles on a single page, and other features while supporting this site's continued operations.

Cache Aware Scheduling Shows Nice Wins For AMD Zen 5 On PostgreSQL, Valkey, Network Performance

NVIDIA Vera CPU Benchmarks: Olympus Cores Delivering The Best Performance Ever Seen On ARM

Linux Provides Better Performance With The AMD Ryzen 9 9950X3D2 Over Windows 11

NVIDIA RTX PRO Blackwell Performance Delivering Excellent Linux Performance

Initial Benchmarks Of The SpacemiT K3 RVA23 RISC-V CPU With The K3 Pico-ITX

The mission at Phoronix since 2004 has centered around enriching the Linux hardware experience. In addition to supporting our site through advertisements, you can help by subscribing to Phoronix Premium. You can also contribute to Phoronix through tips/donations via PayPal or Stripe.

Contact

Michael Larabel

Support Phoronix

While Having Ad-Free Browsing,

Single-Page Article Viewing

Facebook

Twitter / X

Legal Disclaimer, Privacy Policy, Cookies | Privacy Manager | Contact

Copyright &copy; 2004 - 2026 by Phoronix Media.

All trademarks used are properties of their respective owners. All rights reserved.

cuda linux nvidia phoronix python support

Related Articles