Skip to main content
Select your country and language

Depending on your country, different offers might be available

Europe

Africa

Middle East

 

Cuda Driver Release News - Exclusive

Our security contacts have confirmed that R570 closes a three-year-old vulnerability in the CUDA driver’s JIT compiler (CVE-2025-0148). The flaw allowed a malicious CUDA binary to escape the driver’s memory sandbox and read host kernel memory.

Consolidates smaller workloads into massive concurrent execution blocks.

Released in September 2025 but continuing to deliver value, CUDA 13.0 Update 1 brought measurable performance improvements:

for Linux) are now standard, ensuring full compatibility with the RTX Pro 6000 Blackwell and GB200/GB300 systems. Decoupled cuBLAS Patches cuda driver release news exclusive

Introduced the "largest update in two decades," featuring NVIDIA CUDA Tile , a tile-based programming model that abstracts specialized hardware like Tensor Cores.

The recent release of (April 2026) and the earlier major launch of CUDA 13.0 (August 2025) represent a transformative shift in GPU computing, specifically tailored for the Blackwell architecture. 0;16;

This is the painful but expected exclusive: Starting with R575 (expected Q3 2026), CUDA 13+ drivers will require compute capability 8.0 (Ampere) or higher for full features, and Turing (7.5) will be moved to a legacy branch. Our security contacts have confirmed that R570 closes

So, what can users expect from the new CUDA driver? Here are just a few of the highlights:

For Kubernetes-managed environments, the driver introduces native container group (cgroup) v2 hardware throttling. Data center administrators can now hard-cap GPU compute shares and memory bandwidth at the driver level, preventing a single misconfigured container from starving adjacent workloads. Developer Roadmap: Migration and Compatibility

Superior support for virtualized GPU (vGPU) environments, allowing multiple virtual machines to utilize a single GPU for parallel tasks. Released in September 2025 but continuing to deliver

This direct peer-to-peer mapping reduces memory allocation overhead by up to 40% in multi-node clusters. It eliminates the standard serialization bottlenecks that frequently stall Large Language Model (LLM) synchronization phases. 2. Stream-Ordered Memory Allocator Refinements

For developers testing CUDA workloads on consumer GPUs, NVIDIA's Game Ready drivers often incorporate the same CUDA user-mode driver improvements as their data center counterparts. Versions (WHQL) launched May 12, 2026, timed to new game releases featuring DLSS and ray tracing. Version 596.21 arrived April 16, 2026, with Game Ready support and fixes for several titles.

Every massive language model training cluster, autonomous vehicle simulation, and quantum-classical hybrid algorithm runs on top of NVIDIA CUDA (Compute Unified Device Architecture). While the hardware—from the historic H100 to the massive Blackwell B200 and Ultra architectures—grabs the mainstream media headlines, the underlying software drivers do the heavy lifting.

For cloud providers, the updated driver means higher tenant density per node and reduced thermal throttling under sustained heavy loads. For AI researchers, it drastically shortens time-to-train metrics, allowing for faster experimentation cycles. Deployment and Compatibility Notes