NVIDIA Launches 96GB Version RTX 5090

At the 2025 GTC conference, Nvidia officially unveiled the RTX Pro 6000 GPU, based on the Blackwell architecture, targeting professional workstations and server markets. This GPU utilizes the same GB202 chip as the consumer-grade RTX 5090 but delivers significant upgrades in performance and specifications, making it a device tailored for professional users such as designers, data scientists, and AI developers. The RTX Pro 6000 comes in three variants: a workstation edition, a Max-Q workstation edition, and a server edition, designed to meet high-performance demands across various scenarios.

The core specifications of the RTX Pro 6000 are impressive. It features 188 streaming multiprocessors (SMs), with only 2% disabled compared to the GB202 chip’s maximum of 192 SMs. Compared to the RTX 5090’s 170 SMs, this represents a 10.6% increase, delivering a powerful lineup of 24,064 CUDA cores, 752 Tensor cores, and 188 ray-tracing cores. Nvidia claims its single-precision floating-point performance (FP32) reaches up to 125 TFLOPS, while its AI computing power hits an astonishing 4,000 TOPS (based on FP4 precision), suggesting a boost clock of approximately 2.6GHz. Additionally, the card comes equipped with a full 128MB L2 cache—outperforming the RTX 5090’s 96MB—along with four NVENC encoders and four NVDEC decoders, enhancing video processing capabilities.

Memory configuration is a standout feature of the RTX Pro 6000. It employs 24Gb (3GB) GDDR7 memory chips, unlike the 2GB chips commonly found in the RTX 50 series, achieving a massive 96GB memory capacity via a 512-bit memory bus. With a memory speed of 28Gbps, it delivers a total bandwidth of 1,792GB/s. This design is particularly suited for handling large datasets and complex AI model training tasks. In contrast, the RTX 5090 offers only 32GB of memory, with the same bandwidth but a clear gap in capacity.

The three variants differ in appearance and power consumption. The Blackwell workstation edition resembles the RTX 5090 in design, featuring a glossy black finish and four DisplayPort 2.1b outputs, with a total graphics power (TGP) of 600W—slightly higher than the RTX 5090’s 575W. The Max-Q workstation edition prioritizes efficiency and low power consumption, with a TGP of 300W, equipped with dual blower fans, and retains a full-height, full-length dual-slot design with four DP 2.1b ports. The server edition shares a similar form factor to the Max-Q variant but opts for a fanless passive cooling design, relying on server chassis airflow for cooling, with power configurable up to a maximum of 600W. All models use a 16-pin 12V-2x6 power connector, with the server and workstation editions benefiting from higher manufacturing standards for enhanced connector reliability.

Nvidia also introduced other models in the RTX Pro series, including the RTX Pro 5000 (based on GB202, with 14,080 CUDA cores, 48GB memory, and 300W TGP), the RTX Pro 4500 (based on GB203, with 10,496 CUDA cores, 32GB memory, and 200W TGP), and the RTX Pro 4000 (8,960 CUDA cores, 24GB memory, and 140W TGP). These models cater to desktop workstations and mobile devices, covering professional needs from high-end to entry-level. All models support PCIe 5.0 x16 interfaces, offering double the bandwidth of the previous generation, and are compatible with DisplayPort 2.1b, supporting display outputs up to 4K at 480Hz or 8K at 165Hz.

The RTX Pro 6000 series also boasts unique features. For instance, the workstation and server editions support Multi-Instance GPU (MIG) technology, allowing a single GPU to be split into up to four independent instances (each with 24GB of memory), enhancing multitasking and parallel processing capabilities. Additionally, the ninth-generation NVENC encoder adds 4:2:2 encoding support, while the sixth-generation NVDEC decoder doubles H.264 decoding throughput, making it ideal for video editing and real-time streaming applications.

In terms of performance, the RTX Pro 6000 outperforms the previous Ada Lovelace-based L40S GPU. In AI inference tasks, its large language model throughput increases by up to 5x, genomic sequencing speeds improve by nearly 7x, text-to-video generation accelerates by 3.3x, and recommendation system inference and rendering performance each see roughly 2x gains. These figures highlight the RTX Pro 6000’s clear advantages in AI, scientific computing, and visual creation fields.

The RTX Pro 6000 workstation edition is expected to hit the market through distributors in April, with manufacturer availability starting in May. The server edition will be offered by vendors such as Cisco, Dell, HP, Lenovo, and Supermicro, while cloud providers like AWS, Google Cloud, and Microsoft Azure will roll out related instances later this year. The remaining RTX Pro models are slated for release in the summer (desktop versions) and later in the year (laptop versions).

Pricing has not yet been disclosed, but based on historical pricing for professional-grade GPUs, the RTX Pro 6000 is likely to exceed $10,000—far above the RTX 5090’s approximate $3,000 retail price. This premium reflects its professional market positioning and top-tier hardware specifications.

The release of the RTX Pro 6000 Blackwell series marks yet another leap forward for Nvidia in the professional computing space. With its massive 96GB memory, formidable AI computing power, and versatile variant designs, it promises to deliver a groundbreaking experience for tech enthusiasts and professionals requiring extreme performance. More details and real-world test data are eagerly anticipated in the coming days.

NVIDIA Is to Launch 5060Ti the Next Generation Mainstream GPU Soon

24 March 2025·691 words·4 mins

5060Ti NVIDIA

NVIDIA RTX 5070 Ti Also Faces ROP Deficiency

24 February 2025·761 words·4 mins

NVIDIA RTX 5070 Ti

Nvidia B300或提前上市

31 December 2024·24 words·1 min

NVIDIA B300

Related