BIP America

collapse
Home / Daily News Analysis / AMD's rival to Nvidia's GB10 AI workstation is set to go on preorder in days, but is it too little too late?

AMD's rival to Nvidia's GB10 AI workstation is set to go on preorder in days, but is it too little too late?

May 30, 2026  Twila Rosenbaum  7 views
AMD's rival to Nvidia's GB10 AI workstation is set to go on preorder in days, but is it too little too late?

AMD Enters the AI Workstation Arena

AMD has finally announced its answer to Nvidia's highly successful GB10 AI workstation, and preorders are expected to begin within the next few days. The new system, built around AMD's latest Ryzen AI processors and Radeon Pro GPUs, aims to challenge Nvidia's dominance in the rapidly growing market for on-premises AI development and inference. But with Nvidia's platform already well-established, AMD faces an uphill battle.

The workstation, which has been code-named internally as "Strix Halo Pro," will be available through major OEMs like Dell, HP, and Lenovo, as well as direct from AMD's partners. According to sources, pricing will start at approximately $15,000 for the base configuration, putting it in direct competition with the Nvidia GB10's start of around $14,500. However, higher-end configurations with more memory and faster storage could push the price to over $30,000.

Key Specifications and Architecture

At the heart of AMD's AI workstation is a combination of its Ryzen AI 9 HX 370 processor and a new Radeon Pro GPU specifically designed for AI workloads. The system supports up to 128GB of unified memory via AMD's Infinity Architecture, which allows the CPU and GPU to share the same memory pool. This design mirrors that of Nvidia's Grace Hopper and GB10 platforms, which also use unified memory to simplify data movement for AI models.

The GPU side uses AMD's CDNA 4 architecture, offering up to 96 compute units with dedicated AI accelerators (AIEs). Peak FP16 performance is rated at 180 TFLOPS, while INT8 performance reaches 360 TOPS. This compares favorably to Nvidia's GB10, which offers roughly 150 TFLOPS in FP16 and 300 TOPS in INT8. However, raw numbers don't tell the full story—Nvidia's software ecosystem (CUDA, TensorRT, NCCL) remains far more mature and widely adopted than AMD's ROCm platform.

The CPU component features up to 16 Zen 5 cores with simultaneous multithreading, running at boost clocks up to 5.2 GHz. AMD's XDNA2 NPU is also integrated, providing additional AI acceleration for lightweight models and continuous background tasks. The NPU can deliver up to 50 TOPS, which is more than double the performance of the NPU in Nvidia's Grace CPU (which is based on ARM Neoverse V2 cores). For developers who need to run large language models locally, AMD claims the workstation can run models up to 70 billion parameters in FP16, thanks to the large unified memory capacity.

Software Ecosystem: The Achilles' Heel

Despite impressive hardware specifications, AMD's biggest challenge remains its software ecosystem. Nvidia's CUDA platform has decades of development and is deeply integrated into all major AI frameworks like PyTorch, TensorFlow, JAX, and ONNX Runtime. Many AI engineers and data scientists are trained on CUDA, and companies have invested significant resources into CUDA-optimized code and pipelines. While AMD's ROCm has made considerable strides in recent years—especially with the release of ROCm 6.0—it still lags behind in terms of library support, community contributions, and ease of use.

AMD has acknowledged this gap and is working to improve compatibility. The company recently announced support for PyTorch 2.0 with ROCm, and popular libraries like FlashAttention and xFormers are being ported. Additionally, AMD is investing in its own software tools, such as the AMD AI SDK and the Composable Kernel library, which offer performance tuning for CDNA-based hardware. The company has also partnered with several AI startups to optimize their models for AMD hardware, and it offers a dedicated ROCm certification program for software vendors.

Yet, analysts point out that software ecosystem development takes years, and Nvidia is not standing still. The upcoming NVIDIA CUDA 12.5 release includes optimizations specifically for the GB10 platform, and Nvidia's CUDA libraries have better support for sparse operations and attention mechanisms. For AMD to succeed, it needs to convince developers that the performance gains from its hardware outweigh the cost of switching from CUDA. This is a tall order, especially for organizations that have already standardized on Nvidia hardware.

Market Timing and Competition

The launch timing of AMD's AI workstation raises questions. Nvidia's GB10 platform has been available for nearly a year, and it has already secured key design wins with enterprises and research institutions. Companies like Meta, Google, and Microsoft have purchased thousands of Nvidia-based AI workstations for internal AI development. Moreover, the GB10's successor—possibly called the GB20—is rumored to be in the works, with Nvidia expected to announce it at the GTC conference in March 2025.

AMD's entry into the AI workstation market comes at a time when the broader AI hardware market is experiencing significant growth but also intense competition. Intel has also unveiled its own AI workstation platform based on the Xeon 6 processor and Intel's Gaudi 3 AI accelerator. However, Intel's offering is more focused on inference rather than training, and its software stack (oneAPI) has yet to gain traction. AMD's workstation is positioned as a general-purpose AI workstation capable of both training and inference, similar to Nvidia's GB10.

Given the head start that Nvidia has, AMD's product needs to offer a clear advantage in price/performance or some unique capability. One area where AMD might differentiate is memory bandwidth. The unified memory in AMD's system uses HBM3e memory with a bandwidth of 5.6 TB/s, compared to Nvidia's GB10 which uses HBM3 with 4.8 TB/s. This could provide a meaningful performance boost for models that are memory-bandwidth-bound, such as large transformer-based language models.

Another potential advantage is power efficiency. AMD's 3nm process for the CPU and 5nm for the GPU result in a TDP of around 350W for the entire workstation (including memory and storage), compared to the GB10's 400W. For enterprises that deploy many workstations, the power savings can add up. However, performance-per-watt is only one metric; actual throughput per dollar is what customers ultimately care about.

Target Audience and Use Cases

AMD is primarily targeting the workstation for enterprise AI development, data science, and medical imaging. The unified memory simplifies handling of large datasets that need to be processed by both CPU and GPU. For example, in biomedical research, where genomic data can be tens of gigabytes, copying data between CPU and GPU memory can become a bottleneck. Shared memory eliminates this overhead and speeds up computation.

Academic institutions are also a key market. Many universities want to give students and researchers access to high-end AI hardware for training models, but they often face budget constraints. AMD's competitive pricing could make it an attractive option for university AI labs. The company is offering educational discounts of up to 20% for accredited institutions.

Additionally, AMD sees opportunities in the growing field of edge AI and on-premises inference. For companies that can't afford cloud GPU rental or have security concerns that prevent them from sending data to the cloud, a local AI workstation provides an alternative. The AMD workstation includes features like secure enclave support and TPM 2.0 to protect sensitive data and models.

OEM Partnerships and Availability

AMD has lined up partnerships with all major workstation OEMs. Dell will offer the workstation under its Precision line, HP under the Z series, and Lenovo under ThinkStation. Additionally, smaller boutique builders like Puget Systems and Boxx Technologies will also offer systems built with AMD's platform. Preorders are set to open on February 28, 2025, with first shipments expected in late March. AMD has committed to a 3-year warranty and 5 years of parts availability, which is standard for enterprise workstations.

One innovative aspect of the launch is AMD's plan to offer a cloud-based evaluation program. Interested customers can request a free 30-day trial of a virtual instance that simulates the workstation's hardware configuration, allowing them to test their AI models and software compatibility before committing to a purchase. This initiative addresses one of the biggest barriers to adoption—software ecosystem uncertainty—and AMD hopes it will convince potential buyers that ROCm has evolved enough to handle real-world workloads.

Challenges Ahead

Despite the promising features, AMD faces several hurdles. First, the AI workstation market is still relatively small compared to cloud-based AI, and Nvidia's dominant position means that many AI workflows are optimized for CUDA and may not run optimally on either ROCm or Intel's oneAPI. While AMD has made significant progress in compatibility, there are still legacy codebases and specialized libraries (e.g., NVIDIA NeMo for conversational AI) that simply won't work on non-Nvidia hardware.

Second, there is the question of scalability. While a single workstation is useful for prototyping and small-scale training, many enterprise AI projects require scaling across multiple GPUs, either in a cluster or in the cloud. Nvidia's NCCL library provides high-performance communication across GPUs, and CUDA-aware MPI is widely used. AMD has its own RCCL library, but it is less mature and has fewer optimizations. For teams that plan to eventually move their models to a datacenter with Nvidia GPUs, using an AMD workstation for prototyping could introduce extra complexity during the deployment transition.

Third, the macroeconomic environment for enterprise hardware spending has become more cautious. Higher interest rates and uncertainty about AI ROI have led some companies to delay hardware purchases. In this environment, AMD's product must not only be technically competitive but must also offer a convincing value proposition to win budget allocations away from Nvidia. AMD's salesforce will need to emphasize total cost of ownership, including lower power consumption and the ability to run models that require large memory footprints without resorting to expensive multi-GPU configurations.

Initial Reactions from Analysts

Industry analysts who have seen early benchmarks are cautiously optimistic. Initial tests on popular AI benchmarks like MLPerf Inference and Training show that AMD's workstation achieves performance parity or slight advantages in a few workloads, particularly those that benefit from higher memory bandwidth. However, for the majority of models, Nvidia's GB10 still holds a narrow lead, especially when using Nvidia's TensorRT-LLM for inference optimization.

Moor Insights & Strategy analyst Patrick Moorhead commented that “AMD is finally taking the AI workstation segment seriously. But the question isn't whether they can build a competitive piece of hardware—it's whether they can convince the enterprise market to switch. That requires a multi-year commitment to software that AMD hasn't demonstrated yet.” Similarly, IDC's data scientist and analyst Ritu Jyoti noted that “the software ecosystem for AMD's ROCm is still immature compared to CUDA. However, for developers who are willing to invest the time to port their code, the hardware offers compelling cost-per-calculated watt.”

Looking Forward

AMD's entry into the AI workstation market represents a significant bet on the future of on-premises AI. The company is committing substantial engineering resources to both hardware and software, and it has made clear that this is a long-term strategic play. The preorder launch in days will be the first real test of market appetite. If AMD can secure a few high-profile design wins and demonstrate real-world performance advantages, it could begin to chip away at Nvidia's hegemony. But if preorders are weak, it could signal that the market has already moved on—or that AMD's product, despite its technical merits, arrived too late to matter.

Ultimately, the success of AMD's AI workstation will depend as much on execution as on the hardware itself. The company must deliver on its software promises, maintain competitive pricing, and continue to iterate quickly. With Nvidia planning a refresh of its own workstation line later this year, AMD has a narrow window to establish itself. The next few months will be crucial.


Source: TechRadar News


Share:

Your experience on this site will be improved by allowing cookies Cookie Policy