NVIDIA A2 vs NVIDIA RTX PRO 6000 Blackwell Server

Comparison NVIDIA A2 with 16 GB GDDR6 and 1,280 cores vs NVIDIA RTX PRO 6000 Blackwell Server with 96 GB GDDR7 and 24,064 cores.

Loading...

Performance Rating

NVIDIA RTX PRO 6000 Blackwell Server outperforms NVIDIA A2 by 1,351.26% in the overall GPU ARK performance rating

A100 A100
H200 H200
MI325X MI325X

NVIDIA A2

4.0

NVIDIA A2

4.0
RX 7900 XTX RX 7900 XTX
MI250 MI250
Instinct MI300X Instinct MI300X

NVIDIA RTX PRO 6000 Blackwell Server

57.5

NVIDIA RTX PRO 6000 Blackwell Server

57.5

Expert Comparison

NVIDIA A2 более экономичный вариант с меньшим количеством ядер и памяти, но достаточным для базовых задач, таких как визуализация и轻化版本: NVIDIA A2 是一个更经济的选择,适用于基本任务如可视化和图形处理,具有较少的核心数和内存。相比之下,NVIDIA RTX PRO 6000 Blackwell Server 配备更多核心和更大内存,适用于高性能计算和专业图形工作负载,性能远超 A2。

Contents:

Memory ML Performance Compute Power Architecture & Compatibility ML Software Support Clocks & Performance Power Consumption Rendering Benchmarks Additional

Memory

Memory Size

16 GB
🔥 +500% 96 ГБ

Memory Type

GDDR6 GDDR7

Memory Bandwidth

200.1 GB/s
🔥 1.79 TB/s

Memory Bus Width

128 бит 512 бит

ML Performance

FP16 (Half Precision)

4.531 TFLOPS
🔥 +2,681% 126.0 TFLOPS

BF16 (Brain Float)

No No

TF32 (TensorFloat)

No No

Compute Power

FP32 (Single Precision)

4.531 TFLOPS
🔥 +2,681% 126.0 TFLOPS

FP64 (Double Precision)

0.0708 TFLOPS
🔥 +2,680% 1.968 TFLOPS

CUDA Cores

1,280
🔥 +1,780% 24,064

RT Cores

10
🔥 +1,780% 188

Architecture & Compatibility

GPU Architecture

Ampere Blackwell 2.0

SM (Streaming Multiprocessor)

10
🔥 +1,780% 188

PCIe Version

PCIe 4.0 x8 PCIe 5.0 x16

ML Software Support

CUDA Version

8.6
🔥 12.0

CUDA Toolkit (first supported)

v11 v12

CUDA Toolkit status

Supported Supported

Clocks & Performance

Base Clock

1,440
🔥 +10% 1,590

Boost Clock

1,770
🔥 +48% 2,617

Memory Clock

1,563
🔥 +12% 1,750

Power Consumption

Recommended PSU

🔥 -75% 250 W
1000 W

Power Connector

None 1x 16-pin

TDP/TGP

🔥 -90% 60 W
600 W

Rendering

Texture Units (TMU)

40
🔥 +1,780% 752

ROP

10
🔥 +1,780% 188

L2 Cache

🔥 2 MB
128 MB

Benchmarks

MLPerf, llama2-70b-99.9 (fp4)

3 250 tokens/s

MLPerf, llama3.1-8b (fp4)

5 758 tokens/s

Geekbench AI, FP16

53 322 points

Geekbench AI, INT8

28 264 points

Geekbench AI, FP32

37 299 points

MLPerf, mixtral-8x7b (fp8)

3 767 tokens/s

Additional

Slots

Single-slot Dual-slot

Release Date

Nov. 10, 2021 March 18, 2025

Display Outputs

No outputs
4x DisplayPort 2.1b

Renting is cheaper than buying