| GPU | B30A (rumored) | HGX H20 | H100 | B200 | B300 (Ultra) |
|---|---|---|---|---|---|
| Packaging | CoWoS-S | CoWoS-S | CoWoS-S | CoWoS-L | CoWoS-L |
| FP4 PFLOPs (per Package) | 7.5 | – | – | 10 | 15 |
| FP8/INT6 PFLOPs (per Package) | 5 | 0.296 | 2 | 4.5 | 10 |
| INT8 PFLOPs (per Package) | 0.1595 | 0.296 | 2 | 4.5 | 0.319 |
| BF16 PFLOPs (per Package) | 2.5 | 0.148 | 0.99 | 2.25 | 5 |
| TF32 PFLOPs (per Package) | 1.25 | 0.074 | 0.495 | 1.12 | 2.5 |
| FP32 PFLOPs (per Package) | 0.0415 | 0.044 | 0.067 | 1.12 | 0.083 |
| FP64/FP64 Tensor TFLOPs (per Package) | 0.695 | 0.01 | 34/67 | 40 | 1.39 |
| Memory | 144 GB HBM3E | 96 GB HBM3E | 80 GB HBM3 | 192 GB HBM3E | 288 GB HBM3E |
| Memory Bandwidth | 4 TB/s | 4 TB/s | 3.35 TB/s | 8 TB/s | 8 TB/s |
| HBM Stacks | 4 | 4 | 5 | 8 | 8 |
| NVLink | ? | ? | NVLink 4.0, 50 GT/s | NVLink 5.0, 200 GT/s | NVLink 5.0, 200 GT/s |
| GPU TDP | 700 W (?) | 400 W | 700 W | 1200 W | 1400 W |

Source: Latest from Tom’s Hardware.

