Benchmarks: Deep Learning Nvidia P100 vs. V100 GPU

Artificial intelligence, and in particular deep learning, has become hugely popular in recent years. It has shown outstanding performance in solving a wide variety of tasks from almost all fields of science. The mainstream has primarily focused on applications for computer vision and language processing, but deep learning also shows great potential for a wider […]

Read more

Benchmarks: Intel Xeon Phi (KNL) vs. Broadwell CPU

Xeon Phi KNL Knights Landing

Intel’s Knights Landing processor is the latest generation of the Xeon Phi many-core processor family. It is a host processor, x86 binary compatible, so it can host any box-standard x86 operating system. It comes with 64-68 cores, high-performance stacked memory, and a 512bits wide vector unit – designed for massively parallel workloads. The Xeon Broadwell […]

Read more

Benchmarks: Nvidia P100 vs K80 GPU

Nvidia’s Pascal generation GPUs, in particular the flagship compute-grade GPU P100, is said to be a game-changer for compute-intensive applications. Compared to the Kepler generation flagship Tesla K80, the P100 provides 1.6x more GFLOPs (double precision float). P100’s stacked memory features 3x the memory bandwidth of the K80, an important factor for memory-intensive applications. In […]

Read more

HPC Benchmarks: What is a Fair Baseline?

There are many studies and publications claiming large speedups on High Performance Computing (HPC) hardware. The reality is that pretty much all kinds of speedups can be reported depending on the baseline used… What does this mean in practice? How should one interpret these numbers? And – more importantly – what should be the baseline […]

Read more

top