Assessing IBM's POWER8, Part 1: A Low Level Look at Little Endian

Name: Assessing IBM's POWER8, Part 1: A Low Level Look at Little Endian
Item: Assessing IBM's POWER8, Part 1: A Low Level Look at Little Endian
Author: Johan De Gelas

by Johan De Gelas on July 21, 2016 8:45 AM EST

Posted in
CPUs
IBM
POWER
POWER8

124 Comments | Add A Comment

124 Comments

System Specs

Lastly, let's take a look at some high level specs. It is interesting to note that the IBM POWER8 inside our S812LC server is a 10-core Single Chip Module. In other words it is a single 10-core die, unlike the 10-core chip in our S822L server which was made of two 5-core dies. That should improve performance for applications that use many cores and need to synchronize, as the latency of hopping from one chip to another is tangible.

The SKU inside the S812LC is available to third parties such Supermicro and Tyan. This cheaper SKU runs at "only" 2.92 GHz, but will easily turbo to 3.5 GHz.

Feature	IBM POWER8 (Available in LC servers)	Intel Broadwell (Xeon E5 v4)
Process tech.	22nm SOI	14nm FinFET
Max clock	2.92-3.5 GHz	2.2-3.6 GHz
Max. core count Max. thread count	10@2.92 GHz (3.5 GHz Turbo) 80 SMT	22@2.2 GHz (2.8 GHz turbo) 44 SMT
TDP	190W	145W
L1-I / L1-D Cache	32 KB/64 KB	32 KB/32 KB
L2 Cache	512 KB SRAM per core	256 KB SRAM per core
L3 Cache	8 MB eDRAM per core	2.5 MB SRAM per core
L4 Cache	16 MB eDRAM per MBC (64 MB total)	None
Memory	1 TB per socket - 32 slots (32 GB per DIMM)	0.768 TB per socket - 12 slots (64 GB per DIMM)
Theoretical Memory Bandwidth	76.8 GB/s Read 38.4 GB/s Write	76.8 GB/s Read or Write
PCIe 3.0 Lanes	32 Lanes	40 Lanes

The Xeon and IBM POWER8 have totally different memory subsystems. The IBM POWER8 connects to 4 "Centaur" buffer cache chips, which have each a 19.2 GB/s read and 9.6 GB/s write link to the processor, or 28.8 GB/s in total. This is a more efficient connection than the Xeon which has a simpler half-duplex connection to the RAM: it can either write with 76.8 GB/s to the 4 channels or read from the 4 channels. Considering that reads happen twice as much as writes, the IBM architecture is - in theory - better balanced and has more aggregated bandwidth.

Heavy SMT: Multi Threading Prowess Configuration and Benchmark Selection

PRINT THIS ARTICLE

Post Your Comment
Please log in or sign up to comment.

Comments Locked

124 Comments

View All Comments

JohanAnandtech - Thursday, July 28, 2016 - link
Send me a mail at johan@anandtech.com
abufrejoval - Thursday, August 4, 2016 - link
Hmm, a bit fuzzy after the first paragraph or so and evidently because I dislike malwaretizement: Such links should be banned!
mystic-pokemon - Friday, July 22, 2016 - link
Hi floobit
For virtualization: powerVM and out of the box KVM (tested on Fedora 23, Ubuntu 15.04 / 15.10 / 16.04) work quite well. Xen doesn't work well or hasn't been officially tested / released.
tipoo - Thursday, July 21, 2016 - link
Fun! I was always curious about this processor.
tipoo - Thursday, July 21, 2016 - link
Interesting that the L3 eDRAM not only allows them to pack in much more L3 (what was it, 3 SRAM transistors per eDRAM or something?), but it's also low latency which was a cited concern with eDARM by some people. Appears to be an unfounded fear.

And then on top of that they put another large L4 eDRAM cache on.

Maybe Intel needs to play with eDRAM more...
tipoo - Thursday, July 21, 2016 - link
Lol, eDRAM, not eDARM
Kevin G - Thursday, July 21, 2016 - link
There was a change in how the L4 cache works from Broadwell to SkyLake on the mobile parts. The implication is that Intel was exploring the idea of a large L4 eDRAM for SkyLake-EP/EX parts. We'll see how that turns out as Intel also has explored using HMC as a cache for high bandwidth applications in Knights Landing. So either way, Intel has thus idea on there radar and we'll see how it pans out next year.
tsk2k - Thursday, July 21, 2016 - link
Is it possible to run Windows on one of these?
ZeDestructor - Thursday, July 21, 2016 - link
At the moment, a very solid no.

That said, if enough partners ask for it and/or if the numbers make sense for Azure, MS will at the very least have a damn good look at porting Windows over.
DanNeely - Thursday, July 21, 2016 - link
It's probably just a case of doing QA and releasing it. They've sold a PPC build in the past; and maintain internal builds for a number of other CPU architectures to avoid accidentally baking x86isms into the core code.

Assessing IBM's POWER8, Part 1: A Low Level Look at Little Endian

System Specs

Post Your Comment

124 Comments

View All Comments

JohanAnandtech - Thursday, July 28, 2016 - link

abufrejoval - Thursday, August 4, 2016 - link

mystic-pokemon - Friday, July 22, 2016 - link

tipoo - Thursday, July 21, 2016 - link

tipoo - Thursday, July 21, 2016 - link

tipoo - Thursday, July 21, 2016 - link

Kevin G - Thursday, July 21, 2016 - link

tsk2k - Thursday, July 21, 2016 - link

ZeDestructor - Thursday, July 21, 2016 - link

DanNeely - Thursday, July 21, 2016 - link

Log in

Don't have an account? Sign up now