Why CCX?

Question

Why CCX?

Samuel Johnson

Since the CCX'es are on the same die whats even the point of them? Seems like its just unecessarily introducing complications and added latency. A similar design to Intel where the cores communicate through the shared L3 cache would have been much better, so would even Ringbus for up to 8 cores.

I get the chiplets since they are different dies and increase yields, but the CCX design seems completely unnecessary.

Attached: aHR0cDovL21lZGlhLmJlc3RvZm1pY3JvLmNvbS84L1QvODQ2NjA1L29yaWdpbmFsL0FNRC03LmpwZw.jpg (1500x1041, 685K)

July 28, 2019 - 13:09

Other urls found in this thread:

en.wikipedia.org/wiki/Complete_graph
twitter.com/SFWRedditVideos

Easton King

Good yields. Monolithic garbage is dead.

July 28, 2019 - 13:12

Cameron James

I addressed this in the post, its on the same die so it does nothing for yields

July 28, 2019 - 13:17

Daniel Bailey

Attached: Satellite-image-of-residential-area-with-apartment-buildings-Al-Khobar-city-Eastern.png (850x466, 326K)

July 28, 2019 - 13:30

Christian Lee

Because the CCX layout is a known entity from Zen and Zen+

When moving to 7nm, you don't want to drastically alter EVERYTHING especially when you don't know how that will effect the final silicon.

July 28, 2019 - 14:00

Austin Bennett

Because the ringbus is only substantially lower latency than the inter CCX lines for the nearest cores. It's a different solution to the same problem of inter core data path bloat. They did choose to attack the problem more aggressively than Intel to push density higher. IMO it's a good tradeoff because the extra area can be dumped into more L3 instead but I'm not an engineer so who knows.

July 28, 2019 - 14:11

Parker Mitchell

I'll add some more. The lowest latency highest bandwidth solution would be to have every core directly connected to every other core. That would be a massive waste of die area at high core counts so some compromise is needed. Trade inter core data paths for die area to use on other things. The ringbus is one compromise, the skylake-x mesh is another, the infinity fabric is another.

July 28, 2019 - 14:20

Charles Gomez

>every core directly connected to every other core
Is that even possible past 2 cores?

July 28, 2019 - 14:50

Mason Jackson

I coudnt find a number for the ringbus latency but up to 8 cores i have a hard time believing its anywhere near cross CCX latency, which is afaik 120 ms for Ryzen 1 and i dont think it has changed for Ryzen 3. For comparison the latency within a CCX is 40ms, i cant imagine Ring being higher than like 60 at the highest point at or below 8 cores.

July 28, 2019 - 15:10

Leo Rivera

You can keep increasing the core counts without having to worry about yields

July 28, 2019 - 15:22

Lincoln Moore

user, I...
en.wikipedia.org/wiki/Complete_graph

July 28, 2019 - 15:39

Ethan King

Because it's meant to be something you buy if you can't afford Intel.

July 28, 2019 - 15:40

Noah Foster

Imagine how poor Google Amazon and Microsoft must be if they're switching to Epyc.

July 28, 2019 - 15:42

Ethan Lopez

Zen has lower latency core to core inside of the CCX than intel does with their ring bus or ring mesh. A pretty damn big benefit to the arch. They're pretty tightly integrated, however signal routing is the most complex part of IC design. The dies are broken up into these groups for the sake of simplicity.

Attached: 1562816788027.png (822x777, 539K)

July 28, 2019 - 15:46

Kayden Sanders

Hello, don't listen to all of these faggots OP, im here to give you the real informed answer.

Firstly the anons who said it's because of segmentation arent wrong but that doesn't answer your question why they still do it for zen 2. Pic related is a zen 2 chiplet and the dual ccx's are in the same configuration as zen 1 just beside one another instead of being placed across.

Simply put, they use the ccx design with zen 2 because its cheaper to keep the cores mostly the same, than it is to spend tons of money developing a new core from scratch. There is absolutely no real reason to use the ccx design over ringbus other than its cheap and already designed and implemented. And they already have full if scalability implemented so a lot of work is already done for them.

At computex lisa su said they were going after low hanging fruit and latency with the zen design as it goes on. Im betting as time moves forward the ccx is going to see some massive changes for the sake of IO and reducing latency, and will effectively cease existing in their current form

Attached: IMG_9252.jpg (145x145, 16K)

July 28, 2019 - 15:52

Kayden Reyes

Its on the same die though

July 28, 2019 - 16:08

Kayden Richardson

So? Just make low binned chips.

July 28, 2019 - 16:20

Caleb Stewart

Inter-CCX latencies are lower than Intel

July 28, 2019 - 16:24

Easton King

Why do you think mainstream Ryzens are 6 core instead of 8?

July 28, 2019 - 16:24

William Sanders

Imagine having to make shit up because you've got nothing else going on

July 28, 2019 - 16:25

Lincoln Lee

Denial is the first step

July 28, 2019 - 16:28

Eli Lee

Scaling cores inside a CCX (Every core is directly connected to every other) becomes increasingly difficult past 4, because adding one core means you have to fit it and route new buses for it to every core. That's why Intel uses Bingbus/Mesh, this is just AMD's solution to the same issue.

July 28, 2019 - 16:28

Connor Price

You are an idiot.
CCX design lets them more easily disable large chunks of the die without affecting the rest of the CPU. It SUBSTANTIALLY increases the amount of useable dies.

Attached: b24574689cc104c17ec036c358e0d01b.jpg (600x526, 32K)

July 28, 2019 - 16:29

William Evans

>Phenom II x2/x3
>4/6 core FX
Binning/die recycling isn't special to zen

July 28, 2019 - 16:32

James Sanchez

Not really. They just disable 2 cores and bin it as as a lower spec CPU. Same thing that both Intel and AMD have been doing for years

You can do that regardless of CCX.

July 28, 2019 - 16:43

Angel Peterson

But you can't just shove more of them and create behemoths like Threadripper with the same dies.

July 28, 2019 - 16:46

Thomas Clark

You are either smarter than every single AMD engineer or just dumb as half this board, and I think we already know the answer.

July 28, 2019 - 16:48

Justin Martinez

Latency inside CCX is LOWER than ringbus,

July 28, 2019 - 16:48

Owen Stewart

It decreases memory latency within the CCX itself due to lower distance between the cores to the cache.
Ringbus starts really falling off after 8-10 cores, and also consumes a ton of power.
With the CCX design, each core has a direct connection to the cache sections.
Even zen1 had lower intra-CCX latency than Intel core architecture did.

It also fascilitates wiring to the cores to cache outside of the CCX.

I was saying for a long time that, no, Zen2 is likely not going to be more than 4 cores per CCX and will likely stay 2 CCX per die because of how more cores per CCX would fundamentally and negatively change the cache layout.
You can see in the way the cache is broken up into 4 L3 segments. Each core is wired to all 4 of them. If it was 8 cores per CCX, there would be a lot more wires to wire each core to 8 L3 segments instead of 4. Instead of 4*4+4*4 it's 8*8. So that's another reason, other than how it'd slow down intra-core latency.

Attached: zen2 inter core latency.png (812x768, 526K)

July 28, 2019 - 16:53

John Brown

You are not getting the point, they are using the same dies for virtually their whole lineup, that's where the increased yields come from.

July 28, 2019 - 16:54

Ethan Brown

But all their dies are 8 core dies, its not like they are producing dies with more than 2 CCX'es anyways that they could benefit from any possible increase in yields CCX'es would provide. They just take multiple 8 core dies and put them on one chip to make high core count CPUs, so why not make those dies monolithic instead of unecessarily dividing them into two.

July 28, 2019 - 19:13

Andrew Scott

Why are you guys all NEET losers if you have all the answers that a multi-billion dollar corporation doesn't? Really makes me think.

July 28, 2019 - 19:17

Aiden Hughes

>so why not make those dies monolithic instead of unecessarily dividing them into two.

There are hundreds of considerations for die layout. To assume that the CCX arrangement is "unnecessary" just makes you sound like a retard. Signal routing, thermals, mitigating some potential defects by not consolidating all of your dense logic in a single area, etc.
Making an 8 core CCX vs the current existing quad core structure would require drastically more data fabric, more associativity in the caches. Nothing is as rudimentary as you seem to think it is. Believe it or not people who design ICs for a living actually know more than you do.

July 28, 2019 - 19:18

Matthew Reyes

Ask the genius OP that can't use google and find his answer in 10 seconds instead of thinking he's smarter than literally thousands of engineers.

July 28, 2019 - 19:21

Brandon Foster

Im not saying there is no point to the CCX design, im just saying yields are not the reason.

July 28, 2019 - 19:31

Jackson Allen

>ms
You're off by a few orders of magnitude

July 28, 2019 - 19:34

William Wilson

>its on the same die so it does nothing for yields
this is where you're wrong dumbass

July 28, 2019 - 19:35

Robert King

Source?

July 28, 2019 - 20:45

Camden Powell

because they're poor and copy-pasting them is essentially free

July 28, 2019 - 20:53

Alexander Howard

It would require exactly the same amount of IO other than cache links (which is essentially the only limitation of IF. You can already consider them as 8 core ccx chiplets, the difference between each ccx is minimal, and AMD isn't even binning them into quad cores like they did zepplin dies because yeilds are so good.

July 28, 2019 - 22:10

Jayden Lewis

Lol
So thats why they are so cheap

July 29, 2019 - 00:50

Levi Lewis

These right here are what you call niggers, folks.

July 29, 2019 - 01:56

Julian Ramirez

Hit a nerve, huh

July 29, 2019 - 03:55

1 2 ... 5 Next

Why CCX?

Last threads