Data centers care a lot about power. The ai products run around 2ghz in the sweet spot. Consumer cards target 3ghz this gen and use 3-4x the power that they do at ~2ghz. The die in the 4080 is a mid range size. It is what used to be in things like the 60 series or maybe a 70 series card. They have been overclocking the snot out of them stock and putting them on massively expensive pcb instead of giving us the larger dies we used to get. That shifts the costs to the board partners and lets them get away with selling the dies at a huge profit compared to their older products.
Back to data centers. You pay a lot for your spot based on power and location. If they stay efficient and pack lots of chips in, that is the cheapest way over the life of the server. If you save 10 or 20% power due to using a new node that is worth a huge reduction in data center fees. On the consumer desktop side, they can overclock to double the power instead of using a larger more expensive die and pocket the difference with no one really caring.
Tsmc 4n is a 6nm process based on improving their 7nm. 3n and 5n are both experimental process. 5n is smaller than 4n at lower density. The consumer cards are 7nm then 4n (the cheap ones). The data center cards are 5n and 3n (the high end expensive processes.) Ordering more consumer or data center do not conflict with each other. Doing more of the workstation cards could since they are full feature consumer dies, but those are not the ai cards.
AI is much more taxing than gaming. Machine learning will peg a gpu at a flat 100% constant use, while gaming fluctuates up and down depending on what’s going on on screen. So being more power efficient while running a card at 100% 24/7 saves money on power costs, and corporations love saving money.
Why would AI cards need to be more efficient?
Data centers care a lot about power. The ai products run around 2ghz in the sweet spot. Consumer cards target 3ghz this gen and use 3-4x the power that they do at ~2ghz. The die in the 4080 is a mid range size. It is what used to be in things like the 60 series or maybe a 70 series card. They have been overclocking the snot out of them stock and putting them on massively expensive pcb instead of giving us the larger dies we used to get. That shifts the costs to the board partners and lets them get away with selling the dies at a huge profit compared to their older products.
Back to data centers. You pay a lot for your spot based on power and location. If they stay efficient and pack lots of chips in, that is the cheapest way over the life of the server. If you save 10 or 20% power due to using a new node that is worth a huge reduction in data center fees. On the consumer desktop side, they can overclock to double the power instead of using a larger more expensive die and pocket the difference with no one really caring.
Tsmc 4n is a 6nm process based on improving their 7nm. 3n and 5n are both experimental process. 5n is smaller than 4n at lower density. The consumer cards are 7nm then 4n (the cheap ones). The data center cards are 5n and 3n (the high end expensive processes.) Ordering more consumer or data center do not conflict with each other. Doing more of the workstation cards could since they are full feature consumer dies, but those are not the ai cards.
AI is much more taxing than gaming. Machine learning will peg a gpu at a flat 100% constant use, while gaming fluctuates up and down depending on what’s going on on screen. So being more power efficient while running a card at 100% 24/7 saves money on power costs, and corporations love saving money.