Viz. earlier discussions related to memory speeds,
here's a single chip (wafer-sized) with just under
a million cores and 44GB of SRAM. Four trillion
transistors. 21PB/s memory bandwidth. 23kW.
https://www.theregister.com/2024/03/13/cerebras_claims_to_have_revived/
On Wed, 13 Mar 2024 19:51:34 GMT
scott@slp53.sl.home (Scott Lurndal) wrote:
Viz. earlier discussions related to memory speeds,
here's a single chip (wafer-sized) with just under
a million cores and 44GB of SRAM. Four trillion
transistors. 21PB/s memory bandwidth. 23kW.
https://www.theregister.com/2024/03/13/cerebras_claims_to_have_revived/
You were talking about 1ns latency then.
The latency on "chip" like that, corner to corner, would be measured in microseconds, at best. Quite possibly, over 10 usec.
Viz. earlier discussions related to memory speeds,
here's a single chip (wafer-sized) with just under
a million cores and 44GB of SRAM. Four trillion
transistors. 21PB/s memory bandwidth. 23kW.
https://www.theregister.com/2024/03/13/cerebras_claims_to_have_revived/
Scott Lurndal wrote:
Viz. earlier discussions related to memory speeds,
here's a single chip (wafer-sized) with just under
a million cores and 44GB of SRAM. Four trillion
transistors. 21PB/s memory bandwidth. 23kW.
https://www.theregister.com/2024/03/13/cerebras_claims_to_have_revived/
CPUs were limited to 100 W thermal dissipation,
GPUs got up to 300 W thermal dissipation,
Now you are looking at the thermal dissipation of 10% of an ECL CRAY-1
As to making racks of these things. The reason CRAYs were limited to 300
KVA is because that is the largest load a non-governmental electrical >consumer can turn on or off without calling the power company to coordinate >changing the grid (so the power company can prepare to ramp (up or down) >their generating capacity.) It generally takes them 15-30 minutes to prepare >for such a change in load.
mitchalsup@aol.com (MitchAlsup1) writes:
Scott Lurndal wrote:
Viz. earlier discussions related to memory speeds,
here's a single chip (wafer-sized) with just under
a million cores and 44GB of SRAM. Four trillion
transistors. 21PB/s memory bandwidth. 23kW.
https://www.theregister.com/2024/03/13/cerebras_claims_to_have_revived/
CPUs were limited to 100 W thermal dissipation,
GPUs got up to 300 W thermal dissipation,
Now you are looking at the thermal dissipation of 10% of an ECL CRAY-1
As to making racks of these things. The reason CRAYs were limited to 300 >>KVA is because that is the largest load a non-governmental electrical >>consumer can turn on or off without calling the power company to coordinate >>changing the grid (so the power company can prepare to ramp (up or down) >>their generating capacity.) It generally takes them 15-30 minutes to prepare >>for such a change in load.
Modern datacenters dissipate 16 to 20kW per rack, with hundreds
or thousands of racks. Basically a megawatt-hour per square meter
with cooling factored in.
Scott Lurndal wrote:
mitchalsup@aol.com (MitchAlsup1) writes:
Scott Lurndal wrote:
Viz. earlier discussions related to memory speeds,CPUs were limited to 100 W thermal dissipation,
here's a single chip (wafer-sized) with just under
a million cores and 44GB of SRAM. Four trillion
transistors. 21PB/s memory bandwidth. 23kW.
https://www.theregister.com/2024/03/13/cerebras_claims_to_have_revived/ >>>
GPUs got up to 300 W thermal dissipation,
Now you are looking at the thermal dissipation of 10% of an ECL CRAY-1
As to making racks of these things. The reason CRAYs were limited to 300 >>>KVA is because that is the largest load a non-governmental electrical >>>consumer can turn on or off without calling the power company to coordinate >>>changing the grid (so the power company can prepare to ramp (up or down) >>>their generating capacity.) It generally takes them 15-30 minutes to prepare >>>for such a change in load.
Modern datacenters dissipate 16 to 20kW per rack, with hundreds
or thousands of racks. Basically a megawatt-hour per square meter
with cooling factored in.
Yes, but racks (or motherboards with the racks) are power cycled individually.
mitchalsup@aol.com (MitchAlsup1) writes:
Scott Lurndal wrote:
mitchalsup@aol.com (MitchAlsup1) writes:
Scott Lurndal wrote:
Viz. earlier discussions related to memory speeds,CPUs were limited to 100 W thermal dissipation,
here's a single chip (wafer-sized) with just under
a million cores and 44GB of SRAM. Four trillion
transistors. 21PB/s memory bandwidth. 23kW.
https://www.theregister.com/2024/03/13/cerebras_claims_to_have_revived/ >>>>
GPUs got up to 300 W thermal dissipation,
Now you are looking at the thermal dissipation of 10% of an ECL CRAY-1
As to making racks of these things. The reason CRAYs were limited to 300 >>>>KVA is because that is the largest load a non-governmental electrical >>>>consumer can turn on or off without calling the power company to coordinate >>>>changing the grid (so the power company can prepare to ramp (up or down) >>>>their generating capacity.) It generally takes them 15-30 minutes to prepare
for such a change in load.
Modern datacenters dissipate 16 to 20kW per rack, with hundreds
or thousands of racks. Basically a megawatt-hour per square meter
with cooling factored in.
Yes, but racks (or motherboards with the racks) are power cycled individually.
Sometimes, perhaps even usually. Unless the datacenter looses power completely (which does happen, if rarely).
Scott Lurndal wrote:
mitchalsup@aol.com (MitchAlsup1) writes:
Scott Lurndal wrote:
mitchalsup@aol.com (MitchAlsup1) writes:
Scott Lurndal wrote:
Viz. earlier discussions related to memory speeds,CPUs were limited to 100 W thermal dissipation,
here's a single chip (wafer-sized) with just under
a million cores and 44GB of SRAM. Four trillion
transistors. 21PB/s memory bandwidth. 23kW.
https://www.theregister.com/2024/03/13/cerebras_claims_to_have_revived/ >>>>>
GPUs got up to 300 W thermal dissipation,
Now you are looking at the thermal dissipation of 10% of an ECL CRAY-1 >>>>>
As to making racks of these things. The reason CRAYs were limited to 300 >>>>>KVA is because that is the largest load a non-governmental electrical >>>>>consumer can turn on or off without calling the power company to coordinate
changing the grid (so the power company can prepare to ramp (up or down) >>>>>their generating capacity.) It generally takes them 15-30 minutes to prepare
for such a change in load.
Modern datacenters dissipate 16 to 20kW per rack, with hundreds
or thousands of racks. Basically a megawatt-hour per square meter
with cooling factored in.
Yes, but racks (or motherboards with the racks) are power cycled individually.
Sometimes, perhaps even usually. Unless the datacenter looses power
completely (which does happen, if rarely).
When the entire data center looses power, its 300KVA+ is the least of the >power companies worries.
Sysop: | Keyop |
---|---|
Location: | Huddersfield, West Yorkshire, UK |
Users: | 547 |
Nodes: | 16 (2 / 14) |
Uptime: | 71:40:47 |
Calls: | 10,398 |
Files: | 14,070 |
Messages: | 6,417,621 |