BRIGHT SIDE OF NEWS About | Advertise | Contact BSN USER Login
| Register
SUBSCRIBE Newsletter | RSS Feeds
Saturday, March 13, 2010
Email this to a friend.
Your friend's e-mail:
Your Name:
Your e-mail:
Message subject:

Intel shows 48-core "cloud" CPU



Wednesday, Intel showed reporters a 48-core processor nicknamed the SCC [single-chip cloud computer] that consumes about the same power as today's desktop processors. With 1.3 billion transistors, the chip boasts 10 to 20 times the processing power of Intel's top of the Nehalem product line. 

Intel's CTO Justin Rattner proudly showed a multi-die manufacturing wafer. He said that they are beyond the wafer level, and have packaged and running parts. He laughingly admitted that this is not the typical Intel "flash the wafer and then wait six months."

Justin Rattner, Intel's CTO explains the details behind SCC
Justin Rattner, Intel's CTO explains the details behind SCC

Rattner said that SCC's 48 IA-32 [Intel Architecture, 32-bit] cores are simple, in-order designs and not sophisticated out-of-order processors. He said that these are more of an Atom-like core design as opposed to a Nehalem-class design. Rattner said the fully programmable 48 processing cores are the most Intel has ever had on a single silicon chip.

Talking about similarities between Atom core and the SCC... cloud computer is another variation of in-order core recently returned to Intel by DoD
Talking about similarities between Atom core and the SCC...

Rattner said that a data center could replace a rack full of equipment with one or a number of high-core count processors like the SCC. The SCC can operate between 1GHz to 3GHz and use from 25 watts up to 125 watts. Rattner said that this is an experimental chip and it never will be a product. Intel's lab plans to hand-build about 100 of these experimental chips for academic research and specialized software development. Rattner said that when you put real silicon and hardware in front of people it speeds the development cycle, compared to working from emulators.

Demo machine with 48 IA-32 cores on a single die
Demo machine with 48 IA-32 cores on a single die

The SCC measures about 567 square millimeters – about the size of a postage stamp. It is fabbed using 45nm CMOS High-K metal gate process. SCC's design includes four DDR3 channels in a 6-by-4 2D-mesh network. The 24 dual-core modules are linked together and communicate by means of a software-configurable message-passing scheme using 384KB of on-die shared memory.

The SCC is the second generation successor to the 80-core "Polaris" that Intel's Tera-Scale research project showed in 2007. The Polaris was just a proof-of-concept project. However, the SCC is based on the Intel Architecture [IA-32], so it runs standard x86 software.

Earlier this year, Tilera, a startup spun out of MIT [Massachusetts Institute of Technology], promised a 100-core processor. Their processor would be fabricated using 40nm technology and be available early next year, Tilera predicted.

Rattner said that they found only one significant bug and that was fixed with only a metal layer change. Because they are using IA-32, x86, cores they are able to run Windows and Linux on SCC systems. Clearly, the major hurdle to overcome with multi-core architecture is the traditional single threaded applications.

Rattner said that programmers need the tools and experience to develop applications with independent tasks running in parallel. The SCC has been extensively tested using JavaScript. Intel says that JavaScript has been under utilized because of the lack of multiple threads. Treating the SCC experimental chip as a "server farm" lets them divide the work involved in calculating complex renderings.

During the megahertz race of yesterday, processor clock frequency got faster and faster, letting single threaded applications execute faster. Then, the higher and higher heat and increasing power consumption shifted designers towards multicore CPUs for increasing computing power.

The SCC uses message passing which is an architectural change from the traditional cache coherency approach. Tim Matson of Intel Research explained that message passing is the idea of sharing data by moving messages directly to other processors over a network rather than reading and writing to a pool of shared memory. An important part of the message passing architecture is extremely low latency and high bandwidths.

Matson said that each core will communicate with the fabric like a mesh network instead of having cache coherency like Larrabee which requires that each core know all about the cache in another core. That limits the number of cores that can be tied together. Another reason for choosing a message passing architectural approach for SCC was to find out how the theory worked in the real world.

The SCC power management can independently control eight-variable voltages and 28 variable-frequency areas of the chip. A programmer can use the API and set break points for changing frequency and power consumptions for the cores. The linked video shows the experimental chip divided into eight cores that are graphically represented on the screen.

Microsoft showed their Visual Studio graphical application with extensions to control 2 through 48 cores. By increasing the number of cores, they sped up the action of a fractal image on screen. This allowed a programmer to see how their code can be refined to more evenly distribute the parallell work load across all the cores.

Intel said they would start delivering their experimental SCC chips in the spring of 2010. By this time next year, we will be hearing how the researchers are doing with their new powerful parallel x86 working environment.



© 2009 - 2010 Bright Side Of News*, All rights reserved.



Related articles:

Tags:

Share and enjoy :)

  • Digg
  • del.icio.us
  • StumbleUpon
  • TwitThis
  • Reddit
  • Furl
  • Google
  • Technorati
  • Sphinn
  • Mixx
  • Facebook
  • LinkedIn
  • Slashdot
  • Newsvine
  • Ma.gnolia
  • BlinkList
  • connotea
  • Fark
  • MisterWong
  • Netvouz
  • PlugIM
  • Propeller
  • Simpy
  • SphereIt
  • Spurl
  • ThisNext
  • YahooMyWeb
  • co.mments
  • Live
  • MySpace
  • Yahoo! Buzz


Comments:

sounds a lot like what cell was originally supposed to be... by: Anonymous on 3/9/2010
This sounds a lot like what cell was originally supposed to be... Kutagari wasn't as crazy as everyone before thought... he was just too advanced for his own good.
by: Anonymous on 1/10/2010
thank god over heat over power use and loud ass fans all dieing isnt it funny nintendo works like this SCREAMS efficency and everyone giggles at them THEN THE ENTIRE INDUSTRY FOLLOWS SUIT

GBA WAS SYSTEM ON CHIP

Wii IS EASY2.5X XBOX 1 AND GAMECUBE AT 480P RENDERING

YET USES BELOW 20 WATTS IN TV OUT DESKTOP FORM

NINTENDO ALWAYS LEADS WERE EVERYONE ELSE IS GOING

FUNNY THAT HAY FOR A DOOMED COMPANY THATS THE MOST PROFITABLE ON EARTH PER EMPLYEE

IF NINTENDO DID COMPUTERS THEY WOULD BE 10X MORE EFFOCENT 10 X COOLER AND NOT HAVE VACUM CLEANER SOUNDING COOLING FANS

INTEL/MICROSOFT JUST DIE PLEASE
by: Anonymous on 1/3/2010
think you're so smart? Put your brain where your ego is - theresaprizeforthat.com <http://theresapri
by: Kakkoii on 12/3/2009
Oh look, Intel's moving towards a programming structure it once used to bash now that it's invading their own territory. A GPU!
by: Kakkoii on 12/3/2009
Oh look, Intel's moving towards a programming structure it once used to bash now that it's invading their own territory. A GPU!
Weird by: Anonymous on 12/3/2009
The one thing that gets on my nerves with this article is the way the phrase "Rattner said" is spat all over.

On-topic: 48 cores on a dieO_O. And it uses as much as a desktop CPU. How did that happen?
RE: Tesla used in networking... by: Theo Valich on 12/3/2009
There are few research projects where engineers are using nVidia Tesla boards to do network routing... the future is definitely fusion, but if AMD/nVidia develop hardware that is as efficient and capable of processing, it isn't hard to imagine how the world will look like in less than five years.

For instance, what if future nVidia GPUs accept ARM instruction set? If NV100 had 16 multicore blocks capable of handling ARM or AMD's Evergreen had 20 multicore blocks capable of ARM/x86...we're getting there.

Ed.
by: Anonymous on 12/3/2009
48 3GHz Pentium-level cores on a die.

That's 500x the processing power of a 266MHz Pentium from around 12 years ago. In a single chip.

Albeit a rather large chip.

I'm sure a 2.66GHz quad-Nehalem is only 100x as fast on a good day.
ARM Cortex A-9 by: Anonymous on 12/3/2009
I wonder if ARM pulled something similar with the Cortex A-9 core.

Anyway an impressive chip.
This might be mainstream to servers about 10 years down the road.
Just...Wow! by: Anonymous on 12/3/2009
48 cores on one silicon chip! For an old-timer like me, that's almost incomprehensible. Forget "blades", racks,
and "dual CPU", or heck, may even lead to having not only CPU but GPU rivaling "today's" cutting edge kits!
<{;-)
Leave a comment:

Author:

Title:

Comment:


Enter the code shown above:

(Note: If you cannot read the numbers in the above
image, reload the page to generate a new one.)




Highlight
  • The Joos Orange: Solar Components preps 20x more efficient personal solar charger
  • Next 3Dmark is DirectX 11 only, "seriously awesome"
  • Photos of future Asus GPUs and Motherboards arrive at BSN*
  • Super Micro comes out with a 12-Core ready Motherboard
  • Super Micro comes out with a 12-Core ready Motherboard
New servers, new features

Welcome to BSN*!

We have just completed the server switch and brought three new server to power Bright Side of News*. We will be introducing new features to the site over the course of next few days, so stay tuned.

For those wanting to know, all three servers are powered by Intel Xeon processors at 2.6 GHz each.

Best regards,

The BSN* team

© 2009 - 2010 Bright Side Of News*, All rights reserved.