Jump to content
Logo

Hybrid core X86 based computer tips up at SC08

With Intel inside
Tuesday, 18 November 2008, 10:14

THE WORLD’S FIRST hybrid-core computer was unveiled yesterday at the Supercomputing Conference 2008 in Austin, Texas (SC08), by start-up firm Convey Computer Corporation.

The Convey HC-1, boasts a combination of patent-pending computer architecture and compiler technology with commercial, off-the-shelf hardware – namely an Intel Xeon processor and Xilinx Field Programmable Gate Arrays. This purportedly melds the low cost, simple programming model of a commodity system with the performance of customised hardware architecture.

There has been buzz and fascination around Field Programmable Gate Arrays (FPGAs) chip technology for yonks now, mainly due to them being easily reprogrammable and offering amazing computer hardware speed. For whatever reason, however, FPGAs have not really been harnessed properly until now.

In the HC-1, FPGAs are wrapped into a reconfigurable coprocessor, running next to a standard multicore x86 CPU. Both the CPU and coprocessor can be programmed with C/C++ and Fortran, meaning that one could literally take legacy code, pass it through the Convey compiler and get a stunningly fast-running executable out the other end. Running the Linux operating system in a 2U chassis and boasting bandwidths of 80 GB/s, the HC-1 also keeps it all together and in sequence with a nifty cache-coherency system.

This sets the HC-1 apart from environments like Nvidia’s CUDA for GPUs or ImpulseC (for FPGAs) which use extended forms of C, meaning legacy code has to be ported before it can be accelerated. Convey also reckons the HC-1 saves on human productivity compared to other environments which are tied to a particular architecture or are forced to rely on a configuration management system to keep separate source trees.

When it comes to usage, Convey believes its HC-1 could be used for a plethora of compute-intensive applications, including financial, seismic and biomedical. The system can adapt itself to its task thanks to the unique compiler design which integrates "personalities". This lets whichever application is targeted, render the FPGA in specific ways to boost throughput for that particular application model and workload.

What this also means is that a framework is created which simultaneously allows for full binary compatibility across the board, whilst letting some operations in a configuration use more of the custom core, thereby speeding up data processing. In other words, developers get to build their very own " procedural" personalities using the Convey supplied Personality Development Kit, turning the HC-1 into a veritable shape shifter. Might be a first that, developers with personalities.

Intel’s Pat Gelsinger, senior Veep and general manager of the firm’s Digital Enterprise Group noted, "Convey’s system is unique in that it does not require programmers to instrument their code but, instead, provides an open-standard programming model that does not use proprietary mechanisms that have historically limited adoption of heterogeneous solutions."

For developers this essentially means getting to work on what appears to be a single x86-based machine but which actually has the added benefit of add-on extensions like a customisable co-processor, making it more similar to older 8087/80287/80387 FPU co-processors which plugged into motherboards all those years back.

On the face of things though, that privilege doesn’t come cheap, with an HC-1 server selling for about $32,000. What Convey plans to, er, convey however, is that since ye old average HPC app can be accelerated 10 times on this platform, every HC-1 is basically the equivalent of 10 regular x86 boxes, which, when put that way, represents significant cost savings as well as savings for power and cooling.

The first version of HC-1 will still use Chipzilla’s front-side bus to talk to the coprocessor, but now that Nehalem has been unleashed, Convey will soon be switching to QuickPath Interconnect-based system.

Steve Wallach, the HC-1’s big daddy and Convey’s creator is no stranger to supercomputers having been co-founder and CTO of Convex Computer, developing vector supercomputers, back in the 1980s and 90s. Wallach had originally approached AMD with his plans for HC-1, but when the firm proved skeptical, he packed up his blueprints and took them down to his old friend Justin Rattner, supercomputer buff and CTO of Intel.

Of course, bringing out new architecture is always a bit of a risk, especially in uncertain economic times, but Convey has, up till now, proved it can pull together and operate on a pretty tight budget. The firm raised only $15.1 million to date.

The first order for a HC-1 has already come in from the University of California, San Diego (UCSD) for its GreenLight power and energy efficiency initiative. µ

Share this:

Comments
Hmmmm

"...one could literally take legacy code, pass it through the Convey compiler and get a stunningly fast-running executable out the other end..." That is a very strong claim. Did the company back it up with any demonstrations?

posted by : hoohoo, 18 November 2008Complain about this comment
x, y, ... z

So, Convex was followed up with Convey. Will Convez be next?

posted by : mike, 18 November 2008Complain about this comment
Next steps

You will get fairly poor performance if you take legacy code written in C, C++ or Fortran and translate that into something for the FPGA. This has been done before (see for example the Impulse C Compiler). You better write your specialized code in VHDL or Verilog, translate it and put the output into the FPGA and let the CPU do the rest. The Xilinx Virtex-5 already comes with up to two PowerPC 440 cores. Combining CPU and FPGA on a motherboard is nothing new, and as you just learned, Xilinx has already done the next step by merging the CPU and FPGA on one chip. I am a bit surprised that AMD turned this project down. But I bet they would love it now. As I wrote in a prior posting, the merger of ATI and AMD only makes sense when you see the future of general-purpose CPUs combined with DSP. And FPGAs are certainly an interesting part of that picture. Let me lay out the next developments: - FPGAs become integrated components of CPUs, - fully customizable microcode, per application (see how the Virtex-5 FPGA is being accessed from the PPC440 cores, a first step), - CPUs based on photonic transistors, - CPUs based on organic material, - synthesizable organic CPUs, - self-organizing organic CPUs, - synthetic brains. In case you were wondering why India sent a probe to the moon a few days ago, you should take a look at their FPGA development plans. Some people realize that a scientific program can be a great vehicle to drive an industrial development. Just wondering what is happened to basic research in the USA...

posted by : HGJ, 18 November 2008Complain about this comment
Tesla

Hmmmmm, now I'm thinking if this is standard they can chuck some Tesla cards in there too.....

posted by : Matty, 18 November 2008Complain about this comment
Replies

Matty: I looked at one of these servers at Supercomputing '08 earlier this week. While the architecture probably would support a Tesla or two, the physical arrangement of the HC-1 almost certainly doesn't leave room for it. It's essentially a sandwich of two 1U servers, with the bottom one a relatively standard x86 server layout, and the top one completely given over to the FPGA-based coprocessor board. HGJ: What may not have been entirely clear in the article is that this is something entirely different from the C-to-FPGA-image compilers that you refer to. What Convey calls a "personality" is a fixed FPGA image (well, set of images; there are 14 FPGAs on the board!) that turns the FPGA board into a programmable coprocessor with a very specialized instruction set. Then, the compiler targets this virtual processor much as it would any other coprocessor. (Thus, this is really much more akin to the CPU-on-an-FPGA designs such as the very successful ARM Cortex M1, rather than to the sorts of projects you're thinking of. And the compiler technology is little different from "normal" compiler technology.) Needless to say, the FPGA images that make up the "personality" are almost certainly generated using specialized languages and carefully optimized. And you can make your own, if you think theirs aren't fast enough.

posted by : Brooks Moses, 23 November 2008Complain about this comment
Advertisement
Subscribe to the INQ Newsletter
Sign-up for the INQBot weekly newsletter
Click here to sign up Existing user
Advertisement
INQ Poll

Consumer Electronics Show

CES 2009 shrinks through lack of interest