Simon Bisson & Mary Branscombe's Blog

Beyond the valley of the CPU

By Simon Bisson & Mary Branscombe in Editorial

Posted in Processors, Software, Applications, Server, Mobile on June 25, 2008 at 9:51 pm


(or “The return of the co-processor”)

The white heat of technology in the 1980s was focussed on the BBC Micro. Not only was it one of the heftiest 8-bit machines around, its open bus made it possible to add more processing power. With everything from music machines to Z-80s running CP/M, the BBC Micro could share its keyboard with many different CPUs.

Those days are on their way back.

Last week Toshiba announced a new range of consumer notebook PCs. Like many of Toshiba’s systems they’re designed to be media players, and in a side swipe at Blu-ray, they now come with an upscaling DVD drive. That’s where the co-processor magic comes in, as Toshiba is using a derivative of the same Cell processor found in Sony’s PS3 to drive its imaging software. A quad-core version of the Cell sits alongside a dual-core Intel processor, and it’s used to handle a range of processor-intensive tasks - acting as a feed to the GPU that drives the screen. Not only does it upscale DVD streams (very impressively), it can also handle file transcoding (so your movies end up on your iPhone that much quicker), and it works well as a way of quickly indexing images and video.

Focused on video, Toshiba’s co-processor also takes advantage of bundled webcams for a limited form of gesture control. Stopping a film by holding up a hand is effective, as is using a clenched fist as an in-air mouse. On his departure, Bill Gates reaffirmed his belief in alternative user interfaces, and this is one approach to delivering those new ways of working.

Co-processors aren’t just for flashy graphics. Back in the 1990s I was writing mathematical simulation software, and at one point I had some electro-thermal models running on one of the MOD’s Crays. It wasn’t just any old Cray - it also had a co-processor in the shape of an additional vector processing unit. That vector co-processor made short work of my arrays of partial differential equations. Its direct descendant is a lot closer to home than an MOD research facility.

In fact, if you’ve got an NVIDIA graphics card it’s right in your PC’s GPU.

Back in January we wrote about Tesla and CUDA, and NVIDIA updated us on the next generation of the Tesla hardware earlier this week. The new G10 Tesla systems are looking very impressive, and the CUDA parallel programming language extensions are now able to work with standard multicore PCs as well as NVIDIA’s GPUs.

Memory is important when you’re using co-processors, and you need a lot if you’re signal-processing seismic data. Tesla will now support 4GB of directly attached memory per GPU, so a quad-GPU system can work with 16GB of data at a time. The numbers look good - running Folding@home, a single Tesla 10 comes in at more than 40 times faster than a standard CPU, and more than 6 times faster than a PS3. Other demonstrations showed significant savings in space and in cost - one finance house has cut its annual costs by a factor of nine, replacing a 600-CPU options valuation system with a handful of front-end CPUs and 12 Tesla GPUs.

Of course, with Snow Leopard around the corner, one of the obvious questions was about Apple’s support for OpenCL. It turns out that CUDA is best thought of as a personality layer on top of NVIDIA’s parallel thread execution (PTX) hardware, and it produces device-specific assembly code. There’s no reason why other GPU programming environments can’t produce the same PTX code - but CUDA will remain NVIDIA’s own route to the GPU as a processing tool, and it will be adding support for additional languages beyond C and C++, with Fortran just around the corner.

The future of the co-processor seems assured, for now at least. It’s time for software companies to start taking notice and to deliver on the promise of additional power beyond the CPU.

–Simon
