This innovatively configured processor is a massively parallel programmable device that tightly couples 2,048 processing elements with 1 Mbit of SRAM, and has been confirmed to achieve 40 GOPS (giga operations per second) at a 200 MHz clock frequency.
Renesas Technology researchers unveiled details at the 2006 IEEE International Solid-State Circuits Conference (ISSCC) being held in San Francisco from February 5.
Image and audio multimedia data processing is essential for digital home appliances and other electronics, and involves a combination of complex operations such as fast Fourier transforms, convolution, and sum of absolute differences. Until now, these operations have generally been handled by hard-wired logic circuits or by a DSP (digital signal processor). However, recent dramatic advances in multimedia applications, such as the rapid increase in pixel counts in image applications, have increased demand for major improvements in multimedia data processing performance. At the same time, there is a growing demand for such processing to be implemented on programmable devices in order to simplify support for various multimedia data standards.
One way of improving processing performance is to increase the operating frequency through the use of finer semiconductor processes. However, it will be difficult to continue to gain major improvements in performance while maintaining lower power consumption, and to achieve the required levels of performance with conventional DSP and similar architectures. Meanwhile, a coarse-grained MIMD (multiple instruction multiple data) processor has been announced as an architecture that increases processing performance, but this also has issues with reducing power consumption.
To solve these issues, Renesas Technology has developed a matrix type processor based on a different memory technology from that of a DSP or MIMD type processor.
This new processor is a fine-grained SIMD (single instruction multiple data) type massively parallel programmable device, featuring the following structural characteristics.
1. Basic configuration: 2-bit processing elements (PEs), each with 512 bits of SRAM assigned as data registers
2. 2,048 PEs and a total of 1 Mbit of SRAM, together with tight coupling between PEs
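The structure listed above can be illustrated with a small sketch. It checks that 2,048 PEs with 512 bits each add up to 1 Mbit, and models the SIMD idea of every PE executing the same step on its own data. The PE count, register size, and 2-bit width come from the article; the function names and the way a wider word is split into 2-bit slices with a carry are assumptions made for illustration only, since the article does not describe the actual instruction set.

```python
# Illustrative model only. NUM_PES, BITS_PER_PE and PE_WIDTH come from the
# article; everything else (names, the slicing scheme) is hypothetical.
NUM_PES = 2048      # processing elements
BITS_PER_PE = 512   # SRAM bits assigned to each PE as data registers
PE_WIDTH = 2        # each PE handles 2 bits per step


def total_sram_bits(num_pes=NUM_PES, bits_per_pe=BITS_PER_PE):
    """Total register storage across all PEs, in bits."""
    return num_pes * bits_per_pe


def simd_add(a, b, word_bits=8, pe_width=PE_WIDTH):
    """Add two equal-length vectors elementwise, modulo 2**word_bits.

    Each outer iteration stands for one broadcast instruction: every lane
    (one PE per element) adds the same pe_width-bit slice of its operands.
    Returns the result vector and the number of broadcast steps used.
    """
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    mask = (1 << pe_width) - 1
    result = [0] * len(a)
    carry = [0] * len(a)
    steps = 0
    for shift in range(0, word_bits, pe_width):
        # Same operation on every lane, different data per lane (SIMD).
        for lane in range(len(a)):
            s = ((a[lane] >> shift) & mask) + ((b[lane] >> shift) & mask) + carry[lane]
            result[lane] |= (s & mask) << shift
            carry[lane] = s >> pe_width
        steps += 1
    return result, steps
```

For example, `simd_add([1, 2, 255], [3, 4, 1])` returns `([4, 6, 0], 4)`: an 8-bit add takes four 2-bit steps in this model, and the step count does not depend on how many lanes are active, which is the property a massively parallel SIMD array exploits.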
The key to the increased performance of this processor lies in how efficiently the individual processing elements are operated. Also, the layout and connection of the processing elements and data registers are important factors in achieving reductions in area and power consumption.
A prototype processor using the new technology was implemented in 90 nm CMOS with a core area of 3.1 mm², and achieved processing performance of 40 GOPS at a 200 MHz clock frequency with 250 mW power dissipation. Compared to a conventional in-house DSP, these figures correspond to approximately 70 times better performance per unit area and approximately 13 times better performance per unit power.
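The prototype figures can be turned into a few derived metrics with simple arithmetic. The sketch below uses only the numbers stated in the article (40 GOPS, 200 MHz, 250 mW, 3.1 mm²); the function names are hypothetical, and the article does not define the width of one "operation", so the per-cycle figure should be read as a rough indication only.

```python
# Figures quoted in the article for the 90 nm prototype.
GOPS = 40.0          # giga operations per second
CLOCK_MHZ = 200.0    # clock frequency
POWER_MW = 250.0     # power dissipation
AREA_MM2 = 3.1       # core area


def ops_per_cycle(gops=GOPS, clock_mhz=CLOCK_MHZ):
    """Average operations completed per clock cycle across the whole array."""
    return gops * 1000.0 / clock_mhz


def gops_per_watt(gops=GOPS, power_mw=POWER_MW):
    """Performance per unit power, in GOPS/W."""
    return gops / (power_mw / 1000.0)


def gops_per_mm2(gops=GOPS, area_mm2=AREA_MM2):
    """Performance per unit area, in GOPS/mm^2."""
    return gops / area_mm2


if __name__ == "__main__":
    print(f"ops per cycle: {ops_per_cycle():.0f}")    # about 200
    print(f"GOPS per watt: {gops_per_watt():.0f}")    # about 160
    print(f"GOPS per mm^2: {gops_per_mm2():.1f}")     # about 12.9
```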
Source: Renesas Technology