Rapid VLIW Processor Customization for Signal Processing Applications Using Combinational Hardware Functions

PDF / 910,024 Bytes
23 Pages / 600.03 x 792 pts Page_size
58 Downloads / 290 Views

Rapid VLIW Processor Customization for Signal Processing Applications Using Combinational Hardware Functions Raymond R. Hoare, Alex K. Jones, Dara Kusic, Joshua Fazekas, John Foster, Shenchih Tung, and Michael McCloud Department of Electrical and Computer Engineering, University of Pittsburgh, Pittsburgh, PA 15261, USA Received 12 October 2004; Revised 30 June 2005; Accepted 12 July 2005 This paper presents an architecture that combines VLIW (very long instruction word) processing with the capability to introduce application-specific customized instructions and highly parallel combinational hardware functions for the acceleration of signal processing applications. To support this architecture, a compilation and design automation flow is described for algorithms written in C. The key contributions of this paper are as follows: (1) a 4-way VLIW processor implemented in an FPGA, (2) large speedups through hardware functions, (3) a hardware/software interface with zero overhead, (4) a design methodology for implementing signal processing applications on this architecture, (5) tractable design automation techniques for extracting and synthesizing hardware functions. Several design tradeoﬀs for the architecture were examined including the number of VLIW functional units and register file size. The architecture was implemented on an Altera Stratix II FPGA. The Stratix II device was selected because it oﬀers a large number of high-speed DSP (digital signal processing) blocks that execute multiply-accumulate operations. Using the MediaBench benchmark suite, we tested our methodology and architecture to accelerate software. Our combined VLIW processor with hardware functions was compared to that of software executing on a RISC processor, specifically the soft core embedded NIOS II processor. For software kernels converted into hardware functions, we show a hardware performance multiplier of up to 230 times that of software with an average 63 times faster. For the entire application in which only a portion of the software is converted to hardware, the performance improvement is as much as 30X times faster than the nonaccelerated application, with a 12X improvement on average. Copyright © 2006 Hindawi Publishing Corporation. All rights reserved.

1.

INTRODUCTION

In this paper, we present an architecture and design methodology that allows the rapid creation of application-specific hardware accelerated processors for computationally intensive signal processing and communication codes. The target technology is suitable for field programmable gate arrays (FPGAs) with embedded multipliers and for structured or standard cell application-specific integrated circuits (ASICs). The objective of this work is to increase the performance of the design and to increase the productivity of the designer, thereby enabling faster prototyping and time-to-market solutions with superior performance. The design process in a signal processing or communications product typically involves a top-down design approach with successively lower level impleme

Data Loading...

Rapid VLIW Processor Customization for Signal Processing Applications Using Combinational Hardware Functions

Recommend Documents

Designing BEE: A Hardware Emulation Engine for Signal Processing in Low-Power Wireless Applications

Multisensor Processing for Signal Extraction and Applications

Mobius: Packet Re-processing Hardware Architecture for Rich Policy Handling on a Network Processor

Transforming Signal Processing Applications into Parallel Implementations

Signal and Image Processing in Medical Applications

Microphone Arrays Signal Processing Techniques and Applications

Numerical Linear Algebra in Signal Processing Applications

Signal Processing for Auralization

Emerging Signal Processing Techniques for Power Quality Applications

Device Applications of Rapid Thermal Processing

Signal Processing Technologies for Ambient Intelligence in Home-Care Applications

Mathematical Summary for Digital Signal Processing Applications with Matlab