Efficient Compilation for Application Specific Instruction set DSP Processors with Multi-bank Memories

Efficient Compilation for Application Specific Instruction set DSP Processors with Multi-bank Memories PDF Author: Joar Sohl
Publisher: Linköping University Electronic Press
ISBN: 9175191512
Category : Signal processing
Languages : en
Pages : 188

Get Book

Book Description
Modern signal processing systems require more and more processing capacity as times goes on. Previously, large increases in speed and power efficiency have come from process technology improvements. However, lately the gain from process improvements have been greatly reduced. Currently, the way forward for high-performance systems is to use specialized hardware and/or parallel designs. Application Specific Integrated Circuits (ASICs) have long been used to accelerate the processing of tasks that are too computationally heavy for more general processors. The problem with ASICs is that they are costly to develop and verify, and the product life time can be limited with newer standards. Since they are very specific the applicable domain is very narrow. More general processors are more flexible and can easily adapt to perform the functions of ASIC based designs. However, the generality comes with a performance cost that renders general designs unusable for some tasks. The question then becomes, how general can a processor be while still being power efficient and fast enough for some particular domain? Application Specific Instruction set Processors (ASIPs) are processors that target a specific application domain, and can offer enough performance with power efficiency and silicon cost that is comparable to ASICs. The flexibility allows for the same hardware design to be used over several system designs, and also for multiple functions in the same system, if some functions are not used simultaneously. One problem with ASIPs is that they are more difficult to program than a general purpose processor, given that we want efficient software. Utilizing all of the features that give an ASIP its performance advantage can be difficult at times, and new tools and methods for programming them are needed. This thesis will present ePUMA (embedded Parallel DSP platform with Unique Memory Access), an ASIP architecture that targets algorithms with predictable data access. These kinds of algorithms are very common in e.g. baseband processing or multimedia applications. The primary focus will be on the specific features of ePUMA that are utilized to achieve high performance, and how it is possible to automatically utilize them using tools. The most significant features include data permutation for conflict-free data access, and utilization of address generation features for overhead free code execution. This sometimes requires specific information; for example the exact sequences of addresses in memory that are accessed, or that some operations may be performed in parallel. This is not always available when writing code using the traditional way with traditional languages, e.g. C, as extracting this information is still a very active research topic. In the near future at least, the way that software is written needs to change to exploit all hardware features, but in many cases in a positive way. Often the problem with current methods is that code is overly specific, and that a more general abstractions are actually easier to generate code from.

Efficient Compilation for Application Specific Instruction set DSP Processors with Multi-bank Memories

Efficient Compilation for Application Specific Instruction set DSP Processors with Multi-bank Memories PDF Author: Joar Sohl
Publisher: Linköping University Electronic Press
ISBN: 9175191512
Category : Signal processing
Languages : en
Pages : 188

Get Book

Book Description
Modern signal processing systems require more and more processing capacity as times goes on. Previously, large increases in speed and power efficiency have come from process technology improvements. However, lately the gain from process improvements have been greatly reduced. Currently, the way forward for high-performance systems is to use specialized hardware and/or parallel designs. Application Specific Integrated Circuits (ASICs) have long been used to accelerate the processing of tasks that are too computationally heavy for more general processors. The problem with ASICs is that they are costly to develop and verify, and the product life time can be limited with newer standards. Since they are very specific the applicable domain is very narrow. More general processors are more flexible and can easily adapt to perform the functions of ASIC based designs. However, the generality comes with a performance cost that renders general designs unusable for some tasks. The question then becomes, how general can a processor be while still being power efficient and fast enough for some particular domain? Application Specific Instruction set Processors (ASIPs) are processors that target a specific application domain, and can offer enough performance with power efficiency and silicon cost that is comparable to ASICs. The flexibility allows for the same hardware design to be used over several system designs, and also for multiple functions in the same system, if some functions are not used simultaneously. One problem with ASIPs is that they are more difficult to program than a general purpose processor, given that we want efficient software. Utilizing all of the features that give an ASIP its performance advantage can be difficult at times, and new tools and methods for programming them are needed. This thesis will present ePUMA (embedded Parallel DSP platform with Unique Memory Access), an ASIP architecture that targets algorithms with predictable data access. These kinds of algorithms are very common in e.g. baseband processing or multimedia applications. The primary focus will be on the specific features of ePUMA that are utilized to achieve high performance, and how it is possible to automatically utilize them using tools. The most significant features include data permutation for conflict-free data access, and utilization of address generation features for overhead free code execution. This sometimes requires specific information; for example the exact sequences of addresses in memory that are accessed, or that some operations may be performed in parallel. This is not always available when writing code using the traditional way with traditional languages, e.g. C, as extracting this information is still a very active research topic. In the near future at least, the way that software is written needs to change to exploit all hardware features, but in many cases in a positive way. Often the problem with current methods is that code is overly specific, and that a more general abstractions are actually easier to generate code from.

Embedded DSP Processor Design

Embedded DSP Processor Design PDF Author: Dake Liu
Publisher: Elsevier
ISBN: 9780080569871
Category : Computers
Languages : en
Pages : 808

Get Book

Book Description
This book provides design methods for Digital Signal Processors and Application Specific Instruction set Processors, based on the author's extensive, industrial design experience. Top-down and bottom-up design methodologies are presented, providing valuable guidance for both students and practicing design engineers. Coverage includes design of internal-external data types, application specific instruction sets, micro architectures, including designs for datapath and control path, as well as memory sub systems. Integration and verification of a DSP-ASIP processor are discussed and reinforced with extensive examples. Instruction set design for application specific processors based on fast application profiling Micro architecture design methodology Micro architecture design details based on real examples Extendable architecture design protocols Design for efficient memory sub systems (minimizing on chip memory and cost) Real example designs based on extensive, industrial experiences

Integrated Circuit and System Design. Power and Timing Modeling, Optimization and Simulation

Integrated Circuit and System Design. Power and Timing Modeling, Optimization and Simulation PDF Author: Johan Vounckx
Publisher: Springer
ISBN: 3540390979
Category : Computers
Languages : en
Pages : 677

Get Book

Book Description
This book constitutes the refereed proceedings of the 16th International Workshop on Power and Timing Modeling, Optimization and Simulation, PATMOS 2006. The book presents 41 revised full papers and 23 revised poster papers together with 4 key notes and 3 industrial abstracts. Topical sections include high-level design, power estimation and modeling memory and register files, low-power digital circuits, busses and interconnects, low-power techniques, applications and SoC design, modeling, and more.

System-on-Chip

System-on-Chip PDF Author: Bashir M. Al-Hashimi
Publisher: IET
ISBN: 0863415520
Category : Technology & Engineering
Languages : en
Pages : 940

Get Book

Book Description
This book highlights both the key achievements of electronic systems design targeting SoC implementation style, and the future challenges presented by the continuing scaling of CMOS technology.

Single-Instruction Multiple-Data Execution

Single-Instruction Multiple-Data Execution PDF Author: Christopher J. Hughes
Publisher: Springer Nature
ISBN: 3031017463
Category : Technology & Engineering
Languages : en
Pages : 105

Get Book

Book Description
Having hit power limitations to even more aggressive out-of-order execution in processor cores, many architects in the past decade have turned to single-instruction-multiple-data (SIMD) execution to increase single-threaded performance. SIMD execution, or having a single instruction drive execution of an identical operation on multiple data items, was already well established as a technique to efficiently exploit data parallelism. Furthermore, support for it was already included in many commodity processors. However, in the past decade, SIMD execution has seen a dramatic increase in the set of applications using it, which has motivated big improvements in hardware support in mainstream microprocessors. The easiest way to provide a big performance boost to SIMD hardware is to make it wider—i.e., increase the number of data items hardware operates on simultaneously. Indeed, microprocessor vendors have done this. However, as we exploit more data parallelism in applications, certain challenges can negatively impact performance. In particular, conditional execution, non-contiguous memory accesses, and the presence of some dependences across data items are key roadblocks to achieving peak performance with SIMD execution. This book first describes data parallelism, and why it is so common in popular applications. We then describe SIMD execution, and explain where its performance and energy benefits come from compared to other techniques to exploit parallelism. Finally, we describe SIMD hardware support in current commodity microprocessors. This includes both expected design tradeoffs, as well as unexpected ones, as we work to overcome challenges encountered when trying to map real software to SIMD execution.

Modern Processor Design

Modern Processor Design PDF Author: John Paul Shen
Publisher: Waveland Press
ISBN: 147861076X
Category : Computers
Languages : en
Pages : 657

Get Book

Book Description
Conceptual and precise, Modern Processor Design brings together numerous microarchitectural techniques in a clear, understandable framework that is easily accessible to both graduate and undergraduate students. Complex practices are distilled into foundational principles to reveal the authors insights and hands-on experience in the effective design of contemporary high-performance micro-processors for mobile, desktop, and server markets. Key theoretical and foundational principles are presented in a systematic way to ensure comprehension of important implementation issues. The text presents fundamental concepts and foundational techniques such as processor design, pipelined processors, memory and I/O systems, and especially superscalar organization and implementations. Two case studies and an extensive survey of actual commercial superscalar processors reveal real-world developments in processor design and performance. A thorough overview of advanced instruction flow techniques, including developments in advanced branch predictors, is incorporated. Each chapter concludes with homework problems that will institute the groundwork for emerging techniques in the field and an introduction to multiprocessor systems.

Journal of VLSI Signal Processing Systems for Signal, Image, and Video Technology

Journal of VLSI Signal Processing Systems for Signal, Image, and Video Technology PDF Author:
Publisher:
ISBN:
Category : Integrated circuits
Languages : en
Pages : 528

Get Book

Book Description


Handbook of Pattern Recognition and Computer Vision (5th Edition)

Handbook of Pattern Recognition and Computer Vision (5th Edition) PDF Author: Chi-hau Chen
Publisher: World Scientific
ISBN: 9814656534
Category : Computers
Languages : en
Pages : 582

Get Book

Book Description
The book provides an up-to-date and authoritative treatment of pattern recognition and computer vision, with chapters written by leaders in the field. On the basic methods in pattern recognition and computer vision, topics range from statistical pattern recognition to array grammars to projective geometry to skeletonization, and shape and texture measures. Recognition applications include character recognition and document analysis, detection of digital mammograms, remote sensing image fusion, and analysis of functional magnetic resonance imaging data, etc.

Electronics

Electronics PDF Author:
Publisher:
ISBN:
Category : Electronics
Languages : en
Pages : 766

Get Book

Book Description


Real-time Digital Signal Processing

Real-time Digital Signal Processing PDF Author: Sen-Maw Kuo
Publisher: 清华大学出版社有限公司
ISBN: 9787302077008
Category : Signal processing
Languages : en
Pages : 524

Get Book

Book Description