8002824274

8002824274



Increasing data locality of parallel programs executed in embedded Systems

Włodzimierz Bielecki, Krzysztof Kraska

Szczecin University of Technology,

Faculty of Computer Science and Information Technology

Abstract:

Increasing data locality in a program is a necessary factor to improve performance of software parts of embedded systems, to decrease power consumption and reduce memory on chip size. A possibility of applying a method of ąuantifying data locality to a novel method of extracting synchronization-free threads is introduced. It can be used to agglomerate extracted synchronization-free threads for adopting a parallel program to a target architecture of an embedded system under various loop Schedule options (space-time mapping) and the influence of well-known techniąues to improve data locality. The choice of the best combination of loop transformation techniąues regarding to data locality makes possible improving program performance. A way of an analysis of data locality is presented. Experimental results are depicted and discussed. Conclusion and futurę research are outlined.

Keywords:

data locality, compilers, parallel processing, embedded systems

1. Introduction

Embedded systems involved in data processing consist of programmable processors, program components processed by the processors and hardware components often rea-lized in FPGA cooperating with software parts of the system. Software components enable making corrections ąuickly, codę reusing, elastic changing a program permitting for reducing the time of delivering product to the market. But programmable processors consume considerably morę energy and they are significantly slower than their hardware counterparts. Hardware Solutions assure greater performance and smaller power consumption however designing time may be long and the design process is expen-sive [9].

Multiprocessor architectures for embedded systems are widespread on the contem-porary electronic market. For example, the Xilinx FPGA Virtex-4FX chip includes up to two PowerPC405 processors, National Semiconductor’s Geode chips enable to join several processors to build a multiprocessor system based on the x86 architecture, the HPOC project (Hundred Processors, One Chip) undertaken at Hewlett Packard attempts to consolidate hundreds of processors on one chip using co-resident on-chip memory [4].



Wyszukiwarka

Podobne podstrony:
Increasing data locality of parallel programs executed in embedded systems 9 Increasing data localit
Spis treści Włodzimierz Bielecki, Krzysztof Kraska INCREASING DATA LOCALITY OF PARALLEL PROGRAMS EXE
Increasing data locality ofparallel programs executed in embedded Systems 11 Obviously, increase in
Increasing data locality ofparallel programs executed in embedded Systems 13References [1]
7 lncreasing data locality ofparallel programs executed in embedded systems Livermore loop Kernel
GDAŃSK UNIVERSITY OF TECHNOLOGY PROGRAMME GUIDESTUDY IN GDAŃSK
1746 Humań Molecular Genetics. 1999. Vol. S. No. 9 Figurę 4. Localization of the STAT55 gene in rela
Włodzimierz Bielecki, Krzysztof Kraska The assurance of the optimal performance for a program with t
PROGRAM ROZWOJOWY^1 POLITECHNIKI WARSZAWSKIEJ Fig. 1.15. Instantaneous value of voltage and current
(31) In order to increase the availability of information on the use of medicinal products in t
S5003156 170 Fig. i Stara Słupia sneltlng site. The remains consist of slag pits arranged in paralle
Working in patient-facing roles or working outside of home significantly increases the likelihood of
Physics in Canada/12 shall be voting members of the local Executive. 5.    Sections s
PROGRAM ROZWOJOWY^1 POLITECHNIKI WARSZAWSKIEJ Fig. 1.15. Instantaneous value of voltage and current
20 STKESS ANALYSIS    I Stresses in a stiff jointed polygonal frame under a system of
GDAŃSK UNIVERSITY OFTECHNOLOGYSTUDY IN GDAŃSK at our University of Technology PROGRAMME GUIDEPublish
47 Structure of the solute yield in the Vistula.. it decreased rather than increased, because the do
285 production drifts in each sublevel, Cp; cost of parallel drilling, r2; Index of Determination, H

więcej podobnych podstron