Increasing data locality ofparallel programs executed in embedded Systems 13
[1] Bielecki W., Siedlecki K. Extracting synchronization-free slices in perfectly nested uniform and non-uniform loops. Electonic Modeling, 2007.
[2] Bielecki W., Kraska K., Siedlecki K. Increasing Program Locality by Extracting Synchronization-Free Slices in Arbitrarily Nested Loops. Proceedings of the Four-teenth International Multi-Conference on Advanced Computer Systems ACS2007, 2007.
[3] Wolfe M. High Performance Compilers for Parallel Computing. Addison-Wesley, 1996.
[4] Richardson S. MPOC. A Chip Multiprocessor for Embedded Systems, [online] http://www.hpl.hp.com/techreports/2002/HPL-20Q2-186.pdf. HP Laboratories, 2002.
[5] Netlib Repository at UTK and ORNL [online]. http://www.netlib.org/benchmarkAivermorec.
[6] Aho A. V., Lam M. S., Sethi R., Ullman J. D. Compilers: Principles, Techniąues and Tools, 2nd Edition. Addison-Wesley, 2006.
[7] IBM PowerPC Multi-Core Instruction Set Simulator. User’s Guide, IBM Corporation, 2008.
[8] IBM RlSCWatch Debugger. User’s Manuał, IBM Corporation, 2008.
[9] Stasiak A. Klasyfikacja Systemów Wspomagających Proces Przetwarzania i Sterowania. II Konferencja Naukowa KNWS'05, 2005.
[10] Griebl M. Habilitation. Automatic Parallelization ofLoop Programs for Distri-buted Memory Architectures. Iniversitat Passau, 2004.
[11] Kelly W., Maslov V., Pugh W., Rosser E., Shpeisman T., Wonnacott D. The omega library interface guide. Technical Report CS-TR-3445, University of Maryland, 1995.
[12] Chandra R., Dagum L., Kohr D., Maydan D., McDonald J., Menon R. Parallel Programing In OpenMP. Morgan Kaufmann, 2001.