Advanced SearchSearch Tips
Exploiting Static Non-Uniform Cache Architectures for Hard Real-Time Computing
facebook(new window)  Pirnt(new window) E-mail(new window) Excel Download
 Title & Authors
Exploiting Static Non-Uniform Cache Architectures for Hard Real-Time Computing
Ding, Yiqiang; Zhang, Wei;
  PDF(new window)
High-performance processors using Non-Uniform Cache Architecture (NUCA) are increasingly used to deal with the growing wire delays in multicore/manycore processors. Due to the convergence of high-performance computing with embedded computing, NUCA caches are expected to benefit high-end embedded systems as well. However, for real-time systems that use multicore processors with NUCA caches, it is crucial to bound worst-case execution time (WCET) accurately and safely. In this paper, we developed a WCET analysis approach by considering the effect of static NUCA caches on WCET. We compared the WCET in real-time applications with different topologies of static NUCA caches. Our experimental results demonstrated that the static NUCA cache could improve the worst-case performance of realtime applications using multicore processor compared to the cache with uniform access time.
Non-Uniform Cache Architecture;Worst-case execution time;Real-time systems;Multicore processors;
 Cited by
C. Kim, D. Burger, and S. W. Keckler, "An adaptive, nonuniform cache structure for wire-delay dominated on-chip caches," in Proceedings of the 10th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-X), San Jose, CA, 2002, pp. 211-222.

J. Yan and W. Zhang, "WCET analysis for multi-core processors with shared L2 instruction caches," in Proceedings of 14th IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS'08), St. Louis, MO, 2008, pp. 80-89.

Y. Li, V. Suhendra, Y. Liang, T. Mitra, and A. Roy-choudhury, "Timing analysis of concurrent programs running on shared cache multi-cores," in Proceedings of 30th IEEE Real-time System Symposium (RTSS), Washington, DC, 2009, pp. 57-67.

M. Lv, W. Yi, N. Guan, and G. Yu, "Combining abstract interpretation with model checking for timing analysis of multicore software," in Proceedings of 31st IEEE International Real-Time System Symposium (RTSS), San Diego, CA, 2010, pp. 339-349.

T. Kelter, H. Falk, P. Marwedel, S. Chattopadhyay, and A. Roychoudhury, "Bus-aware multicore WCET analysis through TDMA offset bounds," in Proceedings of the 23rd Euromicro Conference on Real-Time Systems (ECRTS), Porto, Portugal, 2011, pp. 3-12.

S. Chattopadhyay, L. K. Chong, A. Roychoudhury, T. Kelter, P. Marwedel, and H. Falk, "A unified WCET analysis framework for multi-core platforms," in Proceedings of IEEE 18th Real-Time and Embedded Technology and Applications Symposium (RTAS), Beijing, China, 2012, pp. 99-108.

Y. Ding and W. Zhang, "WCET analysis of static NUCA caches," in Proceedings of the 33rd IEEE International Performance Computing and Communications Conference (IPCCC), Austin, TX, 2014, pp. 1-6.

W. Zhang and J. Yan, "A unified timing analysis approach for shared caches of multicores," in Proceedings of the Work-in-Progress (WIP) session of 17th IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS), Chicago, IL, 2011.

W. Zhang and J. Yan, "Static timing analysis of shared caches for multicore processors," Journal of Computing Science and Engineering, vol. 6, no. 4, pp. 267-278, 2012. crossref(new window)

Y. S. Li and S. Malik, "Performance analysis of embedded software using implicit path enumeration," in Proceedings of the ACM SIGPLAN 1995 Workshop on Languages, Compilers, & Tools for Real-Time Systems (LCT-RTS 1995), La Jolla, CA, 1995, pp. 88-98.

Y. S. Li, S. Malik, and A. Wolfe, "Performance estimation of embedded software with instruction cache modeling," in Proceedings of IEEE/ACM International Conference on Computer-Aided Design, San Jose, CA, 1995, pp. 380-387.

Y. S. Li, S. Malik, and A. Wolfe, "Cache modeling for realtime software: beyond direct mapped instruction caches," in Proceedings of the 17th IEEE Real-Time Systems Symposium (RTSS'96), Washington, DC, 1993, pp. 254-263.

C. A. Healy, R. D. Arnold, F. Mueller, D. Whalley, and M. G. Harmon, "Bounding pipeline and instruction cache performance," IEEE Transactions on Computers, vol. 48, no. 1, pp. 53-70, 1999. crossref(new window)

Homepage of CPLEX,

Malardalen WCET Research Group, Malardalen WCET benchmark suite,