Cache-Based Architectures for High Performance Computing
Abstract:
Many researchers have noted that scientific codes perform poorly on computer architectures built around a cache-based memory hierarchy. Furthermore, a number of researchers and some vendors have concluded that simply making the caches larger would not solve this problem. As an alternative, some vendors of HPC systems have opted to equip their systems with fast memory interfaces, but with a limited amount of on-chip cache and no off-chip cache. Some RISC-based HPC systems, e.g., the Cray T3E, provide a prefetching or streaming facility that allows data to be streamed more efficiently between main memory and the processor. However, there are fundamental limitations on the benefits of these approaches that make it difficult to see how they will, by themselves, eliminate the Memory Wall. It has been shown that if one relies solely on this approach on the Cray T3E, one is unlikely to achieve much better than 4-6% of the machine's peak performance. Does this mean that, as the speed of RISC/CISC processors increases, systems designed to process scientific data are doomed to hit the Memory Wall? The answer to that question depends on the ability of programmers to find innovative ways to take advantage of caches. This report discusses some of the techniques that can be used to overcome this hurdle, allowing one to consider what types of hardware resources are required to support these techniques.
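The abstract appeals to programmer techniques for exploiting caches without spelling one out. As a minimal, illustrative sketch only (not taken from the report), the following C code shows cache blocking (loop tiling) of a matrix multiply, one widely used way to restructure a loop nest so that operands are reused while they are resident in cache; the matrix size N and tile size B are assumed values chosen for the example.

/* Illustrative sketch: cache blocking (loop tiling) for matrix multiply.
 * The tile size B would be tuned so that three B x B tiles fit in the
 * target cache; the values here are placeholders for the example. */
#include <stdio.h>

#define N 512   /* matrix dimension (assumed for the example) */
#define B 64    /* tile size (assumed; tune to the cache size) */

static double A[N][N], Bm[N][N], C[N][N];

static void matmul_blocked(void)
{
    for (int ii = 0; ii < N; ii += B)
        for (int kk = 0; kk < N; kk += B)
            for (int jj = 0; jj < N; jj += B)
                /* Work on one tile at a time so operands stay in cache. */
                for (int i = ii; i < ii + B; i++)
                    for (int k = kk; k < kk + B; k++) {
                        double a = A[i][k];
                        for (int j = jj; j < jj + B; j++)
                            C[i][j] += a * Bm[k][j];
                    }
}

int main(void)
{
    /* Fill with simple values so the run does real work. */
    for (int i = 0; i < N; i++)
        for (int j = 0; j < N; j++) {
            A[i][j] = 1.0;
            Bm[i][j] = 2.0;
            C[i][j] = 0.0;
        }
    matmul_blocked();
    printf("C[0][0] = %f\n", C[0][0]);  /* expect 2.0 * N = 1024.0 */
    return 0;
}

The point of the restructuring is that each B x B tile of A, Bm, and C is touched many times in succession, so most accesses hit in cache instead of going to main memory; the untiled loop nest streams entire rows and columns and pays the memory-latency cost the abstract describes.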