

## Motivation

- Caches consume 30-60% of processor energy in embedded systems
  - *Example: 43% for StrongArm-1*
- Many academic studies on caches
  - [Albera, Bahar, '98] – Power and performance trade-offs
  - [Amrutur, Horowitz, '98, '00] – Speed and power scaling
  - [Bellas, Hajj, Polychronopoulos, '99] – Dynamic cache management
  - [Ghose, Kamble, '99] – Power reduction through sub-banking, etc.
  - [Inoue, Ishihara, Murakami, '99] – Way-predicting set-associative cache
  - [Kin, Gupta, Mangione-Smith, '97] – Filter cache
  - [Ko, Balsara, Nanda, '98] – Multilevel caches for RISC and CISC
  - [Wilton, Jouppi, '94] – CACTI cache model
- Many industrial low-power processors use CAM (content-addressable memory)
  - ARM3 – 64-way set-associative – [Furber et al. '89]
  - StrongArm – 32-way set-associative – [Santhanam et al. '98]
  - Intel XScale – 32-way set-associative – '01

CAM: Fast and Energy-Efficient

## Conclusion



- CAM tags are high-performance and low-power
  - Energy consumption of a 32-way CAM < 2-way RAM
  - Easy to implement highly associative tags
  - Low area overhead (10%)
  - Comparable access delay
  - Better CPI by reducing miss rate
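
The miss-rate bullet can be illustrated with a small simulation. The sketch below is not the measurement setup behind these slides. It is a minimal model that assumes LRU replacement, a 64-line cache with 32-byte lines, and a synthetic trace of four addresses that all map to the same set in the 2-way configuration. The function name `count_misses` and all parameters are hypothetical. The model compares only miss counts; it says nothing about the energy or delay of CAM versus RAM tags.

```python
from collections import OrderedDict


def count_misses(trace, num_lines, ways, line_bytes=32):
    """Count misses of an LRU set-associative cache (illustrative model only)."""
    num_sets = num_lines // ways
    # One OrderedDict per set; key order tracks recency (oldest first).
    sets = [OrderedDict() for _ in range(num_sets)]
    misses = 0
    for addr in trace:
        block = addr // line_bytes
        index = block % num_sets   # which set the block maps to
        tag = block // num_sets    # tag stored in (and matched by) the tag array
        ways_in_set = sets[index]
        if tag in ways_in_set:
            ways_in_set.move_to_end(tag)         # hit: mark most recently used
        else:
            misses += 1
            if len(ways_in_set) >= ways:
                ways_in_set.popitem(last=False)  # evict least recently used
            ways_in_set[tag] = True
    return misses


# Synthetic trace (assumption): four addresses 1 KB apart, visited 10 times.
# With 64 lines, all four collide in one set of the 2-way cache.
trace = [i * 1024 for i in range(4)] * 10

misses_2way = count_misses(trace, num_lines=64, ways=2)
misses_32way = count_misses(trace, num_lines=64, ways=32)
print(misses_2way, misses_32way)
```

On this deliberately adversarial trace, the 2-way cache misses on every access because four blocks cycle through two ways, while the 32-way cache of the same capacity takes only the four cold misses. Real workloads show smaller gaps; the trace is chosen only to make the conflict-miss effect visible.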

## Thank You! http://www.cag.lcs.mit.edu/scale/