## 6.893 "A comparison of Full and Partial Predicated Execution Support for ILP Processors"

Serhii Zhak

10/26/2000

| 6.893 Presentation                                                 | Serhii Zhak                                                              | 10/26/2000                |
|--------------------------------------------------------------------|--------------------------------------------------------------------------|---------------------------|
| Full Predication Support v                                         | vs. Partial Predication Support                                          |                           |
| Sull Prodication Sunnar                                            | <b>f</b> •                                                               |                           |
| un i reuleation Suppor                                             |                                                                          |                           |
| <ul><li>most flexibility and</li><li>computation of pred</li></ul> | largest potential performance im-<br>icate values is highly efficient an | provements<br>nd parallel |
| Partial Predication Supp                                           | oort                                                                     |                           |
| - requires very little of                                          | change to existing architectures                                         |                           |
| - increase in the num                                              | ber of instructions executed and                                         | requires a                |
| 1                                                                  | gistors to hold intermediate value                                       |                           |

| 6.893 Presentation                | Serhii Zhak            | 10/26/2000 |
|-----------------------------------|------------------------|------------|
|                                   |                        |            |
| ISA Extentions                    |                        |            |
| <b>Extentions for Full Predic</b> | ation Support          |            |
| Suppression of Execution          |                        |            |
| Expression of Condition           |                        |            |
| - predicate register file         |                        |            |
| - set of predicate instruction    | ons                    |            |
| <b>Extentions for Partial Pre</b> | lication Support       |            |
| - conditional move                |                        |            |
| - register from the int           | eger or floating-point |            |
| register file is used t           | o hold the condition   |            |
| rather than special r             | egister file           |            |
| - select                          | -                      |            |
| ~                                 |                        |            |

| ompiler Support          |                                     |          |
|--------------------------|-------------------------------------|----------|
| ompiler Support for Ful  | l Predication                       |          |
| - hyperblocks            |                                     |          |
| Compiler Support for Par | tial Predication                    |          |
| - Predicate Promotion    |                                     |          |
| - removing the predica   | te from a predicated instruction    |          |
| - Basic Conversion       | red instruction by a sequence of    |          |
| instructions with equi   | ivalent functionality               |          |
| - Peephole Optimizations | 5                                   |          |
| - ineficiencies since ea | ach instruction is considered indep | endently |

| 6.893 Presentation | Serhii Zhak | 10/26/2000 |
|--------------------|-------------|------------|
|                    |             |            |

**Results:** 

8-issue Processor 1 branch/cycle Assuming 2-cycle branch prediction miss penalty

**Conditional Move** allows about 30% performance gain **Full Predication** - another 30% gain over *conditional move* 

## 6.893 "The Program Decision Logic Approach to Predicated Execution"

Serhii Zhak

10/26/2000

| 6.893 Presentation    | Serhii Zhak            | 10/26/2000 |
|-----------------------|------------------------|------------|
| Architecture Suppo    | rt                     |            |
| Baseline Predicate A  | rchitecture            |            |
| - PlayDoh is a EPIC   | architecture           |            |
| Limitations of PlayD  | oh                     |            |
| - no convenient way   | to perform arbitrary   |            |
| logical operations    | on predicate register  |            |
| values: hurt Boole    | an minimization approa | ach        |
| Predicate Define Exte | ensions                |            |
|                       |                        |            |
|                       |                        |            |

| 6.893 Presentation                                                                         | Serhii Zhak                        | 10/26/2000 |
|--------------------------------------------------------------------------------------------|------------------------------------|------------|
| verview of Compile                                                                         | r Techniques                       |            |
| - hyperblock                                                                               |                                    |            |
| - predicate promotion                                                                      | 1                                  |            |
| inimization of Progra                                                                      | am Decision Logic                  |            |
| <ul> <li>Optimizations of pr</li> <li>Two-Level predicat</li> <li>Factorization</li> </ul> | edicate expressions<br>e synthesis |            |

|  | 6.893 Presentation | Serhii Zhak | 10/26/2000 |
|--|--------------------|-------------|------------|
|--|--------------------|-------------|------------|

**Results:** 

*Program decision logic minimization* provides an average overall speedup of 13% for an 8-issue Processor