llvm-project

Files

michaelselehov 621fc8774e [AMDGPU] Implement LSR cost model for GFX9+ (#184138 )

AMDGPU previously had no target-specific LSR cost model, so the generic
heuristic would often introduce extra induction variables and base-add
chains that hurt VALU throughput on GFX9+ (observed on gfx942).

Implement a custom cost model:

- isLSRCostLess(): prioritize per-iteration instruction count over setup
costs, penalize IV multiplies, and demote register count. Pre-GFX9 falls
back to the default comparator.
- getScalingFactorCost(): report that base+scale*index addressing
requires an extra ADD instruction.
- isNumRegsMajorCostOfLSR(): return false.
- shouldDropLSRSolutionIfLessProfitable(): return true.

Assisted-by: Claude Opus

2026-03-23 12:18:11 +01:00

atomics.ll

…

different-addrspace-addressing-mode-loops.ll

…

different-addrspace-crash.ll

…

lit.local.cfg

…

lsr-invalid-ptr-extend.ll

…

lsr-postinc-pos-addrspace.ll

…

lsr-void-inseltpoison.ll

…

lsr-void.ll

…

preserve-addrspace-assert.ll

…