llvm-project/llvm/lib/Target/AMDGPU/AMDGPULateCodeGenPrepare.cpp at main

Files

michaelselehov 3645cef1ef [AMDGPU] LiveRegOptimizer: consider i8/i16 binops on SDWA (#155800 )

PHI-node part was merged with PR#160909.

Extend `isOpLegal` to treat 8/16-bit vector add/sub/and/or/xor as
profitable on SDWA targets (stores and intrinsics remain profitable).
This repacks loop-carried values to i32 across BBs and restores SDWA
lowering instead of scattered lshr/lshl/or sequences.

Testing:
- Local: `check-llvm-codegen-amdgpu` is green (4314/4320 passed, 6
XFAIL).
- Additional: validated in AMD internal CI

2025-12-15 12:04:33 -05:00

21 KiB

Raw Permalink Blame History

View Raw

21 KiB Raw Permalink Blame History

21 KiB

Raw Permalink Blame History