You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Default no GPU usage, that's the moderate way I thought to temporarily face the situation that flux Clip_l loaded in CPU: We cannot load it in vram then at least speed it up.
But as the issue, that disable ROCm ability but only use OpenBLAS.
Why OpenBLAS?
Not like years ago, current OpenBLAS speed is much faster than avx2, and faster than Blis either. (About Blis Vs OpenBLAS, I only tested in arm.)
The text was updated successfully, but these errors were encountered:
FNsi
changed the title
Force use GPU backend while compile both OpenBLAS and ROCm
Autoselect CPU instead GPU backend while compile both OpenBLAS and ROCm
Jan 10, 2025
Default no GPU usage, that's the moderate way I thought to temporarily face the situation that flux Clip_l loaded in CPU: We cannot load it in vram then at least speed it up.
But as the issue, that disable ROCm ability but only use OpenBLAS.
Why OpenBLAS?
Not like years ago, current OpenBLAS speed is much faster than avx2, and faster than Blis either. (About Blis Vs OpenBLAS, I only tested in arm.)
The text was updated successfully, but these errors were encountered: