Series
Infrastructure & Engineering
3 min read
LLMs on Strix Halo: Three Days Chasing the MES Firmware 0x83 Bug
Series: home-kubernetes-journal
Running llama.cpp on my k3s + AMD GPU cluster kept hitting memory access faults. The culprit: a bug in MES firmware 0x83 shipped with amdgpu-dkms-firmware.