#GPU - Tags | Otama's Playground

2026年5月31日 2 min read

Local LLMs and ChatGPT on One Endpoint with LiteLLM

Using LiteLLM Proxy to unify Lemonade (AMD ROCm llama.cpp) and a ChatGPT subscription behind a single OpenAI-compatible API in my homelab.

#LiteLLM #Kubernetes #Homelab +6

Series

Infrastructure & Engineering

2026年5月23日 3 min read

LLMs on Strix Halo: Three Days Chasing the MES Firmware 0x83 Bug

Series: home-kubernetes-journal

Running llama.cpp on my k3s + AMD GPU cluster kept hitting memory access faults. The culprit: a bug in MES firmware 0x83 shipped with amdgpu-dkms-firmware.

#Kubernetes #Homelab #AMD +5

Series

Infrastructure & Engineering

2026年5月23日 3 min read

I Tried GPU Passthrough in an Incus VM and Ended Up on Bare Metal

Series: home-kubernetes-journal

I set up Strix Halo as a k3s worker via Incus VM + VFIO, then hit a wall: once the GPU enters a dirty state, recovery is impossible without bare metal.

#Kubernetes #Homelab #AMD +5