Tags

2 articles

May 31, 2026 2 min read

Local LLMs and ChatGPT on One Endpoint with LiteLLM

Using LiteLLM Proxy to unify Lemonade (AMD ROCm llama.cpp) and a ChatGPT subscription behind a single OpenAI-compatible API in my homelab.

May 31, 2026 2 min read

Commit message generation was quietly draining my Copilot free quota. I fixed it by pointing the utility model at a local Gemma-4-E4B via LiteLLM.