Infrastructure & Engineering
2 min read
Local LLMs and ChatGPT on One Endpoint with LiteLLM
Using LiteLLM Proxy to unify Lemonade (AMD ROCm llama.cpp) and a ChatGPT subscription behind a single OpenAI-compatible API in my homelab.
2 articles
Commit message generation was quietly draining my Copilot free quota. I fixed it by pointing the utility model at a local Gemma-4-E4B via LiteLLM.