mirror of
https://github.com/mudler/LocalAI.git
synced 2025-06-16 15:55:00 +00:00
![]() * feat: Add backend gallery This PR add support to manage backends as similar to models. There is now available a backend gallery which can be used to install and remove extra backends. The backend gallery can be configured similarly as a model gallery, and API calls allows to install and remove new backends in runtime, and as well during the startup phase of LocalAI. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add backends docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * wip: Backend Dockerfile for python backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: drop extras images, build python backends separately Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixup on all backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * test CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Tweaks Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop old backends leftovers Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixup CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Move dockerfile upper Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix proto Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Feature dropped for consistency - we prefer model galleries Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add missing packages in the build image Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * exllama is ponly available on cublas Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * pin torch on chatterbox Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups to index Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Debug CI * Install accellerators deps Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add target arch * Add cuda minor version Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Use self-hosted runners Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci: use quay for test images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups for vllm and chatterbox Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small fixups on CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chatterbox is only available for nvidia Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Simplify CI builds Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Adapt test, use qwen3 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(model gallery): add jina-reranker-v1-tiny-en-gguf Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(gguf-parser): recover from potential panics that can happen while reading ggufs with gguf-parser Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Use reranker from llama.cpp in AIO images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Limit concurrent jobs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> |
||
---|---|---|
.. | ||
alpaca.yaml | ||
arch-function.yaml | ||
cerbero.yaml | ||
chatml-hercules.yaml | ||
chatml.yaml | ||
codellama.yaml | ||
command-r.yaml | ||
deephermes.yaml | ||
deepseek-r1.yaml | ||
deepseek.yaml | ||
dreamshaper.yaml | ||
falcon3.yaml | ||
flux-ggml.yaml | ||
flux.yaml | ||
gemma.yaml | ||
granite.yaml | ||
granite3-2.yaml | ||
hermes-2-pro-mistral.yaml | ||
hermes-vllm.yaml | ||
index.yaml | ||
llama3-instruct.yaml | ||
llama3.1-instruct-grammar.yaml | ||
llama3.1-instruct.yaml | ||
llama3.1-reflective.yaml | ||
llama3.2-fcall.yaml | ||
llama3.2-quantized.yaml | ||
llava.yaml | ||
mathstral.yaml | ||
mistral-0.3.yaml | ||
moondream.yaml | ||
mudler.yaml | ||
noromaid.yaml | ||
openvino.yaml | ||
parler-tts.yaml | ||
phi-2-chat.yaml | ||
phi-2-orange.yaml | ||
phi-3-chat.yaml | ||
phi-3-vision.yaml | ||
phi-4-chat-fcall.yaml | ||
phi-4-chat.yaml | ||
piper.yaml | ||
qwen-fcall.yaml | ||
qwen3-openbuddy.yaml | ||
qwen3.yaml | ||
rerankers.yaml | ||
rwkv.yaml | ||
sd-ggml.yaml | ||
sentencetransformers.yaml | ||
smolvlm.yaml | ||
stablediffusion3.yaml | ||
tuluv2.yaml | ||
vicuna-chat.yaml | ||
virtual.yaml | ||
vllm.yaml | ||
whisper-base.yaml | ||
wizardlm2.yaml |