LocalAI/docs/content/docs/reference/compatibility-table.md at 96c080cc6431536e22122bf0352d767bb0669aa4

mirror of https://github.com/mudler/LocalAI.git synced 2025-05-20 10:35:01 +00:00

Gianluca Boiano 032a33de49

chore: remove deprecated tinydream backend (#4631 )

Signed-off-by: Gianluca Boiano <morf3089@gmail.com>

2025-01-18 18:35:30 +01:00

6.4 KiB

Raw Blame History

+++ disableToc = false title = "Model compatibility table" weight = 24 url = "/model-compatibility/" +++

Besides llama based models, LocalAI is compatible also with other architectures. The table below lists all the backends, compatible models families and the associated repository.

LocalAI will attempt to automatically load models which are not explicitly configured for a specific backend. You can specify the backend to use by configuring a model with a YAML file. See [the advanced section]({{%relref "docs/advanced" %}}) for more details.

Backend and Bindings	Compatible models	Completion/Chat endpoint	Capability	Embeddings support	Token stream support	Acceleration
[llama.cpp]({{%relref "docs/features/text-generation#llama.cpp" %}})	LLama, Mamba, RWKV, Falcon, Starcoder, GPT-2, and many others	yes	GPT and Functions	yes	yes	CUDA, openCL, cuBLAS, Metal
llama.cpp's ggml model (backward compatibility with old format, before GGUF) (binding)	LLama, GPT-2, and many others	yes	GPT and Functions	yes	yes	CUDA, openCL, cuBLAS, Metal
whisper	whisper	no	Audio	no	no	N/A
stablediffusion (binding)	stablediffusion	no	Image	no	no	N/A
langchain-huggingface	Any text generators available on HuggingFace through API	yes	GPT	no	no	N/A
piper (binding)	Any piper onnx model	no	Text to voice	no	no	N/A
sentencetransformers	BERT	no	Embeddings only	yes	no	N/A
`bark`	bark	no	Audio generation	no	no	yes
`autogptq`	GPTQ	yes	GPT	yes	no	N/A
`exllama`	GPTQ	yes	GPT only	no	no	N/A
`diffusers`	SD,...	no	Image generation	no	no	N/A
`vall-e-x`	Vall-E	no	Audio generation and Voice cloning	no	no	CPU/CUDA
`vllm`	Various GPTs and quantization formats	yes	GPT	no	no	CPU/CUDA
`mamba`	Mamba models architecture	yes	GPT	no	no	CPU/CUDA
`exllama2`	GPTQ	yes	GPT only	no	no	N/A
`transformers-musicgen`		no	Audio generation	no	no	N/A
stablediffusion	no	Image	no	no	N/A
`coqui`	Coqui	no	Audio generation and Voice cloning	no	no	CPU/CUDA
`openvoice`	Open voice	no	Audio generation and Voice cloning	no	no	CPU/CUDA
`parler-tts`	Open voice	no	Audio generation and Voice cloning	no	no	CPU/CUDA
rerankers	Reranking API	no	Reranking	no	no	CPU/CUDA
`transformers`	Various GPTs and quantization formats	yes	GPT, embeddings	yes	yes*	CPU/CUDA/XPU
bark-cpp	bark	no	Audio-Only	no	no	yes
stablediffusion-cpp	stablediffusion-1, stablediffusion-2, stablediffusion-3, flux, PhotoMaker	no	Image	no	no	N/A
silero-vad with Golang bindings	Silero VAD	no	Voice Activity Detection	no	no	CPU

Note: any backend name listed above can be used in the backend field of the model configuration file (See [the advanced section]({{%relref "docs/advanced" %}})).

* Only for CUDA and OpenVINO CPU/XPU acceleration.

6.4 KiB Raw Blame History

6.4 KiB

Raw Blame History