LocalAI/core/http/endpoints/openai
Ettore Di Giacinto c89271b2e4
feat(llama.cpp): add distributed llama.cpp inferencing (#2324)
* feat(llama.cpp): support distributed llama.cpp

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* feat: allow tweaking how chat messages are merged together

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* refactor

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Makefile: register to ALL_GRPC_BACKENDS

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* refactoring, allow disabling auto-detection of backends

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* minor fixups

Signed-off-by: mudler <mudler@localai.io>

* feat: add cmd to start rpc-server from llama.cpp

Signed-off-by: mudler <mudler@localai.io>

* ci: add ccache

Signed-off-by: mudler <mudler@localai.io>

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Signed-off-by: mudler <mudler@localai.io>
2024-05-15 01:17:02 +02:00
assistant.go Fix cleanup sonarqube findings (#2106) 2024-04-23 18:43:00 +02:00
assistant_test.go fix: reduce chmod permissions for created files and directories (#2137) 2024-04-26 00:47:06 +02:00
chat.go feat(llama.cpp): add distributed llama.cpp inferencing (#2324) 2024-05-15 01:17:02 +02:00
completion.go feat(functions): support models with no grammar, add tests (#2068) 2024-04-18 22:43:12 +02:00
edit.go Revert #1963 (#2056) 2024-04-17 23:33:49 +02:00
embeddings.go Revert #1963 (#2056) 2024-04-17 23:33:49 +02:00
files.go feat(assistant): Assistant and AssistantFiles api (#1803) 2024-03-26 18:54:35 +01:00
files_test.go fix: reduce chmod permissions for created files and directories (#2137) 2024-04-26 00:47:06 +02:00
image.go Revert #1963 (#2056) 2024-04-17 23:33:49 +02:00
inference.go Revert #1963 (#2056) 2024-04-17 23:33:49 +02:00
list.go refactor(application): introduce application global state (#2072) 2024-04-29 17:42:37 +00:00
request.go feat(ui): prompt for chat, support vision, enhancements (#2259) 2024-05-08 00:42:34 +02:00
transcription.go Revert #1963 (#2056) 2024-04-17 23:33:49 +02:00