LocalAI/core/http/endpoints/openai
Ettore Di Giacinto aae7ad9d73
feat(llama.cpp): guess model defaults from file (#2522)
* wip: guess informations from gguf file
* update go mod
* Small fixups
* Identify llama3
* Do not try to guess the name, as reading gguf files can be expensive
* Allow to disable guessing

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2024-06-08 22:13:02 +02:00
assistant.go Fix cleanup sonarqube findings (#2106) 2024-04-23 18:43:00 +02:00
assistant_test.go fix: reduce chmod permissions for created files and directories (#2137) 2024-04-26 00:47:06 +02:00
chat.go fix(stream): do not break channel consumption (#2517) 2024-06-07 17:20:42 +02:00
completion.go models(gallery): add Codestral (#2442) 2024-05-30 18:50:26 +02:00
edit.go Revert #1963 (#2056) 2024-04-17 23:33:49 +02:00
embeddings.go Revert #1963 (#2056) 2024-04-17 23:33:49 +02:00
files.go feat(assistant): Assistant and AssistantFiles api (#1803) 2024-03-26 18:54:35 +01:00
files_test.go fix: reduce chmod permissions for created files and directories (#2137) 2024-04-26 00:47:06 +02:00
image.go feat(image): support response_type in the OpenAI API request (#2347) 2024-05-29 14:40:54 +02:00
inference.go Revert #1963 (#2056) 2024-04-17 23:33:49 +02:00
list.go refactor(application): introduce application global state (#2072) 2024-04-29 17:42:37 +00:00
request.go feat(llama.cpp): guess model defaults from file (#2522) 2024-06-08 22:13:02 +02:00
transcription.go Revert #1963 (#2056) 2024-04-17 23:33:49 +02:00