mirror of
https://github.com/mudler/LocalAI.git
synced 2025-06-03 09:24:59 +00:00
chore(model-gallery): add more quants for popular models (#3365)
* models(gallery): add higher quants for some llama and hermes Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * models(gallery): vllm: specify a reasonable max_tokens Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
This commit is contained in:
parent
ac5f6f210b
commit
84d6e5a987
3 changed files with 53 additions and 0 deletions
|
@ -2,6 +2,9 @@
|
|||
name: "vllm"
|
||||
|
||||
config_file: |
|
||||
context_size: 8192
|
||||
parameters:
|
||||
max_tokens: 8192
|
||||
backend: vllm
|
||||
function:
|
||||
disable_no_action: true
|
||||
|
|
Loading…
Add table
Add a link
Reference in a new issue