feat(speculative-sampling): allow to specify a draft model in the model config (#1052)

mirror of https://github.com/mudler/LocalAI.git synced 2025-05-22 03:24:59 +00:00

**Description**

This PR fixes #1013.

It adds `draft_model` and `n_draft` to the model YAML config in order to
load models with speculative sampling. This should be compatible as well
with grammars.

example:

```yaml
backend: llama                                                                                                                                                                   
context_size: 1024                                                                                                                                                                        
name: my-model-name
parameters:
  model: foo-bar
n_draft: 16                                                                                                                                                                      
draft_model: model-name
```

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

This commit is contained in:

Ettore Di Giacinto

2023-09-14 17:44:16 +02:00

• committed by

GitHub

parent 247d85b523

commit 8ccf5b2044

No known key found for this signature in database

GPG key ID: 4AEE18F83AFDEB23

12 changed files with 485 additions and 427 deletions

64

extra/grpc/diffusers/backend_pb2.py

View file

File diff suppressed because one or more lines are too long

Rows
Columns

feat(speculative-sampling): allow to specify a draft model in the model config (#1052)

64 extra/grpc/diffusers/backend_pb2.py View file

64

extra/grpc/diffusers/backend_pb2.py

View file