LocalAI/backend/python/exllama
Ludovic Leroux 12c0d9443e
feat: use tokenizer.apply_chat_template() in vLLM (#1990)
Use tokenizer.apply_chat_template() in vLLM

Signed-off-by: Ludovic LEROUX <ludovic@inpher.io>
2024-04-11 19:20:22 +02:00
..
backend_pb2.py feat: use tokenizer.apply_chat_template() in vLLM (#1990) 2024-04-11 19:20:22 +02:00
backend_pb2_grpc.py feat: use tokenizer.apply_chat_template() in vLLM (#1990) 2024-04-11 19:20:22 +02:00
exllama.py exllama(v2): fix exllamav1, add exllamav2 (#1384) 2023-12-05 08:15:37 +01:00
exllama.yml exllama(v2): fix exllamav1, add exllamav2 (#1384) 2023-12-05 08:15:37 +01:00
install.sh feat(intel): add diffusers/transformers support (#1746) 2024-03-07 14:37:45 +01:00
Makefile Bump vLLM version + more options when loading models in vLLM (#1782) 2024-03-01 22:48:53 +01:00
README.md refactor: move backends into the backends directory (#1279) 2023-11-13 22:40:16 +01:00
run.sh Bump vLLM version + more options when loading models in vLLM (#1782) 2024-03-01 22:48:53 +01:00

Creating a separate environment for the exllama project

make exllama