feat: allow to run parallel requests (#1290)

* feat: allow to run parallel requests

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* fixup

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-05-28 14:35:00 +00:00 · 2023-11-16 08:20:05 +01:00 · 2023-11-16 08:20:05 +01:00 · fdd95d1d86
commit fdd95d1d86
parent 66a558ff41
9 changed files with 91 additions and 44 deletions
--- a/.env
+++ b/.env
@@ -69,4 +69,7 @@ MODELS_PATH=/models
 # PYTHON_GRPC_MAX_WORKERS=1

 ### Define the number of parallel LLAMA.cpp workers (Defaults to 1)
-# LLAMACPP_PARALLEL=1
+# LLAMACPP_PARALLEL=1
+
+### Enable to run parallel requests
+# PARALLEL_REQUESTS=true