feat: fix CUDA images and update go-llama to use full GPU offloading (#618)

Signed-off-by: mudler <mudler@localai.io>
Co-authored-by: mudler <mudler@localai.io>
Ettore Di Giacinto 2023-06-18 08:27:29 +02:00 committed by GitHub
parent 6c94f3cd67
commit d3d3187e51
GPG key ID: 4AEE18F83AFDEB23
5 changed files with 11 additions and 47 deletions

@@ -41,4 +41,4 @@ jobs:
 - name: Test
 run: |
-make test
+CMAKE_ARGS="-DLLAMA_F16C=OFF -DLLAMA_AVX512=OFF -DLLAMA_AVX2=OFF -DLLAMA_FMA=OFF" make test