Default branch

master

f8fbfd4fa3 · chore(model gallery): add a-m-team_am-thinking-v1 (#5395) · Updated 2025-05-19 15:31:38 +00:00

Branches

5bf05cec1f · feat(llama.cpp): add reranking · Updated 2025-05-19 16:56:17 +00:00 · 1 behind, 1 ahead
cd4c0b8aa6 · wip · Updated 2025-05-14 20:57:56 +00:00 · dearwolf · 19 behind, 2 ahead
b652cbc3d2 · chore(deps): bump torch in /backend/python/exllama2 · Updated 2025-04-28 19:21:45 +00:00 · dearwolf · 112 behind, 1 ahead
e747d984b3 · chore(deps): bump torch in /backend/python/exllama2 in the pip group · Updated 2025-04-25 19:33:45 +00:00 · dearwolf · 134 behind, 1 ahead
8fea82e68b · wire to grpc · Updated 2025-04-19 18:22:31 +00:00 · dearwolf · 153 behind, 2 ahead
3826edb9da · chore(deps): bump llama.cpp to '10f2e81809bbb69ecfe64fc8b4686285f84b0c07' · Updated 2025-03-12 08:12:59 +00:00 · dearwolf · 342 behind, 1 ahead
99dde76c6c · chore(deps): Bump oneccl-bind-pt in /backend/python/diffusers · Updated 2025-03-11 07:30:14 +00:00 · dearwolf · 351 behind, 1 ahead
455aee4eaf · chore(model gallery): add qihoo360_tinyr1-32b-preview · Updated 2025-03-02 09:23:17 +00:00 · dearwolf · 390 behind, 1 ahead
d6ea1a67cf · Merge branch 'master' into ci/public-runner · Updated 2025-02-08 10:00:45 +00:00 · dearwolf · 503 behind, 4 ahead
27d7ada8dd · feat(l4t): add support for extras images · Updated 2025-02-06 10:53:07 +00:00 · dearwolf · 526 behind, 1 ahead
b16a01d0bd · WIP speculative · Updated 2025-01-24 09:17:54 +00:00 · dearwolf · 582 behind, 1 ahead
a1d5462ad0 · Stores to chromem (WIP) · Updated 2025-01-21 09:35:01 +00:00 · dearwolf · 605 behind, 1 ahead
f272605b95 · more robust approach · Updated 2025-01-14 16:13:58 +00:00 · dearwolf · 645 behind, 29 ahead
894a30296a · feat: unify and propagate CMAKE_ARGS to GGML-based backends · Updated 2024-12-11 21:02:58 +00:00 · dearwolf · 816 behind, 1 ahead
59cf30a80e · chore: ⬆️ Update ggerganov/llama.cpp to 26a8406ba9198eb6fdd8329fa717555b4f77f05f (#4358) · Updated 2024-12-10 14:51:45 +00:00 · dearwolf · 849 behind, 2 ahead
184fbc26bf · Revert "feat: include tokens usage for streamed output (#4282)" · Updated 2024-12-08 15:31:48 +00:00 · dearwolf · 849 behind, 1 ahead
a00bbfe3eb · chore(model): add silero-vad model config · Updated 2024-11-26 13:28:41 +00:00 · dearwolf · 889 behind, 1 ahead
cc11323d1c · fix(ci): install latest git · Updated 2024-10-24 12:55:24 +00:00 · dearwolf · 1099 behind, 1 ahead
83110891fd · fix(go-grpc-server): always close resultChan · Updated 2024-10-04 22:07:58 +00:00 · dearwolf · 1272 behind, 1 ahead
63c5d843b6 · chore(gosec): fix CI · Updated 2024-09-13 17:17:27 +00:00 · dearwolf · 1424 behind, 2 ahead