Commit graph

  • 6984749ea1
    chore(model gallery): add kalomaze_qwen3-16b-a3b (#5312) Ettore Di Giacinto 2025-05-04 09:39:38 +02:00
  • f3d532697e chore(model gallery): add kalomaze_qwen3-16b-a3b Ettore Di Giacinto 2025-05-04 09:38:33 +02:00
  • c0a206bc7a
    chore(model gallery): add qwen3-30b-a1.5b-high-speed (#5311) Ettore Di Giacinto 2025-05-04 09:38:01 +02:00
  • 4d2e485679 chore(model gallery): add qwen3-30b-a1.5b-high-speed Ettore Di Giacinto 2025-05-04 09:37:08 +02:00
  • 01bbb31fb3
    chore: ⬆️ Update ggml-org/llama.cpp to 36667c8edcded08063ed51c7d57e9e086bbfc903 (#5300) LocalAI [bot] 2025-05-04 09:23:01 +02:00
  • 83ecafced0 ⬆️ Update ggml-org/llama.cpp mudler 2025-05-03 20:06:47 +00:00
  • b77507ab67 simplify golang deps Ettore Di Giacinto 2025-05-03 19:47:24 +02:00
  • 8d0419d3d1 fix(embed): use go-rice for large backend assets Ettore Di Giacinto 2025-05-03 11:16:00 +02:00
  • 72111c597d
    fix(gpu): do not assume gpu being returned has node and mem (#5310) Ettore Di Giacinto 2025-05-03 19:00:24 +02:00
  • a1f1d84556 fix(gpu): do not assume gpu being returned has node and mem Ettore Di Giacinto 2025-05-03 18:59:02 +02:00
  • b2f9fc870b
    chore(defaults): enlarge defaults, drop gpu layers which is infered (#5308) Ettore Di Giacinto 2025-05-03 18:44:51 +02:00
  • 1fc6d469ac
    chore(deps): bump llama.cpp to '1d36b3670b285e69e58b9d687c770a2a0a192194 (#5307) Ettore Di Giacinto 2025-05-03 18:44:40 +02:00
  • 2b987a741a chore(deps): bump llama.cpp to '1d36b3670b285e69e58b9d687c770a2a0a192194' Ettore Di Giacinto 2025-05-03 10:29:16 +02:00
  • 05848b2027
    chore(model gallery): add smoothie-qwen3-8b (#5306) Ettore Di Giacinto 2025-05-03 10:35:20 +02:00
  • 6636badda7 chore(defaults): enlarge defaults, drop gpu layers which is infered Ettore Di Giacinto 2025-05-03 10:35:06 +02:00
  • 39fe259fe6 chore(model gallery): add smoothie-qwen3-8b Ettore Di Giacinto 2025-05-03 10:24:43 +02:00
  • 1da0644aa3
    chore(model gallery): add qwen-3-32b-medical-reasoning-i1 (#5305) Ettore Di Giacinto 2025-05-03 10:24:07 +02:00
  • d2c15eeac2 chore(model gallery): add qwen-3-32b-medical-reasoning-i1 Ettore Di Giacinto 2025-05-03 10:22:53 +02:00
  • c087cd1377
    chore(model gallery): add amoral-qwen3-14b (#5304) Ettore Di Giacinto 2025-05-03 10:21:48 +02:00
  • 8f6fe8ca54 chore(model gallery): add amoral-qwen3-14b Ettore Di Giacinto 2025-05-03 10:20:43 +02:00
  • c621412f6a
    chore(model gallery): add comet_12b_v.5-i1 (#5303) Ettore Di Giacinto 2025-05-03 10:20:03 +02:00
  • ea96f9ad8b chore(model gallery): add comet_12b_v.5-i1 Ettore Di Giacinto 2025-05-03 10:19:09 +02:00
  • 5a8b1892cd
    chore(model gallery): add genericrpv3-4b (#5302) Ettore Di Giacinto 2025-05-03 10:18:31 +02:00
  • 07f0764dfa chore(model gallery): add genericrpv3-4b Ettore Di Giacinto 2025-05-03 10:15:25 +02:00
  • 5b20426863
    chore(model gallery): add planetoid_27b_v.2 (#5301) Ettore Di Giacinto 2025-05-03 10:14:33 +02:00
  • 0c5f664fc5 chore(model gallery): add planetoid_27b_v.2 Ettore Di Giacinto 2025-05-03 10:13:19 +02:00
  • 5c6cd50ed6
    feat(llama.cpp): estimate vram usage (#5299) Ettore Di Giacinto 2025-05-02 17:40:26 +02:00
  • 7f654fece7 feat(llama.cpp): estimate vram usage Ettore Di Giacinto 2025-05-02 10:23:30 +02:00
  • bace6516f1
    chore(model gallery): add webthinker-qwq-32b-i1 (#5298) Ettore Di Giacinto 2025-05-02 09:57:49 +02:00
  • f97b238b2c chore(model gallery): add webthinker-qwq-32b-i1 Ettore Di Giacinto 2025-05-02 09:53:47 +02:00
  • 3baadf6f27
    chore(model gallery): add shuttleai_shuttle-3.5 (#5297) Ettore Di Giacinto 2025-05-02 09:48:11 +02:00
  • 971a4e6423 chore(model gallery): add shuttleai_shuttle-3.5 Ettore Di Giacinto 2025-05-02 09:47:00 +02:00
  • 8804c701b8
    chore(model gallery): add microsoft_phi-4-reasoning (#5296) Ettore Di Giacinto 2025-05-02 09:46:20 +02:00
  • a43851c68b chore(model gallery): add microsoft_phi-4-reasoning Ettore Di Giacinto 2025-05-02 09:44:05 +02:00
  • 7b3ceb19bb
    chore(model gallery): add microsoft_phi-4-reasoning-plus (#5295) Ettore Di Giacinto 2025-05-02 09:43:38 +02:00
  • addd5c17af chore(model gallery): add microsoft_phi-4-reasoning-plus Ettore Di Giacinto 2025-05-02 09:39:56 +02:00
  • e7f3effea1
    chore(model gallery): add furina-8b (#5294) Ettore Di Giacinto 2025-05-02 09:39:22 +02:00
  • 05952d3755 chore(model gallery): add furina-8b Ettore Di Giacinto 2025-05-02 09:37:40 +02:00
  • 61694a2ffb
    chore(model gallery): add josiefied-qwen3-8b-abliterated-v1 (#5293) Ettore Di Giacinto 2025-05-02 09:36:35 +02:00
  • 205afac1b9 chore(model gallery): add josiefied-qwen3-8b-abliterated-v1 Ettore Di Giacinto 2025-05-02 09:34:56 +02:00
  • 573a3f104c
    chore: ⬆️ Update ggml-org/llama.cpp to d7a14c42a1883a34a6553cbfe30da1e1b84dfd6a (#5292) LocalAI [bot] 2025-05-02 09:21:38 +02:00
  • 0e8af53a5b chore: update quickstart Ettore Di Giacinto 2025-05-01 22:36:33 +02:00
  • 52adcede54 ⬆️ Update ggml-org/llama.cpp mudler 2025-05-01 20:07:37 +00:00
  • 960ffa808c
    chore(model gallery): add microsoft_phi-4-mini-reasoning (#5288) Ettore Di Giacinto 2025-05-01 10:17:58 +02:00
  • cdffc55788 chore(model gallery): add microsoft_phi-4-mini-reasoning Ettore Di Giacinto 2025-05-01 10:15:52 +02:00
  • 92719568e5
    chore(model gallery): add fast-math-qwen3-14b (#5287) Ettore Di Giacinto 2025-05-01 10:14:51 +02:00
  • d046c67bd4 chore(model gallery): add fast-math-qwen3-14b Ettore Di Giacinto 2025-05-01 10:13:37 +02:00
  • 163939af71
    chore(model gallery): add qwen3-8b-jailbroken (#5286) Ettore Di Giacinto 2025-05-01 10:13:01 +02:00
  • 797dc5d499 chore(model gallery): add qwen3-8b-jailbroken Ettore Di Giacinto 2025-05-01 10:12:06 +02:00
  • 399f1241dc
    chore(model gallery): add qwen3-30b-a3b-abliterated (#5285) Ettore Di Giacinto 2025-05-01 10:07:42 +02:00
  • b0e8891921 chore(model gallery): add qwen3-30b-a3b-abliterated Ettore Di Giacinto 2025-05-01 10:04:22 +02:00
  • 58c9ade2e8
    chore: ⬆️ Update ggml-org/llama.cpp to 3e168bede4d27b35656ab8026015b87659ecbec2 (#5284) LocalAI [bot] 2025-05-01 10:01:39 +02:00
  • 6e1c93d84f
    fix(ci): comment out vllm tests Ettore Di Giacinto 2025-05-01 10:01:22 +02:00
  • 6d61663b6d ⬆️ Update ggml-org/llama.cpp mudler 2025-04-30 20:07:26 +00:00
  • 4076ea0494
    fix: vllm missing logprobs (#5279) Wyatt Neal 2025-04-30 08:55:07 -04:00
  • 9994ab25bc
    Merge branch 'master' into fix-vllm-logprobs-others Wyatt Neal 2025-04-30 06:31:14 -04:00
  • 26cbf77c0d
    chore(model gallery): add mlabonne_qwen3-4b-abliterated (#5283) Ettore Di Giacinto 2025-04-30 11:09:58 +02:00
  • 1112dd521f chore(model gallery): add mlabonne_qwen3-4b-abliterated Ettore Di Giacinto 2025-04-30 11:08:58 +02:00
  • 640790d628
    chore(model gallery): add mlabonne_qwen3-8b-abliterated (#5282) Ettore Di Giacinto 2025-04-30 11:08:26 +02:00
  • 1d6a2a484b chore(model gallery): add mlabonne_qwen3-8b-abliterated Ettore Di Giacinto 2025-04-30 11:07:01 +02:00
  • 4132adea2f
    chore(model gallery): add mlabonne_qwen3-14b-abliterated (#5281) Ettore Di Giacinto 2025-04-30 11:04:49 +02:00
  • 6e5cceeb65 chore(model gallery): add mlabonne_qwen3-14b-abliterated Ettore Di Giacinto 2025-04-30 11:00:14 +02:00
  • a15deb1353
    removing todo block, test via pipeline Wyatt Neal 2025-04-29 21:07:04 -04:00
  • 3efdec8e8b
    adding in tests to pipeline for execution Wyatt Neal 2025-04-29 21:04:26 -04:00
  • 05d08edda7 adding in vllm tests to test-extras Wyatt Neal 2025-04-29 18:30:42 -04:00
  • 1569bc4959 working to address missing items Wyatt Neal 2025-04-29 15:31:36 -04:00
  • 2b2d907a3a
    chore: ⬆️ Update ggml-org/llama.cpp to e2e1ddb93a01ce282e304431b37e60b3cddb6114 (#5278) LocalAI [bot] 2025-04-29 23:46:08 +02:00
  • fbaa16bb6d ⬆️ Update ggml-org/llama.cpp mudler 2025-04-29 20:07:44 +00:00
  • 6e8f4f584b
    fix(diffusers): consider options only in form of key/value (#5277) Ettore Di Giacinto 2025-04-29 17:08:55 +02:00
  • 662cfc2b48
    fix(aio): Fix copypasta in download files for gpt-4 model (#5276) Richard Palethorpe 2025-04-29 16:08:16 +01:00
  • 930b374d05 fix(diffusers): consider options only in form of key/value Ettore Di Giacinto 2025-04-29 17:03:46 +02:00
  • 98723bd5e8 fix(aio): Fix copypasta in download files for gpt-4 model Richard Palethorpe 2025-04-29 13:41:32 +01:00
  • a25d355d66
    chore(model gallery): add qwen3-0.6b (#5275) Ettore Di Giacinto 2025-04-29 10:10:16 +02:00
  • 5c7a1d9f5d chore(model gallery): add qwen3-0.6b Ettore Di Giacinto 2025-04-29 10:06:46 +02:00
  • 6d1cfdbefc
    chore(model gallery): add qwen3-1.7b (#5274) Ettore Di Giacinto 2025-04-29 10:06:03 +02:00
  • 09f182811b chore(model gallery): add qwen3-1.7b Ettore Di Giacinto 2025-04-29 10:01:52 +02:00
  • 5ecc478968
    chore(model gallery): add qwen3-4b (#5273) Ettore Di Giacinto 2025-04-29 10:01:22 +02:00
  • 66bba5672d chore(model gallery): add qwen3-4b Ettore Di Giacinto 2025-04-29 10:00:07 +02:00
  • aef5c4291b
    chore(model gallery): add qwen3-8b (#5272) Ettore Di Giacinto 2025-04-29 09:59:17 +02:00
  • 14c8f648f2 chore(model gallery): add qwen3-8b Ettore Di Giacinto 2025-04-29 09:57:33 +02:00
  • c059f912b9
    chore(model gallery): add qwen3-14b (#5271) Ettore Di Giacinto 2025-04-29 09:56:50 +02:00
  • 058f606430 chore(model gallery): add qwen3-14b Ettore Di Giacinto 2025-04-29 09:52:26 +02:00
  • bc1e059259
    chore: ⬆️ Update ggml-org/llama.cpp to 5f5e39e1ba5dbea814e41f2a15e035d749a520bc (#5267) LocalAI [bot] 2025-04-29 09:49:42 +02:00
  • 38dc07793a
    chore(model-gallery): ⬆️ update checksum (#5268) LocalAI [bot] 2025-04-29 09:49:23 +02:00
  • da6ef0967d
    chore(model gallery): add qwen3-32b (#5270) Ettore Di Giacinto 2025-04-29 09:48:28 +02:00
  • 5024177ba9 chore(model gallery): add qwen3-32b Ettore Di Giacinto 2025-04-29 09:46:00 +02:00
  • 7a011e60bd
    chore(model gallery): add qwen3-30b-a3b (#5269) Ettore Di Giacinto 2025-04-29 09:44:44 +02:00
  • 09bb397fe1 chore(model gallery): add qwen3-30b-a3b Ettore Di Giacinto 2025-04-29 09:43:13 +02:00
  • e13dd5b09f
    chore(deps): bump appleboy/scp-action from 0.1.7 to 1.0.0 (#5265) dependabot[bot] 2025-04-28 22:36:30 +00:00
  • 544b9433ef ⬆️ Checksum updates in gallery/index.yaml mudler 2025-04-28 20:29:46 +00:00
  • d4e95ede56 ⬆️ Update ggml-org/llama.cpp mudler 2025-04-28 20:07:09 +00:00
  • b652cbc3d2
    chore(deps): bump torch in /backend/python/exllama2 dependabot/pip/backend/python/exllama2/torch-2.7.0cu118 dependabot[bot] 2025-04-28 19:21:45 +00:00
  • 5a93aa892a
    chore(deps): bump appleboy/scp-action from 0.1.7 to 1.0.0 dependabot[bot] 2025-04-28 19:12:55 +00:00
  • 86ee303bd6
    chore(model gallery): add nvidia_openmath-nemotron-14b-kaggle (#5264) Ettore Di Giacinto 2025-04-28 19:52:36 +02:00
  • 5e9217703d chore(model gallery): add nvidia_openmath-nemotron-14b-kaggle Ettore Di Giacinto 2025-04-28 19:44:34 +02:00
  • 978ee96fd3
    chore(model gallery): add nvidia_openmath-nemotron-14b (#5263) Ettore Di Giacinto 2025-04-28 19:43:49 +02:00
  • 4c92107ca3 chore(model gallery): add nvidia_openmath-nemotron-14b Ettore Di Giacinto 2025-04-28 19:42:37 +02:00
  • 3ad5691db6
    chore(model gallery): add nvidia_openmath-nemotron-7b (#5262) Ettore Di Giacinto 2025-04-28 19:41:59 +02:00
  • e0f1f6adae chore(model gallery): add nvidia_openmath-nemotron-7b Ettore Di Giacinto 2025-04-28 19:40:46 +02:00
  • 0027681090
    chore(model gallery): add nvidia_openmath-nemotron-1.5b (#5261) Ettore Di Giacinto 2025-04-28 19:40:09 +02:00