mirror of
https://github.com/mudler/LocalAI.git
synced 2025-05-28 22:44:59 +00:00
models(gallery): add llama-3.1-nemotron-70b-instruct-hf (#3854)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
This commit is contained in:
parent
cdcfb2617c
commit
92cd538829
1 changed files with 22 additions and 0 deletions
|
@ -1672,6 +1672,28 @@
|
||||||
- filename: Mahou-1.5-llama3.1-70B.i1-Q4_K_M.gguf
|
- filename: Mahou-1.5-llama3.1-70B.i1-Q4_K_M.gguf
|
||||||
sha256: c2711c4c9c8d011edbeaa391b4418d433e273a318d1de3dbdda9b85baf4996f2
|
sha256: c2711c4c9c8d011edbeaa391b4418d433e273a318d1de3dbdda9b85baf4996f2
|
||||||
uri: huggingface://mradermacher/Mahou-1.5-llama3.1-70B-i1-GGUF/Mahou-1.5-llama3.1-70B.i1-Q4_K_M.gguf
|
uri: huggingface://mradermacher/Mahou-1.5-llama3.1-70B-i1-GGUF/Mahou-1.5-llama3.1-70B.i1-Q4_K_M.gguf
|
||||||
|
- !!merge <<: *llama31
|
||||||
|
name: "llama-3.1-nemotron-70b-instruct-hf"
|
||||||
|
urls:
|
||||||
|
- https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
|
||||||
|
- https://huggingface.co/mradermacher/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF
|
||||||
|
description: |
|
||||||
|
Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA to improve the helpfulness of LLM generated responses to user queries.
|
||||||
|
|
||||||
|
This model reaches Arena Hard of 85.0, AlpacaEval 2 LC of 57.6 and GPT-4-Turbo MT-Bench of 8.98, which are known to be predictive of LMSys Chatbot Arena Elo
|
||||||
|
|
||||||
|
As of 1 Oct 2024, this model is #1 on all three automatic alignment benchmarks (verified tab for AlpacaEval 2 LC), edging out strong frontier models such as GPT-4o and Claude 3.5 Sonnet.
|
||||||
|
|
||||||
|
This model was trained using RLHF (specifically, REINFORCE), Llama-3.1-Nemotron-70B-Reward and HelpSteer2-Preference prompts on a Llama-3.1-70B-Instruct model as the initial policy.
|
||||||
|
|
||||||
|
Llama-3.1-Nemotron-70B-Instruct-HF has been converted from Llama-3.1-Nemotron-70B-Instruct to support it in the HuggingFace Transformers codebase. Please note that evaluation results might be slightly different from the Llama-3.1-Nemotron-70B-Instruct as evaluated in NeMo-Aligner, which the evaluation results below are based on.
|
||||||
|
overrides:
|
||||||
|
parameters:
|
||||||
|
model: Llama-3.1-Nemotron-70B-Instruct-HF.Q4_K_M.gguf
|
||||||
|
files:
|
||||||
|
- filename: Llama-3.1-Nemotron-70B-Instruct-HF.Q4_K_M.gguf
|
||||||
|
sha256: b6b80001b849e3c59c39b09508c018b35b491a5c7bbafafa23f2fc04243f3e30
|
||||||
|
uri: huggingface://mradermacher/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF/Llama-3.1-Nemotron-70B-Instruct-HF.Q4_K_M.gguf
|
||||||
- &deepseek
|
- &deepseek
|
||||||
## Deepseek
|
## Deepseek
|
||||||
url: "github:mudler/LocalAI/gallery/deepseek.yaml@master"
|
url: "github:mudler/LocalAI/gallery/deepseek.yaml@master"
|
||||||
|
|
Loading…
Add table
Add a link
Reference in a new issue