Paul Gauthier
ad0e5c4770
copy
2024-11-21 12:53:21 -08:00
Paul Gauthier
8448eff1eb
copy
2024-11-21 11:38:41 -08:00
Paul Gauthier
6c42ee4edf
copy
2024-11-21 10:53:28 -08:00
Paul Gauthier
3053595bfe
added qwen models
2024-11-21 10:52:10 -08:00
Paul Gauthier
1f0d26e8c7
better over time plot
2024-11-20 20:19:44 -08:00
Paul Gauthier
8302e9d0dd
improved over time plot
2024-11-20 20:16:35 -08:00
Paul Gauthier
9b5a703307
updated models-over-time
2024-11-20 19:40:59 -08:00
Paul Gauthier
1aaa3d9279
gpt-4o-2024-11-20
2024-11-20 15:31:47 -08:00
Paul Gauthier
18a88596a6
Added gpt-4o-2024-11-20
2024-11-20 11:33:16 -08:00
Paul Gauthier
e917424f5d
copy
2024-11-20 07:13:15 -08:00
Paul Gauthier
1f6a5d04d9
chore: Update edit leaderboard with new Mistral model performance data
2024-11-20 07:09:38 -08:00
Paul Gauthier
ab5a8b24a5
updated blame
2024-11-18 13:56:46 -08:00
Paul Gauthier
c538817b61
updated blame
2024-11-13 13:43:05 -08:00
Paul Gauthier
44063590e2
copy
2024-11-11 10:30:23 -08:00
Paul Gauthier
557f25bf80
copy
2024-11-11 10:30:00 -08:00
柏枫
bd9c43a48d
Add evaluation results for Qwen2.5-Coder-32B-Instruct using the diff format.
2024-11-12 01:23:19 +08:00
Paul Gauthier
be6e3254ea
copy
2024-11-11 08:45:42 -08:00
柏枫
c0b1101a52
Add evaluation results of Qwen2.5-Coder series.
2024-11-11 20:18:30 +08:00
Jaap Buurman
af0466ea83
Added Qwen2.5-7b-coder with the updated weights
...
The Qwen team still calls it Qwen2.5, but as can be seen from the
benchmarks the difference in performance compared to the old weights
is pretty substantial. The GGUF version of this model made by Bartowski
calls it 2.5.1 to differentiate it from the earlier version of the
same model.
2024-11-07 13:18:24 +01:00
Paul Gauthier
e601682706
updated blame
2024-11-04 13:00:42 -08:00
Paul Gauthier
e6d4c3558b
add 3.5 haiku to leaderboard
2024-11-04 11:40:37 -08:00
Paul Gauthier
6a0380b8c0
copy
2024-11-01 09:46:27 -07:00
Paul Gauthier
1efb0ba53e
added new sonnet and o1 models to refac leaderboard
2024-10-22 14:17:36 -07:00
Paul Gauthier
57642cf96c
copy
2024-10-22 12:21:25 -07:00
Paul Gauthier
c80a032297
copy
2024-10-22 10:54:34 -07:00
Paul Gauthier
bd28d8f3fb
corrected 1022 benchmark results
2024-10-22 10:52:35 -07:00
Paul Gauthier
cfcb6656cb
added claude-3-5-sonnet-20241022 benchmarks
2024-10-22 10:05:15 -07:00
Paul Gauthier
b7a884d81e
feat: add Llama-3.1-Nemotron-70B model to edit leaderboard
2024-10-16 10:14:42 -07:00
Paul Gauthier
163a29b026
copy
2024-10-14 11:01:00 -07:00
Paul Gauthier
bf45a14b30
feat: add Grok-2 and Grok-2-mini model results to leaderboard
2024-10-14 10:58:40 -07:00
Paul Gauthier
10ecb4b97f
copy
2024-10-05 14:38:01 -07:00
Paul Gauthier
c10442087b
copy
2024-10-04 15:40:43 -07:00
Paul Gauthier
ff230554ce
chore: Update edit leaderboard with latest model performance data
2024-10-04 09:35:28 -07:00
itlackey
d621d16255
- Added hermes3
2024-10-01 16:06:15 -05:00
itlackey
61bc130464
- Added yi-coder:9b-chat-fp16
2024-10-01 16:06:15 -05:00
itlackey
aa911a847d
- Added mistral-small
2024-10-01 16:06:15 -05:00
itlackey
268331d5c0
- Added hermes3:8b-llama3.1-fp16
2024-10-01 16:06:15 -05:00
itlackey
8d4dd5c9de
- Added llama3.2:3b-instruct-fp16
2024-10-01 16:06:15 -05:00
itlackey
fb34af4362
- Added qwen2.5:32b
2024-10-01 16:06:15 -05:00
itlackey
e381260932
- Added mistral-nemo:12b-instruct-2407-q4_K_M
2024-10-01 16:06:15 -05:00
itlackey
e2d5f15aff
- Added wojtek/opencodeinterpreter:6.7b
2024-10-01 16:06:15 -05:00
itlackey
d4cecf9fd0
- Added codegeex4
2024-10-01 16:06:15 -05:00
Paul Gauthier
485bfa2492
copy
2024-09-29 15:46:32 -07:00
Paul Gauthier
53ca83beea
copy
2024-09-29 08:31:13 -07:00
itlackey
228ae24834
added ollama/codestral benchmark
2024-09-28 16:08:49 -05:00
Paul Gauthier
cfbf943eb1
copy
2024-09-27 11:48:38 -07:00
Paul Gauthier
6c946006e8
rename
2024-09-26 17:27:04 -07:00
Paul Gauthier
eb21cf2830
architect/editor
2024-09-26 16:10:19 -07:00
Paul Gauthier
068ff01cee
restor
2024-09-26 11:34:02 -07:00
Paul Gauthier
b3e3a5a401
better
2024-09-26 11:21:35 -07:00