Paul Gauthier
|
445c73267a
|
Updated plot dimensions and axis labels for better visualization in benchmark over time.
|
2024-05-15 11:29:44 -07:00 |
|
Paul Gauthier
|
05201883bf
|
Merge branch 'main' into models-over-time
|
2024-05-15 10:58:43 -07:00 |
|
Paul Gauthier
|
46afd4e0e2
|
added (udiff) label
|
2024-05-15 09:50:47 -07:00 |
|
Paul Gauthier
|
6d2b9d6699
|
Added gpt-4-turbo-2024-04-09 (diff) to the leaderboard
|
2024-05-15 09:49:38 -07:00 |
|
Paul Gauthier
|
efc9e56b23
|
cleanup
|
2024-05-15 09:46:13 -07:00 |
|
Paul Gauthier
|
f36bcd9b73
|
Added gpt-4-turbo-2024-04-09 (udiff) to the leaderboard
|
2024-05-15 09:45:11 -07:00 |
|
Paul Gauthier
|
c0ccd2cb1f
|
added release dates
|
2024-05-15 09:44:18 -07:00 |
|
Paul Gauthier
|
72613f3b27
|
switch naming from openai/gpt-4o to gpt-4o
|
2024-05-15 06:25:30 -07:00 |
|
Paul Gauthier
|
ebeec04cae
|
Updated leaderboard commands to reflect 4o is default model now
|
2024-05-13 11:30:09 -07:00 |
|
Paul Gauthier
|
8b99429dfa
|
updated leaderboards
|
2024-05-13 10:58:06 -07:00 |
|
Paul Gauthier
|
b158e1c230
|
copy
|
2024-05-09 14:07:28 -07:00 |
|
Paul Gauthier
|
a3c9bd97e2
|
updated deepseek-chat yaml
|
2024-05-09 12:51:16 -07:00 |
|
Paul Gauthier
|
80a3f6d4f6
|
updated deepseek-chat yaml
|
2024-05-09 11:57:41 -07:00 |
|
Paul Gauthier
|
2269f56aed
|
updated gpt-0125 refac
|
2024-05-08 15:38:41 -07:00 |
|
Paul Gauthier
|
bf09bd348f
|
0125 with ex_as_sys
|
2024-05-08 15:14:37 -07:00 |
|
Paul Gauthier
|
4c6fd48b27
|
updated gpt-4-1106-preview leaderboards
|
2024-05-08 15:02:16 -07:00 |
|
Paul Gauthier
|
eaa2514981
|
copy
|
2024-05-08 14:18:25 -07:00 |
|
Paul Gauthier
|
87664dc254
|
added gpt-3.5-turbo results to leaderboard
|
2024-05-08 14:15:16 -07:00 |
|
Paul Gauthier
|
b0a512770b
|
updated docs to use deepseek/ prefix
|
2024-05-08 08:40:28 -07:00 |
|
Paul Gauthier
|
6fc6d5056d
|
copy
|
2024-05-07 16:28:05 -07:00 |
|
Paul Gauthier
|
85864e01bc
|
added WizardLM-2 8x22B to leaderboard
|
2024-05-07 14:37:41 -07:00 |
|
Paul Gauthier
|
bbe8639160
|
renamed qwen
|
2024-05-07 13:57:45 -07:00 |
|
Paul Gauthier
|
c29d4860ab
|
Added qwen1.5-110b-chat to leaderboard
|
2024-05-07 13:55:12 -07:00 |
|
Paul Gauthier
|
dc7e61f3c9
|
added deepseek-chat v2 (diff) to leaderboard
|
2024-05-07 09:38:46 -07:00 |
|
Paul Gauthier
|
ca0faf7cc7
|
Merge remote-tracking branch 'origin/main'
|
2024-05-07 06:27:20 -07:00 |
|
Paul Gauthier
|
ecca737803
|
added deepseek-chat v2
|
2024-05-07 06:26:39 -07:00 |
|
paul-gauthier
|
6fb4983d88
|
Update edit_leaderboard.yml
|
2024-05-06 20:42:23 -07:00 |
|
Paul Gauthier
|
4b903a3bb8
|
cleanup
|
2024-05-06 11:50:49 -07:00 |
|
Paul Gauthier
|
92ea428b82
|
cleaned up edit data
|
2024-05-06 11:48:10 -07:00 |
|
Paul Gauthier
|
17b5dbe804
|
moved edit results to yaml
|
2024-05-06 11:44:29 -07:00 |
|
Paul Gauthier
|
fc3a43ef41
|
completed moving refac to yml
|
2024-05-06 11:25:14 -07:00 |
|
Paul Gauthier
|
e58ce69154
|
move refac data to yml
|
2024-05-06 11:21:38 -07:00 |
|
Paul Gauthier
|
3bb237bdc1
|
handle tasks with exceptions in the stats output
|
2024-05-05 08:24:45 -07:00 |
|
Paul Gauthier
|
6f8c1cf780
|
gemini refac results
|
2024-05-05 08:02:07 -07:00 |
|
Paul Gauthier
|
ec07b6e556
|
updated refac
|
2024-05-04 11:11:34 -07:00 |
|
Paul Gauthier
|
e524dd9203
|
added refac leaderboard
|
2024-05-04 11:05:32 -07:00 |
|
Paul Gauthier
|
47fe0f7211
|
updated gpt-4-0314
|
2024-05-04 08:14:24 -07:00 |
|
Paul Gauthier
|
9b88f8caf6
|
updated gpt-4-0314
|
2024-05-04 07:59:27 -07:00 |
|
Paul Gauthier
|
c9dbca9d0e
|
gpt-4-0314 with examples as sys
|
2024-05-04 07:52:22 -07:00 |
|
Paul Gauthier
|
f6580fff76
|
updated all openai models
|
2024-05-04 07:38:50 -07:00 |
|
paul-gauthier
|
d8a18f2c67
|
Update leaderboard.csv
|
2024-05-03 20:29:29 -07:00 |
|
Paul Gauthier
|
052df34300
|
copy
|
2024-05-03 15:48:43 -07:00 |
|
Paul Gauthier
|
3266ccdfe6
|
sort data
|
2024-05-03 15:33:55 -07:00 |
|
Paul Gauthier
|
471d637694
|
updated llama3
|
2024-05-03 15:31:20 -07:00 |
|
Paul Gauthier
|
b476671399
|
copy
|
2024-05-03 15:08:38 -07:00 |
|
Paul Gauthier
|
cb42150bba
|
added leaderboard
|
2024-05-03 14:52:21 -07:00 |
|