Commit graph

46 commits

Author SHA1 Message Date
Paul Gauthier
445c73267a Updated plot dimensions and axis labels for better visualization in benchmark over time. 2024-05-15 11:29:44 -07:00
Paul Gauthier
05201883bf Merge branch 'main' into models-over-time 2024-05-15 10:58:43 -07:00
Paul Gauthier
46afd4e0e2 added (udiff) label 2024-05-15 09:50:47 -07:00
Paul Gauthier
6d2b9d6699 Added gpt-4-turbo-2024-04-09 (diff) to the leaderboard 2024-05-15 09:49:38 -07:00
Paul Gauthier
efc9e56b23 cleanup 2024-05-15 09:46:13 -07:00
Paul Gauthier
f36bcd9b73 Added gpt-4-turbo-2024-04-09 (udiff) to the leaderboard 2024-05-15 09:45:11 -07:00
Paul Gauthier
c0ccd2cb1f added release dates 2024-05-15 09:44:18 -07:00
Paul Gauthier
72613f3b27 switch naming from openai/gpt-4o to gpt-4o 2024-05-15 06:25:30 -07:00
Paul Gauthier
ebeec04cae Updated leaderboard commands to reflect 4o is default model now 2024-05-13 11:30:09 -07:00
Paul Gauthier
8b99429dfa updated leaderboards 2024-05-13 10:58:06 -07:00
Paul Gauthier
b158e1c230 copy 2024-05-09 14:07:28 -07:00
Paul Gauthier
a3c9bd97e2 updated deepseek-chat yaml 2024-05-09 12:51:16 -07:00
Paul Gauthier
80a3f6d4f6 updated deepseek-chat yaml 2024-05-09 11:57:41 -07:00
Paul Gauthier
2269f56aed updated gpt-0125 refac 2024-05-08 15:38:41 -07:00
Paul Gauthier
bf09bd348f 0125 with ex_as_sys 2024-05-08 15:14:37 -07:00
Paul Gauthier
4c6fd48b27 updated gpt-4-1106-preview leaderboards 2024-05-08 15:02:16 -07:00
Paul Gauthier
eaa2514981 copy 2024-05-08 14:18:25 -07:00
Paul Gauthier
87664dc254 added gpt-3.5-turbo results to leaderboard 2024-05-08 14:15:16 -07:00
Paul Gauthier
b0a512770b updated docs to use deepseek/ prefix 2024-05-08 08:40:28 -07:00
Paul Gauthier
6fc6d5056d copy 2024-05-07 16:28:05 -07:00
Paul Gauthier
85864e01bc added WizardLM-2 8x22B to leaderboard 2024-05-07 14:37:41 -07:00
Paul Gauthier
bbe8639160 renamed qwen 2024-05-07 13:57:45 -07:00
Paul Gauthier
c29d4860ab Added qwen1.5-110b-chat to leaderboard 2024-05-07 13:55:12 -07:00
Paul Gauthier
dc7e61f3c9 added deepseek-chat v2 (diff) to leaderboard 2024-05-07 09:38:46 -07:00
Paul Gauthier
ca0faf7cc7 Merge remote-tracking branch 'origin/main' 2024-05-07 06:27:20 -07:00
Paul Gauthier
ecca737803 added deepseek-chat v2 2024-05-07 06:26:39 -07:00
paul-gauthier
6fb4983d88 Update edit_leaderboard.yml 2024-05-06 20:42:23 -07:00
Paul Gauthier
4b903a3bb8 cleanup 2024-05-06 11:50:49 -07:00
Paul Gauthier
92ea428b82 cleaned up edit data 2024-05-06 11:48:10 -07:00
Paul Gauthier
17b5dbe804 moved edit results to yaml 2024-05-06 11:44:29 -07:00
Paul Gauthier
fc3a43ef41 completed moving refac to yml 2024-05-06 11:25:14 -07:00
Paul Gauthier
e58ce69154 move refac data to yml 2024-05-06 11:21:38 -07:00
Paul Gauthier
3bb237bdc1 handle tasks with exceptions in the stats output 2024-05-05 08:24:45 -07:00
Paul Gauthier
6f8c1cf780 gemini refac results 2024-05-05 08:02:07 -07:00
Paul Gauthier
ec07b6e556 updated refac 2024-05-04 11:11:34 -07:00
Paul Gauthier
e524dd9203 added refac leaderboard 2024-05-04 11:05:32 -07:00
Paul Gauthier
47fe0f7211 updated gpt-4-0314 2024-05-04 08:14:24 -07:00
Paul Gauthier
9b88f8caf6 updated gpt-4-0314 2024-05-04 07:59:27 -07:00
Paul Gauthier
c9dbca9d0e gpt-4-0314 with examples as sys 2024-05-04 07:52:22 -07:00
Paul Gauthier
f6580fff76 updated all openai models 2024-05-04 07:38:50 -07:00
paul-gauthier
d8a18f2c67 Update leaderboard.csv 2024-05-03 20:29:29 -07:00
Paul Gauthier
052df34300 copy 2024-05-03 15:48:43 -07:00
Paul Gauthier
3266ccdfe6 sort data 2024-05-03 15:33:55 -07:00
Paul Gauthier
471d637694 updated llama3 2024-05-03 15:31:20 -07:00
Paul Gauthier
b476671399 copy 2024-05-03 15:08:38 -07:00
Paul Gauthier
cb42150bba added leaderboard 2024-05-03 14:52:21 -07:00