Commit graph

27 commits

Author SHA1 Message Date
Paul Gauthier
445c73267a Updated plot dimensions and axis labels for better visualization in benchmark over time. 2024-05-15 11:29:44 -07:00
Paul Gauthier
05201883bf Merge branch 'main' into models-over-time 2024-05-15 10:58:43 -07:00
Paul Gauthier
46afd4e0e2 added (udiff) label 2024-05-15 09:50:47 -07:00
Paul Gauthier
6d2b9d6699 Added gpt-4-turbo-2024-04-09 (diff) to the leaderboard 2024-05-15 09:49:38 -07:00
Paul Gauthier
efc9e56b23 cleanup 2024-05-15 09:46:13 -07:00
Paul Gauthier
f36bcd9b73 Added gpt-4-turbo-2024-04-09 (udiff) to the leaderboard 2024-05-15 09:45:11 -07:00
Paul Gauthier
c0ccd2cb1f added release dates 2024-05-15 09:44:18 -07:00
Paul Gauthier
72613f3b27 switch naming from openai/gpt-4o to gpt-4o 2024-05-15 06:25:30 -07:00
Paul Gauthier
ebeec04cae Updated leaderboard commands to reflect 4o is default model now 2024-05-13 11:30:09 -07:00
Paul Gauthier
8b99429dfa updated leaderboards 2024-05-13 10:58:06 -07:00
Paul Gauthier
b158e1c230 copy 2024-05-09 14:07:28 -07:00
Paul Gauthier
a3c9bd97e2 updated deepseek-chat yaml 2024-05-09 12:51:16 -07:00
Paul Gauthier
80a3f6d4f6 updated deepseek-chat yaml 2024-05-09 11:57:41 -07:00
Paul Gauthier
4c6fd48b27 updated gpt-4-1106-preview leaderboards 2024-05-08 15:02:16 -07:00
Paul Gauthier
eaa2514981 copy 2024-05-08 14:18:25 -07:00
Paul Gauthier
87664dc254 added gpt-3.5-turbo results to leaderboard 2024-05-08 14:15:16 -07:00
Paul Gauthier
b0a512770b updated docs to use deepseek/ prefix 2024-05-08 08:40:28 -07:00
Paul Gauthier
6fc6d5056d copy 2024-05-07 16:28:05 -07:00
Paul Gauthier
85864e01bc added WizardLM-2 8x22B to leaderboard 2024-05-07 14:37:41 -07:00
Paul Gauthier
bbe8639160 renamed qwen 2024-05-07 13:57:45 -07:00
Paul Gauthier
c29d4860ab Added qwen1.5-110b-chat to leaderboard 2024-05-07 13:55:12 -07:00
Paul Gauthier
dc7e61f3c9 added deepseek-chat v2 (diff) to leaderboard 2024-05-07 09:38:46 -07:00
Paul Gauthier
ca0faf7cc7 Merge remote-tracking branch 'origin/main' 2024-05-07 06:27:20 -07:00
Paul Gauthier
ecca737803 added deepseek-chat v2 2024-05-07 06:26:39 -07:00
paul-gauthier
6fb4983d88
Update edit_leaderboard.yml 2024-05-06 20:42:23 -07:00
Paul Gauthier
92ea428b82 cleaned up edit data 2024-05-06 11:48:10 -07:00
Paul Gauthier
17b5dbe804 moved edit results to yaml 2024-05-06 11:44:29 -07:00