Commit graph

168 commits

Author SHA1 Message Date
Paul Gauthier
d375103b64 data 2024-09-26 11:20:22 -07:00
Paul Gauthier
46ab701782 copy 2024-09-24 09:52:17 -07:00
Paul Gauthier
ba6ef29896 feat: Add new leaderboard entry for gemini/gemini-1.5-flash-8b-exp-0924 model 2024-09-24 09:46:56 -07:00
Paul Gauthier
d8dd7a259b feat: Add new Gemini model configurations 2024-09-24 09:45:36 -07:00
Curran Kelleher
330fa863c8
Add benchmark results for Codestral-22B 2024-09-22 08:26:04 -04:00
Paul Gauthier
ced3336176 updated blame data 2024-09-21 13:55:34 -07:00
Paul Gauthier
565c305aa6 update o1-preview leaderboard to diff only 2024-09-21 10:27:50 -07:00
Paul Gauthier
2753ac6b62 feat: Add new benchmark test case for qwen-2.5-72b-instruct-diff model 2024-09-20 13:27:58 -07:00
Paul Gauthier
d26fca0bca feat: Add new leaderboard entry for qwen-2.5-72b-instruct model 2024-09-20 13:19:26 -07:00
youknow
2463cbfd6c add Qwen2.5-7b-8q to leaderboard 2024-09-21 01:25:39 +09:00
Paul Gauthier
eba845ea51 copy 2024-09-12 20:40:12 -07:00
Paul Gauthier
c00ac80909 o1-mini diff results 2024-09-12 15:38:40 -07:00
Paul Gauthier
291d3509eb copy 2024-09-12 15:17:32 -07:00
Paul Gauthier
9f4d9d801e copy 2024-09-12 14:52:27 -07:00
Paul Gauthier
09cb4c4b09 copy 2024-09-12 14:27:35 -07:00
Paul Gauthier
71f3f3a22b copy 2024-09-12 14:12:48 -07:00
Paul Gauthier
297b51b997 pct 2024-09-12 14:11:26 -07:00
Paul Gauthier
96587f5f46 o1-mini blog article 2024-09-12 14:07:06 -07:00
Paul Gauthier
94a609d75e fix: Update model names in edit_leaderboard.yml 2024-09-11 08:56:46 -07:00
Paul Gauthier
13ac0f0968 fix: Update model names and commands in edit_leaderboard.yml 2024-09-11 08:55:25 -07:00
Paul Gauthier
ba54e4a6e0 feat: Add new leaderboard entries for command-r-plus-08-2024 and command-r-08-2024 models 2024-09-11 08:50:28 -07:00
Paul Gauthier
5420f67b2b copy 2024-09-09 14:56:44 -07:00
Paul Gauthier
e9c0c82e03 added reflection 70b 2024-09-06 13:47:14 -07:00
Paul Gauthier
2aef59e624 update name to DeepSeek V2.5 2024-09-06 13:32:15 -07:00
Jun Siang Cheah
8d151a3573 docs: clean up yi-coder model names 2024-09-05 23:04:54 +01:00
Paul Gauthier
76bc0e11b8 add deepseek v2.5 to refac bench 2024-09-05 10:07:46 -07:00
Paul Gauthier
6e3d8d90de Add deepseek v2.5 2024-09-05 07:59:32 -07:00
Jun Siang Cheah
5853c7fa92
docs: add benchmark results for yi-coder 9b 2024-09-04 18:34:52 +01:00
Paul Gauthier
9988a3ff79 updated blame 2024-09-04 09:20:40 -07:00
Paul Gauthier
ec585a3a1a added nous hermes 405b 2024-08-30 08:25:50 -07:00
Paul Gauthier
e7f4ef9c23 updated blame 2024-08-28 07:16:10 -07:00
Jun Siang Cheah
7f8203f89c
docs: match benchmark formatting 2024-08-28 09:21:04 +01:00
Jun Siang Cheah
9b6dda8813
docs: add benchmark results for new gemini experimental models 2024-08-28 08:51:55 +01:00
Paul Gauthier
b2279994f5 updated blame 2024-08-27 07:10:30 -07:00
Paul Gauthier
4b82277ef7 copy 2024-08-26 20:53:38 -07:00
Paul Gauthier
9243e49060 updated blame 2024-08-23 10:16:30 -07:00
Paul Gauthier
628e775314 updated blame after ignoring prompt files 2024-08-21 12:17:25 -07:00
Paul Gauthier
27190c279d updated gpt-4o date versions 2024-08-21 11:12:44 -07:00
Paul Gauthier
959a9fbcf1 copy 2024-08-20 09:17:01 -07:00
Paul Gauthier
db5dbb5d13 update with clean sonnet func data with args None fix 2024-08-15 13:27:26 -07:00
Paul Gauthier
8a1f696bce add clean deepseek func data, with args None issue resolved 2024-08-15 13:07:25 -07:00
Paul Gauthier
679e1b8990 copy 2024-08-15 11:13:20 -07:00
Paul Gauthier
04e816ff2e copy 2024-08-15 09:49:51 -07:00
Paul Gauthier
9982cda508 5 benchmark runs 2024-08-15 08:11:54 -07:00
Paul Gauthier
b3ed2c8a48 copy 2024-08-14 16:50:14 -07:00
Paul Gauthier
205a503d64 init 2024-08-14 16:41:22 -07:00
Paul Gauthier
1ced72b728 update models-over-time 2024-08-14 06:31:20 -07:00
Paul Gauthier
454408f9d5 Added chatgpt-4o-latest 2024-08-14 06:13:42 -07:00
Paul Gauthier
a5dde7000b Updated blame 2024-08-13 12:44:58 -07:00
Paul Gauthier
9d6630d3b4 Updated release contribution data 2024-08-10 13:42:04 -07:00