Paul Gauthier (aider)
30ee89c7e9
style: Fix linting issues in over_time.py
2024-11-21 16:45:11 -08:00
Paul Gauthier (aider)
25bcea6aec
feat: Add print of model release dates and names in sorted order
2024-11-21 16:45:07 -08:00
Paul Gauthier (aider)
8fdcd92260
feat: Update plot save paths to website assets directory
2024-11-21 14:19:05 -08:00
Paul Gauthier
781a40df52
fix: Update Gemini Pro legend label to Gemini 1.5 Pro
2024-11-21 14:19:03 -08:00
Paul Gauthier (aider)
a7fc0f9d2e
feat: Add color and legend support for Gemini Pro models
2024-11-21 14:02:27 -08:00
Paul Gauthier (aider)
c189a52e5e
style: Organize imports and apply linter formatting
2024-11-21 14:00:24 -08:00
Paul Gauthier (aider)
6d6d763dd3
refactor: Restructure benchmark plotting script for improved maintainability
2024-11-21 14:00:20 -08:00
Paul Gauthier
1f0d26e8c7
better over time plot
2024-11-20 20:19:44 -08:00
Paul Gauthier
8302e9d0dd
improved over time plot
2024-11-20 20:16:35 -08:00
Paul Gauthier (aider)
c797af020a
refactor: Update fontsize to use LABEL_FONT_SIZE constant in over_time.py
2024-11-20 20:13:46 -08:00
Paul Gauthier (aider)
1c85afa320
feat: Add LABEL_FONT_SIZE constant for dot label font size
2024-11-20 20:13:33 -08:00
Paul Gauthier
eb5317f8e5
fix: Adjust annotation vertical offset for brown color in over_time plot
2024-11-20 20:13:30 -08:00
Paul Gauthier (aider)
8b860615b8
style: Increase font size for scatter plot dot labels
2024-11-20 20:10:40 -08:00
Paul Gauthier (aider)
c15ac341e2
refactor: Remove Opus and Llama model variants from legend labels
2024-11-20 20:07:52 -08:00
Paul Gauthier (aider)
c2c7ee1047
feat: Change Opus label to "Opus" in legend
2024-11-20 20:06:48 -08:00
Paul Gauthier (aider)
72c46ccec6
feat: Add labels for Claude 3 Opus, Sonnet, and O1 Preview models
2024-11-20 20:06:04 -08:00
Paul Gauthier (aider)
dd3bfaee01
style: Format code with consistent indentation and line breaks
2024-11-20 20:05:24 -08:00
Paul Gauthier (aider)
03206ad90e
feat: Add line labels directly on first points instead of using legend
2024-11-20 20:05:18 -08:00
Paul Gauthier (aider)
2e00307190
feat: Add color and legend label for o1-preview models
2024-11-20 20:03:49 -08:00
Paul Gauthier (aider)
b3e29ab20e
style: Apply linter formatting to benchmark code
2024-11-20 20:02:52 -08:00
Paul Gauthier (aider)
5504ac535b
feat: Add simplified model names for legend labels
2024-11-20 20:02:48 -08:00
Paul Gauthier (aider)
4b3dd7f4ea
style: Apply linter formatting to over_time.py
2024-11-20 19:59:43 -08:00
Paul Gauthier (aider)
8edf9540d5
feat: Add legend to plot and remove point labels
2024-11-20 19:59:38 -08:00
Paul Gauthier
1c62ecd1b5
style: Adjust x-axis label rotation angle for better readability
2024-11-20 19:59:36 -08:00
Paul Gauthier
7cf3d9f3ce
style: Increase annotation font size in benchmark plot
2024-11-20 19:45:42 -08:00
Paul Gauthier
9b5a703307
updated models-over-time
2024-11-20 19:40:59 -08:00
Paul Gauthier (aider)
370993cbed
style: Rotate point labels by 45 degrees in benchmark plot
2024-11-20 18:47:30 -08:00
Paul Gauthier
ddc538cdfa
refactor: Adjust plot figure size and y-axis limits for better visualization
2024-11-20 18:47:28 -08:00
Paul Gauthier (aider)
062dc43c87
style: Make graph aspect ratio square
2024-11-20 18:43:18 -08:00
Paul Gauthier (aider)
7d9b986c04
feat: Add cyan color and line for Mistral models in visualization
2024-11-20 18:38:06 -08:00
Paul Gauthier
bd2b9a12ed
style: Change Qwen model color from purple to darkblue
2024-11-20 18:38:04 -08:00
Paul Gauthier (aider)
2b55707738
feat: Add purple color and line for Qwen models in visualization
2024-11-20 18:35:25 -08:00
Paul Gauthier (aider)
093540507e
feat: Add pink color and line for Haiku models in benchmark visualization
2024-11-20 18:33:54 -08:00
Paul Gauthier (aider)
8f1dcfda07
feat: Add brown color for DeepSeek models in benchmark visualization
2024-11-20 18:31:46 -08:00
Paul Gauthier
16b319174b
refactor: Simplify model color detection logic for Sonnet models
2024-11-20 18:31:44 -08:00
Paul Gauthier (aider)
35115f5707
feat: Add orange color for Claude 3 Sonnet models in benchmark visualization
2024-11-20 18:30:09 -08:00
Paul Gauthier
822a8ab671
remove gpt-4o-mini from the gpt-4 trendline
2024-08-15 09:52:21 -07:00
Paul Gauthier (aider)
5ccdebf2c0
refactor: Extract color assignment logic into a separate function
2024-08-15 09:50:50 -07:00
Paul Gauthier (aider)
0a3c6bfbe7
feat: Change blue color to light blue in plot_over_time function
2024-08-14 06:29:48 -07:00
Paul Gauthier (aider)
d2b4846b95
feat: Replace orange color with purple for "-4o" models
2024-08-14 06:29:13 -07:00
Paul Gauthier (aider)
fb0b348bec
fix: Remove unused blue_points
variable
2024-08-14 06:28:28 -07:00
Paul Gauthier (aider)
a7290be843
style: Apply linter formatting changes
2024-08-14 06:27:51 -07:00
Paul Gauthier (aider)
1cdbc76974
feat: Connect model family lines in over_time plot
2024-08-14 06:27:48 -07:00
Paul Gauthier
714fd45f4d
fix: Update color logic and font size in over_time.py
2024-08-14 06:27:47 -07:00
Paul Gauthier (aider)
1f6cadcc66
style: Refactor conditional logic in color assignment
2024-08-14 06:22:51 -07:00
Paul Gauthier (aider)
c4f70d81b7
feat: add new color for all "-4o-" models except "gpt-4o-mini"
2024-08-14 06:22:48 -07:00
Paul Gauthier (aider)
1f59687e9d
style: Format code with linter
2024-08-14 06:21:48 -07:00
Paul Gauthier (aider)
d8c8c51156
The commit message for these changes would be:
...
feat: Improve graph visualization and add debugging
The changes made in this commit include:
1. Adjusting the y-axis limit to 100 to accommodate the higher pass rate values.
2. Rotating the x-axis labels for better readability.
3. Adding debug print statements to track the progress of figure generation and display.
4. Increasing the figure size for better visibility.
5. Adding additional debugging to ensure the data is being plotted correctly.
These improvements should help with the visualization and debugging of the graph generation process.
2024-08-14 06:21:45 -07:00
Paul Gauthier (aider)
d94d5aa3fa
style: format code according to linter rules
2024-08-14 06:20:36 -07:00
Paul Gauthier (aider)
d2479f30f7
fix: Add debug prints and check for empty data in over_time.py
2024-08-14 06:20:32 -07:00