We tried the new model on our patched-codes/static-analysis-eval which evaluates the model on vulnerability remediation. gpt-4o has reclaimed the top spot on our leaderboard (from meta-llama/Meta-Llama-3-70B-Instruct).
You can now use the new model with our open-source framework PatchWork - https://github.com/patched-codes/patchwork by passing
model=gpt-4o
on the CLI.