-
Notifications
You must be signed in to change notification settings - Fork 107
Open
Description
I have attempted to reproduce Qwen/Qwen3-Coder-30B-A3B-Instruct 's performance on webarena and webarena-verified. However, using the Generic agent prompt, I achieved extremely low performance. Has anyone benchmarked Qwen/Qwen3-Coder-30B-A3B-Instruct on any of the benchmarks integrated here?
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels