Open LLM Leaderboard: DROP deep dive
Recently, three new benchmarks were added to the Open LLM Leaderboard: Winogrande, GSM8k and DROP, using the original implementations reproduced in the EleutherAI Harness.
DROP (Discrete Reasoning Over Paragraphs)
So what's next?
We have therefore taken the decision to remove DROP from the Open LLM Leaderboard until a new version arises.