Model Benchmarking
🚀
Can You Run It? LLM version
Determine GPU requirements for large language models
983
🥇
Open Medical-LLM Leaderboard
Browse and submit LLM evaluations
390
🐨
LLM Performance Leaderboard
View LLM Performance Leaderboard
327
🏆
Open Portuguese LLM Leaderboard
Track, rank and evaluate open LLMs in Portuguese
204
🏆
Low-bit Quantized Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots
172
🏆
Open Object Detection Leaderboard
Request model evaluation on COCO val 2017 dataset
161
🥇
Vidore Leaderboard
Display visual document retrieval leaderboard
144
⚡
Modelcard Creator
Create and upload a Hugging Face model card
113
⚔
MTEB Arena
Launch MTEB Arena to compare models
108
🐐
MLX My Repo
Convert and upload Hugging Face models to MLX format
106
🌍
Internal European Leaderboard
Explore multilingual LLM benchmark results
97
🏆
Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots
93
🐠
Nexus Function Calling Leaderboard
Visualize model performance on function calling tasks
92
🥇
LLM Safety Leaderboard
View and submit machine learning model evaluations
92
🥇
GIFT Eval
GIFT-Eval: A Benchmark for General Time Series Forecasting
85
☯
Convert to ONNX
Convert a Hugging Face model to ONNX format
79