Model Benchmarking | AI Tools Categories | Organized by Function

🚀

Can You Run It? LLM version

Determine GPU requirements for large language models

983

🥇

Open Medical-LLM Leaderboard

Browse and submit LLM evaluations

390

🐨

LLM Performance Leaderboard

View LLM Performance Leaderboard

327

🏆

Open Portuguese LLM Leaderboard

Track, rank and evaluate open LLMs in Portuguese

204

🏆

Low-bit Quantized Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots

172

🏆

Open Object Detection Leaderboard

Request model evaluation on COCO val 2017 dataset

161

🥇

Vidore Leaderboard

Display visual document retrieval leaderboard

144

⚡

Modelcard Creator

Create and upload a Hugging Face model card

113

⚔

MTEB Arena

Launch MTEB Arena to compare models

108

🐐

MLX My Repo

Convert and upload Hugging Face models to MLX format

106

🌍

Internal European Leaderboard

Explore multilingual LLM benchmark results

🏆

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots

🐠

Nexus Function Calling Leaderboard

Visualize model performance on function calling tasks

🥇

LLM Safety Leaderboard

View and submit machine learning model evaluations

🥇

GIFT Eval

GIFT-Eval: A Benchmark for General Time Series Forecasting

☯

Convert to ONNX

Convert a Hugging Face model to ONNX format