Determine GPU requirements for large language models
Browse and submit LLM evaluations
View LLM Performance Leaderboard
Track, rank and evaluate open LLMs in Portuguese
Track, rank and evaluate open LLMs and chatbots
Request model evaluation on COCO val 2017 dataset
Display visual document retrieval leaderboard
Create and upload a Hugging Face model card
Launch MTEB Arena to compare models
Convert and upload Hugging Face models to MLX format
Explore multilingual LLM benchmark results
Visualize model performance on function calling tasks
View and submit machine learning model evaluations
GIFT-Eval: A Benchmark for General Time Series Forecasting
Convert a Hugging Face model to ONNX format