Create and quantize Hugging Face models
Convert and PR models to Safetensors
Create a large, deduplicated dataset for LLM pre-training
Convert PDFs to a dataset and upload to Hugging Face
Create and validate structured metadata for datasets