🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks, for both inference and training.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
A high-throughput and memory-efficient inference and serving engine for LLMs
Build and share delightful machine learning apps, all in Python.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
LLM inference in C/C++
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.
Universal memory layer for AI Agents
Port of OpenAI's Whisper model in C/C++
Tensors and dynamic neural networks in Python with strong GPU acceleration
Protocol Buffers - Google's data interchange format