The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Financial data platform for analysts, quants and AI agents.
Models and examples built with TensorFlow
"🐈 nanobot: The Ultra-Lightweight Personal AI Agent"
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)