Multilingual Automatic Speech Recognition with word-level timestamps
State-of-the-art diffusion models for image and audio generation
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Open-source tool designed to enhance the efficiency of workloads
An MLOps framework to package, deploy, monitor and manage models
A high-performance ML model serving framework, offers dynamic batching
Neural Network Compression Framework for enhanced OpenVINO
A library for accelerating Transformer models on NVIDIA GPUs
Standardized Serverless ML Inference Platform on Kubernetes
Library for OCR-related tasks powered by Deep Learning
Build your chatbot within minutes on your favorite device
PyTorch extensions for fast R&D prototyping and Kaggle farming
Create HTML profiling reports from pandas DataFrame objects
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Probabilistic reasoning and statistical analysis in TensorFlow
A unified framework for scalable computing
Phi-3.5 for Mac: Locally-run Vision and Language Models
Libraries for applying sparsification recipes to neural networks
Gaussian processes in TensorFlow
Single-cell analysis in Python
Training and deploying machine learning models on Amazon SageMaker
Sparsity-aware deep learning inference runtime for CPUs
Large Language Model Text Generation Inference
Deep learning optimization library: makes distributed training easy
Easy-to-use Speech Toolkit including Self-Supervised Learning model