tiktoken is a fast BPE tokeniser for use with OpenAI's models
SpikingJelly is an open-source deep learning framework
Less Code, Lower Barrier, Faster Deployment
Instant neural graphics primitives: lightning fast NeRF and more
Token-Oriented Object Notation (TOON)
This repository contains the official implementation of FastVLM
Official repository for LTX-Video
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Lightning fast C++/CUDA neural network framework
caret (Classification And Regression Training) R package
Reverse engineering Gemini's SynthID detection
Julia Implementation of Transformer models
Qwen2.5-VL is the multimodal large language model series
Unsupervised text tokenizer for Neural Network-based text generation
Unified Multimodal Understanding and Generation Models
A python tool that uses GPT-4, FFmpeg, and OpenCV
SOTA discrete acoustic codec models with 40/75 tokens per second
Package that makes it trivial to create and evaluate machine learning
Implementation of Vision Transformer, a simple way to achieve SOTA
Unifying 3D Mesh Generation with Language Models
Large-language-model & vision-language-model based on Linear Attention
Chinese Llama-3 LLMs) developed from Meta Llama 3
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm
AI agent that streamlines the entire process of data analysis
Code for the paper Language Models are Unsupervised Multitask Learners