Showing 115 open source projects for "transformers"

  • 1
    KoGPT

    KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)

    KoGPT is a Korean-language model based on the GPT architecture, designed for natural language processing (NLP) tasks such as text generation, summarization, and dialogue systems (see the usage sketch below).
    Downloads: 1 This Week
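
    A minimal usage sketch, assuming the model is published on the Hugging Face hub as kakaobrain/kogpt with the revision name below (both identifiers are assumptions and should be checked against the project page):

      # Hedged quick-start: load KoGPT through Hugging Face Transformers.
      # Model id and revision are assumptions; verify them before running.
      import torch
      from transformers import AutoModelForCausalLM, AutoTokenizer

      tokenizer = AutoTokenizer.from_pretrained("kakaobrain/kogpt", revision="KoGPT6B-ryan1.5b")
      model = AutoModelForCausalLM.from_pretrained(
          "kakaobrain/kogpt", revision="KoGPT6B-ryan1.5b", torch_dtype=torch.float16
      )

      prompt = "한국어 자연어 처리 모델은"  # "Korean NLP models are..."
      inputs = tokenizer(prompt, return_tensors="pt")
      with torch.no_grad():
          out = model.generate(**inputs, max_new_tokens=64, do_sample=True, top_p=0.9)
      print(tokenizer.decode(out[0], skip_special_tokens=True))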
  • 2
    FARM

    Fast & easy transfer learning for NLP

    ...AMP optimizers (~35% faster) and parallel preprocessing (16 CPU cores => ~16x faster). Modular design of language models and prediction heads: switch between heads or combine them for multitask learning (see the sketch below). Full compatibility with Hugging Face Transformers models and the model hub. Smooth upgrades to newer language models. Integration of custom datasets via the Processor class. Powerful experiment tracking and execution.
    Downloads: 0 This Week
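
    The language-model-plus-prediction-heads design can be pictured with a plain PyTorch sketch. The class names below are hypothetical illustrations of the pattern, not FARM's actual API:

      # Illustrative sketch of a shared encoder with swappable prediction
      # heads, in plain PyTorch. This mirrors the design FARM describes;
      # it is not FARM's own code.
      import torch
      import torch.nn as nn

      class ClassificationHead(nn.Module):
          def __init__(self, hidden, n_labels):
              super().__init__()
              self.proj = nn.Linear(hidden, n_labels)
          def forward(self, h):                  # h: [batch, hidden] pooled output
              return self.proj(h)

      class MultitaskModel(nn.Module):
          def __init__(self, encoder, heads):
              super().__init__()
              self.encoder = encoder             # any module returning [batch, hidden]
              self.heads = nn.ModuleDict(heads)  # one head per task
          def forward(self, x, task):
              return self.heads[task](self.encoder(x))

      encoder = nn.Sequential(nn.Linear(128, 256), nn.ReLU())  # stand-in encoder
      model = MultitaskModel(encoder, {
          "sentiment": ClassificationHead(256, 2),
          "topic": ClassificationHead(256, 10),
      })
      logits = model(torch.randn(4, 128), task="sentiment")    # shape [4, 2]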
  • 3
    jiant

    jiant is an NLP toolkit

    jiant is a multitask NLP framework for fine-tuning transformer-based models on multiple natural language understanding (NLU) tasks.
    Downloads: 1 This Week
  • 4
    SimCSE

    SimCSE: Simple Contrastive Learning of Sentence Embeddings

    SimCSE (Simple Contrastive Learning of Sentence Embeddings) is a machine learning framework for training sentence embeddings with contrastive learning: in the unsupervised setting, the same sentence is encoded twice under different dropout masks, and the two views form a positive pair against in-batch negatives (see the sketch below). The learned embeddings improve representation quality for downstream NLP tasks.
    Downloads: 0 This Week
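
    A minimal PyTorch sketch of the unsupervised objective (an illustration of the idea, not the project's own code):

      # Sketch of SimCSE's unsupervised objective: the same batch is
      # encoded twice (dropout yields two slightly different views), and
      # each sentence's two views form a positive pair; all other
      # sentences in the batch serve as negatives (InfoNCE loss).
      import torch
      import torch.nn.functional as F

      def simcse_loss(encode, batch, temperature=0.05):
          z1 = encode(batch)                    # [B, D], dropout active
          z2 = encode(batch)                    # second pass, different dropout
          z1, z2 = F.normalize(z1, dim=-1), F.normalize(z2, dim=-1)
          sim = z1 @ z2.T / temperature         # [B, B] cosine similarities
          labels = torch.arange(sim.size(0))    # positives sit on the diagonal
          return F.cross_entropy(sim, labels)

      encoder = torch.nn.Sequential(torch.nn.Linear(32, 16), torch.nn.Dropout(0.1))
      encoder.train()                           # dropout must be on
      loss = simcse_loss(encoder, torch.randn(8, 32))
      loss.backward()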
  • 5
    Tabnine

    Vim client for TabNine

    Tabnine is an AI-powered code completion extension trusted by millions of developers around the world. Whether you’re just getting started as a developer or have been doing it for decades, Tabnine will help you code twice as fast with half the keystrokes, all in your favorite IDE. Whether you call it IntelliSense, IntelliCode, autocomplete, AI-assisted code completion, AI-powered code completion, AI copilot, AI code snippets, code suggestion, code prediction, code hinting, or...
    Downloads: 23 This Week
  • 6
    TextBrewer

    A PyTorch-based knowledge distillation toolkit

    TextBrewer is a PyTorch-based model distillation toolkit for natural language processing. It includes distillation techniques from both the NLP and CV fields and provides an easy-to-use framework that lets users quickly experiment with state-of-the-art distillation methods, compressing a model with a relatively small sacrifice in performance while increasing inference speed and reducing memory usage (see the sketch below).
    Downloads: 0 This Week
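
    The basic ingredient of such toolkits is a soft-label distillation loss; a minimal PyTorch sketch of that idea (not TextBrewer's own API) follows:

      # Temperature-scaled knowledge distillation: the student matches the
      # teacher's softened output distribution while also fitting the hard
      # labels. T softens the distributions; alpha balances the two terms.
      import torch
      import torch.nn.functional as F

      def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.5):
          soft = F.kl_div(
              F.log_softmax(student_logits / T, dim=-1),
              F.softmax(teacher_logits / T, dim=-1),
              reduction="batchmean",
          ) * (T * T)                            # rescale to keep gradients comparable
          hard = F.cross_entropy(student_logits, labels)
          return alpha * soft + (1 - alpha) * hard

      s = torch.randn(8, 10, requires_grad=True)  # student logits
      t = torch.randn(8, 10)                      # teacher logits (frozen)
      y = torch.randint(0, 10, (8,))
      distillation_loss(s, t, y).backward()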
  • 7
    Image GPT

    Large-scale autoregressive pixel model for image generation by OpenAI

    ...Researchers can use the code to sample new images, evaluate generative loss on datasets like ImageNet or CIFAR-10, and explore the impact of scaling on performance. While the repository is archived and provided as-is, it remains a valuable starting point for experimenting with autoregressive transformers applied directly to raw pixel data (see the toy sampler below). By demonstrating GPT’s flexibility across modalities, Image-GPT influenced subsequent multimodal generative research.
    Downloads: 3 This Week
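
    Autoregressive pixel modeling reduces to next-token prediction over a flattened image; a toy sampling loop (illustrative only, not the repository's code) looks like this:

      # Toy autoregressive sampling over flattened pixel tokens: the model
      # predicts a distribution over the next pixel value given all
      # previous ones. A trivial stand-in plays the role of the model.
      import torch

      @torch.no_grad()
      def sample_image(model, seq_len=16, vocab=256):
          tokens = torch.randint(0, vocab, (1, 1))       # seed pixel
          for _ in range(seq_len - 1):
              logits = model(tokens)[:, -1, :]           # next-pixel distribution
              nxt = torch.multinomial(logits.softmax(-1), 1)
              tokens = torch.cat([tokens, nxt], dim=1)
          return tokens                                  # flattened pixel sequence

      emb = torch.nn.Embedding(256, 64)                  # stand-in "model":
      proj = torch.nn.Linear(64, 256)                    # embed, then project to logits
      model = lambda t: proj(emb(t))
      pixels = sample_image(model, seq_len=16)           # shape [1, 16]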
  • 8
    Sparse Attention

    "Generating Long Sequences with Sparse Transformers" examples

    Sparse Attention is OpenAI’s code release for the Sparse Transformer model, introduced in the paper Generating Long Sequences with Sparse Transformers. It explores how replacing dense self-attention with structured sparse patterns reduces the quadratic scaling of standard transformers, making it possible to model much longer sequences efficiently (see the mask sketch below). The repository provides implementations of sparse attention layers, training code, and evaluation scripts for benchmark datasets. ...
    Downloads: 2 This Week
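
    The key idea, sketched below in plain PyTorch (not the repository's GPU kernels), is to replace the dense attention mask with a structured sparse one, e.g. a strided pattern where each position attends to a local window plus every stride-th earlier position:

      # Build a strided sparse attention mask in the spirit of Sparse
      # Transformers: position i may attend to the last `window` positions
      # and to periodic "summary" columns, subject to causality.
      import torch

      def strided_mask(n, window=4, stride=4):
          i = torch.arange(n).unsqueeze(1)        # query positions
          j = torch.arange(n).unsqueeze(0)        # key positions
          causal = j <= i
          local = (i - j) < window                # recent context
          strided = (j % stride) == (stride - 1)  # periodic summary columns
          return causal & (local | strided)

      mask = strided_mask(12)
      print(mask.int())   # 1 where attention is allowed
      # Usage: scores.masked_fill_(~mask, float("-inf")) before the softmax.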
  • 9
    TFKit

    Handle multiple NLP tasks in one pipeline

    TFKit is a toolkit mainly for language generation. It applies transformers to many tasks with different models in one all-in-one framework; all you need is a small change of config. You can use TFKit for model training and evaluation with tfkit-train and tfkit-eval. The key to combining different tasks is putting them in the same data format. All data is in CSV format; normally there are two columns, where the first column is the model input and the second column is the model output (see the example below). ...
    Downloads: 0 This Week
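
    Given the two-column convention the description spells out, preparing a dataset is just writing input/output pairs to CSV; the tfkit-train/tfkit-eval flags themselves are not shown and should be taken from the project docs:

      # Prepare a TFKit-style dataset: one CSV file, first column = model
      # input, second column = expected model output, per the project's
      # stated convention. The example pairs are made up.
      import csv

      pairs = [
          ("translate: hello world", "bonjour le monde"),
          ("translate: good night", "bonne nuit"),
      ]
      with open("train.csv", "w", newline="", encoding="utf-8") as f:
          csv.writer(f).writerows(pairs)
      # Training and evaluation then go through the tfkit-train and
      # tfkit-eval command-line tools; consult the docs for their options.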
  • 10
    DETR

    End-to-end object detection with transformers

    PyTorch training code and pretrained models for DETR (DEtection TRansformer). DETR replaces the complex hand-crafted object detection pipeline with a Transformer, matching Faster R-CNN with a ResNet-50 backbone and obtaining 42 AP on COCO with half the computation (FLOPs) and the same number of parameters; inference fits in about 50 lines of PyTorch (see the sketch below). Unlike traditional computer vision techniques, DETR approaches object detection as a direct set prediction problem. It consists of a set-based...
    Downloads: 0 This Week
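
    A minimal inference sketch, assuming the pretrained checkpoint is exposed through torch.hub under the facebookresearch/detr entry point described in the project README (verify the entry name before relying on it):

      # Hedged sketch: load a pretrained DETR via torch.hub and run it on
      # one image. Hub entry name follows the project README.
      import torch
      import torchvision.transforms as T
      from PIL import Image

      model = torch.hub.load("facebookresearch/detr", "detr_resnet50", pretrained=True)
      model.eval()

      transform = T.Compose([
          T.Resize(800),
          T.ToTensor(),
          T.Normalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225]),
      ])
      img = transform(Image.open("street.jpg").convert("RGB")).unsqueeze(0)

      with torch.no_grad():
          out = model(img)                                 # 'pred_logits', 'pred_boxes'
      probs = out["pred_logits"].softmax(-1)[0, :, :-1]    # drop the "no object" class
      keep = probs.max(-1).values > 0.9                    # confident detections only
      print(out["pred_boxes"][0, keep])                    # boxes in cxcywh, scaled to [0, 1]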
  • 11
    PixelCNN

    Code for the paper "PixelCNN++: A PixelCNN Implementation..."

    ...The project serves as both a research reference and a practical framework for experimenting with autoregressive generative models (the masked convolution at its core is sketched below). Although archived, PixelCNN has influenced a wide range of later work in generative modeling, including advances in image transformers and diffusion models.
    Downloads: 0 This Week
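
    The mechanism that keeps a PixelCNN autoregressive is the masked convolution; a minimal PyTorch sketch of that building block (illustrative only, not the repository's own code):

      # Masked 2D convolution: zero out kernel weights at and after the
      # current pixel in raster order, so each output depends only on
      # previously generated pixels. Type "A" masks the center; "B" keeps it.
      import torch
      import torch.nn as nn

      class MaskedConv2d(nn.Conv2d):
          def __init__(self, *args, mask_type="A", **kwargs):
              super().__init__(*args, **kwargs)
              kh, kw = self.kernel_size
              mask = torch.ones(kh, kw)
              mask[kh // 2, kw // 2 + (mask_type == "B"):] = 0  # center row, right part
              mask[kh // 2 + 1:, :] = 0                         # all rows below
              self.register_buffer("mask", mask)
          def forward(self, x):
              self.weight.data *= self.mask                     # enforce causality
              return super().forward(x)

      layer = MaskedConv2d(1, 16, kernel_size=5, padding=2, mask_type="A")
      out = layer(torch.randn(1, 1, 28, 28))                    # [1, 16, 28, 28]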
  • 12
    BlockSparse

    Efficient GPU kernels for block-sparse matrix multiplication

    ...The idea is to exploit block-level sparsity, i.e. to treat matrices or weight tensors as composed of blocks, many of which may be zero or unused, saving compute and memory when sparsity patterns are structured (see the reference sketch below). This is particularly useful in models like Sparse Transformers, where attention matrices or intermediate layers adopt block-sparse patterns to scale better. The repo implements both block-sparse matmul and blockwise convolution/transpose-convolution primitives, with support for preparing, executing, and verifying those ops on NVIDIA GPUs. In addition to low-level kernels, it includes wrapper code for integration with TensorFlow, example scripts (e.g. a transformer on the enwik8 dataset), transformer logic built on blocksparse operations, and debugging helpers.
    Downloads: 0 This Week
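
    To see what the kernels compute, here is a dense reference for block-sparse matmul in plain PyTorch; the repo's value is doing this with real GPU kernels that skip the zero blocks entirely:

      # Dense reference for block-sparse matrix multiplication: a boolean
      # block layout says which (row_block, col_block) tiles of the weight
      # matrix are nonzero; only those tiles contribute to the product.
      import torch

      def block_sparse_matmul(x, w_blocks, layout, block=32):
          # x: [batch, K]; layout: [K//block, N//block] bool;
          # w_blocks: dict mapping (i, j) -> [block, block] weight tile.
          N = layout.shape[1] * block
          out = torch.zeros(x.shape[0], N)
          for i in range(layout.shape[0]):
              for j in range(layout.shape[1]):
                  if layout[i, j]:
                      out[:, j*block:(j+1)*block] += (
                          x[:, i*block:(i+1)*block] @ w_blocks[(i, j)]
                      )
          return out

      layout = torch.rand(4, 4) < 0.5                  # ~50% of blocks populated
      w = {(i, j): torch.randn(32, 32)
           for i in range(4) for j in range(4) if layout[i, j]}
      y = block_sparse_matmul(torch.randn(8, 128), w, layout)   # [8, 128]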
  • 13
    An XML- and Python-based web-serving framework featuring XSLT, embedded Python evaluation, a pipeline processing model, and caching. It works with several XSLT transformers and webserver interfaces (see the standalone XSLT example below).
    Downloads: 1 This Week
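
    Pipelines like this one chain XSLT transforms over XML documents; a minimal standalone illustration using lxml (not this project's API):

      # Apply an XSLT stylesheet to an XML document with lxml, the kind of
      # transform step such frameworks chain together in a pipeline.
      from lxml import etree

      xslt = etree.XML(b"""\
      <xsl:stylesheet version="1.0"
          xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
        <xsl:template match="/greeting">
          <html><body><h1><xsl:value-of select="."/></h1></body></html>
        </xsl:template>
      </xsl:stylesheet>""")
      transform = etree.XSLT(xslt)

      doc = etree.XML(b"<greeting>hello</greeting>")
      print(str(transform(doc)))   # <html><body><h1>hello</h1></body></html>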
  • 14
    ZopeXMLMethods provides methods for applying XML/XSLT processing to Zope objects. XSLTMethod associates XSLT transformers with XML documents. ZopeXMLMethods succeeds XMLTransform; it features file-system caching and works with many XML/XSLT libraries.
    Downloads: 0 This Week
  • 15
    gpt-oss-20b

    OpenAI’s compact 20B open model for fast, agentic, and local use

    ...Like its larger sibling (gpt-oss-120b), it offers adjustable reasoning depth and full chain-of-thought visibility for better interpretability. It is released under the permissive Apache 2.0 license, allowing unrestricted commercial and research use. GPT-OSS-20B is compatible with Transformers, vLLM, Ollama, PyTorch, and other tools (see the quick-start sketch below). It is ideal for developers building lightweight AI agents or experimenting with fine-tuning on consumer-grade hardware.
    Downloads: 0 This Week
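
    Since the description lists Hugging Face Transformers among the supported runtimes, a hedged quick-start would look like this; the openai/gpt-oss-20b model id is an assumption to confirm on the hub:

      # Hedged quick-start for gpt-oss-20b via the Transformers pipeline.
      # The model id is an assumption; verify it on the model hub. Loading
      # with device_map="auto" requires the accelerate package.
      from transformers import pipeline

      generator = pipeline(
          "text-generation",
          model="openai/gpt-oss-20b",
          device_map="auto",    # spread the model across available devices
      )
      prompt = "Explain what a mixture-of-experts model is in two sentences."
      print(generator(prompt, max_new_tokens=128)[0]["generated_text"])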