ofn-layer-modes free download

BertViz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

...BertViz extends the Tensor2Tensor visualization tool by Llion Jones, providing multiple views that each offer a unique lens into the attention mechanism. The head view visualizes attention for one or more attention heads in the same layer. It is based on the excellent Tensor2Tensor visualization tool. The model view shows a bird's-eye view of attention across all layers and heads. The neuron view visualizes individual neurons in the query and key vectors and shows how they are used to compute attention.

Downloads: 0 This Week

Last Update: 2025-06-01

See Project

Lightweight' GAN

Implementation of 'lightweight' GAN, proposed in ICLR 2021

Implementation of 'lightweight' GAN proposed in ICLR 2021, in Pytorch. The main contribution of the paper is a skip-layer excitation in the generator, paired with autoencoding self-supervised learning in the discriminator. Quoting the one-line summary "converge on single gpu with few hours' training, on 1024 resolution sub-hundred images". Augmentation is essential for Lightweight GAN to work effectively in a low data setting. You can test and see how your images will be augmented before they pass into a neural network (if you use augmentation). ...

Downloads: 0 This Week

Last Update: 2025-01-12

See Project

KoboldCpp

Run GGUF models easily with a UI or API. One File. Zero Install.

KoboldCpp is an easy-to-use AI text-generation software for GGML and GGUF models, inspired by the original KoboldAI. It's a single self-contained distributable that builds off llama.cpp and adds many additional powerful features.

Downloads: 344 This Week

Last Update: 20 hours ago

See Project

DALL-E 2 - Pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch. The main novelty seems to be an extra layer of indirection with the prior network (whether it is an autoregressive transformer or a diffusion network), which predicts an image embedding based on the text embedding from CLIP. Specifically, this repository will only build out the diffusion prior network, as it is the best performing variant (but which incidentally involves a causal transformer as the denoising network) To train DALLE-2 is a 3 step process, with the training of CLIP being the most important. ...

Downloads: 0 This Week

Last Update: 2023-10-19

See Project

DALL-E in Pytorch

Implementation / replication of DALL-E, OpenAI's Text to Image

...Currently only the VAE with a codebook size of 1024 is offered, with the hope that it may train a little faster than OpenAI's, which has a size of 8192. In contrast to OpenAI's VAE, it also has an extra layer of downsampling, so the image sequence length is 256 instead of 1024 (this will lead to a 16 reduction in training costs, when you do the math).

Downloads: 0 This Week

Last Update: 2023-05-24

See Project

LaMDA-pytorch

Open-source pre-training implementation of Google's LaMDA in PyTorch

Open-source pre-training implementation of Google's LaMDA research paper in PyTorch. The totally not sentient AI. This repository will cover the 2B parameter implementation of the pre-training architecture as that is likely what most can afford to train. You can review Google's latest blog post from 2022 which details LaMDA here. You can also view their previous blog post from 2021 on the model.

Downloads: 0 This Week

Last Update: 2023-03-25

See Project

FID score for PyTorch

Compute FID scores with PyTorch

...However, due to differences in the image interpolation implementation and library backends, FID results still differ slightly from the original implementation. In difference to the official implementation, you can choose to use a different feature layer of the Inception network instead of the default pool3 layer.

Downloads: 6 This Week

Last Update: 2022-08-11

See Project

Seq2seq Chatbot for Keras

This repository contains a new generative model of chatbot

...The canonical seq2seq model became popular in neural machine translation, a task that has different prior probability distributions for the words belonging to the input and output sequences since the input and output utterances are written in different languages. The architecture presented here assumes the same prior distributions for input and output words. Therefore, it shares an embedding layer (Glove pre-trained word embedding) between the encoding and decoding processes through the adoption of a new model.

Downloads: 0 This Week

Last Update: 2023-03-21

See Project

Search Results for "ofn-layer-modes"

Showing 8 open source projects for "ofn-layer-modes"

BertViz

Lightweight' GAN

KoboldCpp

DALL-E 2 - Pytorch

DALL-E in Pytorch

LaMDA-pytorch

FID score for PyTorch

Seq2seq Chatbot for Keras

Search Results for "ofn-layer-modes"

Showing 8 open source projects for "ofn-layer-modes"

BertViz

Lightweight' GAN

KoboldCpp

DALL-E 2 - Pytorch

DALL-E in Pytorch

LaMDA-pytorch

FID score for PyTorch

Seq2seq Chatbot for Keras

Related Searches

Related Categories