CogView is a large-scale pretrained text-to-image transformer, introduced in the NeurIPS 2021 paper *CogView: Mastering Text-to-Image Generation via Transformers*. With 4 billion parameters, it was one of the earliest transformer-based models to generate high-quality images from natural-language descriptions in Chinese, with partial English support via translation. To keep very deep transformers trainable without NaN losses, the model introduces PB-relax and Sandwich-LN (sketched below). Beyond text-to-image generation, CogView supports image captioning, post-selection (ranking candidate images by relevance to a prompt), and super-resolution (upscaling model-generated images). The repository provides pretrained models, inference scripts, and training examples, along with a Docker environment for reproducibility.
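Both stabilization tricks are simple to state: Sandwich-LN adds a second LayerNorm at the end of each residual branch, and PB-relax rescales attention logits so fp16 values never overflow before the softmax. Below is a minimal PyTorch sketch of both ideas, following the paper's description rather than the repository's actual code; the names `SandwichBlock`, `pb_relax_scores`, and `alpha` are illustrative.

```python
# Illustrative sketch of Sandwich-LN and PB-relax, not CogView's real code.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SandwichBlock(nn.Module):
    """Residual block with Sandwich-LN: a LayerNorm before the sublayer
    (as in Pre-LN) and a second one after it, just before the residual add."""

    def __init__(self, dim: int, sublayer: nn.Module):
        super().__init__()
        self.input_ln = nn.LayerNorm(dim)
        self.output_ln = nn.LayerNorm(dim)  # the extra "sandwich" LayerNorm
        self.sublayer = sublayer

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.output_ln(self.sublayer(self.input_ln(x)))

def pb_relax_scores(q: torch.Tensor, k: torch.Tensor, alpha: float = 32.0) -> torch.Tensor:
    """Attention weights with PB-relax: divide by alpha before the matmul,
    subtract the per-row max, then scale alpha back. Subtracting a per-row
    constant leaves the softmax unchanged, but intermediates stay small in fp16."""
    d = q.size(-1)
    scores = q @ k.transpose(-2, -1) / (alpha * d ** 0.5)
    scores = (scores - scores.amax(dim=-1, keepdim=True)) * alpha
    return F.softmax(scores, dim=-1)
```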
## Features
- Supports simplified Chinese prompts; English prompts work best when translated into Chinese first
- 4B-parameter transformer for text-to-image generation
- Pretrained models for text-to-image, captioning, and super-resolution
- Stable deep transformer training with PB-relax and Sandwich-LN techniques
- Includes post-selection scripts to rank image outputs by relevance (a conceptual sketch follows this list)
- Docker environment for easier setup and large-scale training reproduction
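Post-selection, as described in the paper, scores each candidate image with a caption loss: the better the image explains the prompt under the captioning model, the lower the loss. The sketch below shows only the ranking step; `caption_loss` is a hypothetical stand-in for a real scorer, not the repository's API.

```python
# Conceptual sketch of post-selection ranking; `caption_loss` is hypothetical.
from typing import Callable, List, TypeVar

Image = TypeVar("Image")

def rank_by_relevance(
    prompt: str,
    candidates: List[Image],
    caption_loss: Callable[[str, Image], float],
) -> List[Image]:
    """Sort candidate images best-first: lower caption loss = closer match."""
    return sorted(candidates, key=lambda image: caption_loss(prompt, image))

# Usage with any scorer of matching signature:
#   best_first = rank_by_relevance("a cat on a sofa", images, my_scorer)
```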