Page 5 | paper free download

Showing 206 open source projects for "paper"

View related business solutions

Python Clear Filters & Widen Search

Ship Agents Faster
Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free
Atera - an All-in-one platform for IT management
Ideal for IT departments and MSPs (managed service providers)

Your IT essentials, integrated & elevated. Take your IT management from automated to autonomous, download Atera's agent to start your free trial!

Try Atera now
1

Official YOLOv7

YOLOv7: Trainable bag-of-freebies sets new state-of-the-art

YOLOv7 is the official implementation of the paper “YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors.” It is a PyTorch-based object detection project focused on high speed and strong accuracy for real-time computer vision. The repository provides model definitions, training scripts, testing tools, inference examples, pretrained weights, and deployment-oriented materials.

Downloads: 1 This Week

Last Update: 2026-06-07
See Project
2

Blankly

Easily build, backtest and deploy your algo in just a few lines

Blankly is a live trading engine, backtest runner and development framework wrapped into one powerful open-source package. Models can be instantly backtested, paper traded, sandbox tested and run live by simply changing a single line. We built blankly for every type of quant including training & running ML models in the same environment, cross-exchange/cross-symbol arbitrage, and even long/short positions on stocks (all with built-in WebSockets). Blankly is the first framework to enable developers to backtest, paper trade, and go live across exchanges without modifying a single line of trading logic on stocks, crypto, and forex. ...

Downloads: 1 This Week

Last Update: 2022-06-13
See Project
3

CCTV Frame Timestamp Extractor

CCTV Footage Timestamp Search Tool

Python script to address the problem of manually locating required event timestamps from carved CCTV DVR footages. Full details can be found in the paper published in Eighteenth Annual IFIP WG 11.9 International Conference on Digital Forensics. Link to paper: https://link.springer.com/chapter/10.1007/978-3-031-10078-9_8 The project has been divided into four modules: Framextract.py- Extracts frames from video footages Reconstruct.py- Attempts to repair unplayable video by extracting the frames. ...

Downloads: 0 This Week

Last Update: 2022-07-14
See Project
4

ChangeFormer

A Transformer-Based Siamese Network for Change Detection

Here, we provide the PyTorch implementation of the paper: A Transformer-Based Siamese Network for Change Detection.

Downloads: 3 This Week

Last Update: 2023-10-19
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
5

NWT - Pytorch (wip)

Implementation of NWT, audio-to-video generation, in Pytorch

Implementation of NWT, audio-to-video generation, in Pytorch. The paper proposes a new discrete latent representation named Memcodes, which can be succinctly described as a type of multi-head hard-attention to learned memory (codebook) key/values. They claim the need for less codes and smaller codebook dimensions in order to achieve better reconstructions.

Downloads: 0 This Week

Last Update: 2023-03-22
See Project
6

CIPS-3D

3D-aware GANs based on NeRF (arXiv)

3D-aware GANs based on NeRF (arXiv). This repository contains the code of the paper, CIPS-3D: A 3D-Aware Generator of GANs Based on Conditionally-Independent Pixel Synthesis. The problem of mirror symmetry refers to the sudden change of the direction of the bangs near the yaw angle of pi/2. We propose to use an auxiliary discriminator to solve this problem. Note that in the initial stage of training, the auxiliary discriminator must dominate the generator more than the main discriminator does. ...

Downloads: 0 This Week

Last Update: 2023-03-21
See Project
7

Deep Feature Rotation Multimodal Image

Implementation of Deep Feature Rotation for Multimodal Image

...I provide some in the data/content and data/style and you can try to use them easily. We provide a visual comparison between other rotation angles that do not appear in the paper. The rotation angles will produce a very diverse number of outputs. This has proven the effectiveness of our method with other methods.

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
8

SVoice (Speech Voice Separation)

We provide a PyTorch implementation of the paper Voice Separation

SVoice is a PyTorch-based implementation of Facebook Research’s study on speaker voice separation as described in the paper “Voice Separation with an Unknown Number of Multiple Speakers.” This project presents a deep learning framework capable of separating mixed audio sequences where several people speak simultaneously, without prior knowledge of how many speakers are present. The model employs gated neural networks with recurrent processing blocks that disentangle voices over multiple computational steps, while maintaining speaker consistency across output channels. ...

Downloads: 1 This Week

Last Update: 12 hours ago
See Project
9

YOLOR

implementation of paper - You Only Learn One Representation

YOLOR is the implementation of “You Only Learn One Representation,” a unified network approach for learning explicit and implicit knowledge together. The project focuses on object detection while exploring how a shared representation can support multiple tasks. It builds on the YOLO family and related PyTorch detection work, combining practical detector training with a research idea about unified representations. YOLOR includes model configurations, training code, evaluation scripts,...

Downloads: 1 This Week

Last Update: 2026-06-03
See Project
Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure
Native application identity and user-based security for your Azure cloud

Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.

Get a free trial
10

Paperless-ng

A supercharged version of paperless, scan, index and archive docs

Paperless is a simple Django application running in two parts, a Consumer (the thing that does the indexing) and a Web server (the part that lets you search & download already-indexed documents). Paper is a nightmare. Environmental issues aside, there’s no excuse for it in the 21st century. It takes up space, collects dust, doesn’t support any form of a search feature, indexing is tedious, it’s heavy and prone to damage & loss. I wrote this to make “going paperless” easier. I do not have to worry about finding stuff again. I feed documents right from the post box into the scanner and then shred them. ...

Downloads: 0 This Week

Last Update: 2022-03-04
See Project
11

Differentiable Neural Computer

A TensorFlow implementation of the Differentiable Neural Computer

The Differentiable Neural Computer (DNC), developed by Google DeepMind, is a neural network architecture augmented with dynamic external memory, enabling it to learn algorithms and solve complex reasoning tasks. Published in Nature in 2016 under the paper “Hybrid computing using a neural network with dynamic external memory,” the DNC combines the pattern recognition power of neural networks with a memory module that can be written to and read from in a differentiable way. This allows the model to learn how to store and retrieve information across long time horizons, much like a traditional computer. ...

Downloads: 2 This Week

Last Update: 5 days ago
See Project
12

SimSiam

PyTorch implementation of SimSiam

SimSiam is a PyTorch implementation of “Exploring Simple Siamese Representation Learning” by Xinlei Chen and Kaiming He. The project introduces a minimalist approach to self-supervised learning that avoids negative pairs, momentum encoders, or large memory banks—key complexities of prior contrastive methods. SimSiam learns image representations by maximizing similarity between two augmented views of the same image through a Siamese neural network with a stop-gradient operation, preventing...

Downloads: 4 This Week

Last Update: 5 days ago
See Project
13

TimeSformer

The official pytorch implementation of our paper

TimeSformer is a vision transformer architecture for video that extends the standard attention mechanism into spatiotemporal attention. The model alternates attention along spatial and temporal dimensions (or designs variants like divided attention) so that it can capture both appearance and motion cues in video. Because the attention is global across frames, TimeSformer can reason about dependencies across long time spans, not just local neighborhoods. The official implementation in PyTorch...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
14

DeepSpeech

Open source embedded speech-to-text engine

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the instructions in the usage docs. If you want to use the pre-trained English model for performing speech-to-text, you can download it (along with other important inference material) from the DeepSpeech releases page.

Downloads: 5 This Week

Last Update: 2021-04-08
See Project
15

FixRes

Reproduces results of "Fixing the train-test resolution discrepancy"

...FixRes demonstrates that a mismatch between training and testing resolutions often leads to suboptimal accuracy, and fine-tuning the classifier and batch normalization layers at higher test resolutions significantly enhances performance. The repository includes pretrained models, feature embeddings, and evaluation scripts corresponding to the experiments reported in the NeurIPS 2019 paper “Fixing the train-test resolution discrepancy.”

Downloads: 4 This Week

Last Update: 5 days ago
See Project
16

Deep Exemplar-based Video Colorization

The source code of CVPR 2019 paper "Deep Exemplar-based Colorization"

The source code of CVPR 2019 paper "Deep Exemplar-based Video Colorization". End-to-end network for exemplar-based video colorization. The main challenge is to achieve temporal consistency while remaining faithful to the reference style. To address this issue, we introduce a recurrent framework that unifies the semantic correspondence and color propagation steps.

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
17

Image GPT

Large-scale autoregressive pixel model for image generation by OpenAI

Image-GPT is the official research code and models from OpenAI’s paper Generative Pretraining from Pixels. The project adapts GPT-2 to the image domain, showing that the same transformer architecture can model sequences of pixels without altering its fundamental structure. It provides scripts to download pretrained checkpoints of different model sizes (small, medium, large) trained on large-scale datasets and includes utilities for handling color quantization with a 9-bit palette. ...

Downloads: 7 This Week

Last Update: 6 days ago
See Project
18

ALAE

Adversarial Latent Autoencoders

ALAE (Adversarial Latent Autoencoders) is a deep learning research implementation that combines autoencoders with generative adversarial networks to produce high-quality image synthesis models. The project implements the architecture introduced in the CVPR research paper on Adversarial Latent Autoencoders, which focuses on improving generative modeling by learning latent representations aligned with adversarial training objectives. Unlike traditional GANs that directly generate images from random noise, ALAE uses an encoder-decoder architecture that maps images into a structured latent space and then reconstructs them through adversarial training. ...

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
19

Multi-Agent Emergence Environments

Environment generation code for the paper "Emergent Tool Use"

multi-agent-emergence-environments is an open source research environment framework developed by OpenAI for the study of emergent behaviors in multi-agent systems. It was designed for the experiments described in the paper and blog post “Emergent Tool Use from Multi-Agent Autocurricula”, which investigated how complex cooperative and competitive behaviors can evolve through self-play. The repository provides environment generation code that builds on the mujoco-worldgen package, enabling dynamic creation of simulated physical environments. Developers can construct custom environments by combining modular components such as Boxes, Ramps, and RandomWalls using a flexible layering approach that reduces code duplication. ...

Downloads: 0 This Week

Last Update: 5 days ago
See Project
20

CC-Net

Tools to download and cleanup Common Crawl data

cc_net provides tools to download, segment, clean, and filter Common Crawl to build large-scale text corpora, including monolingual datasets and the multilingual CC-100 collection introduced in the associated paper. It includes pipelines to fetch snapshots, extract text, de-duplicate, identify language, and apply quality filtering based on heuristics and language models. The outputs are intended for pretraining language models and for creating standardized corpora that can be reproduced or updated with new crawls. The repository documents practical concerns like HTTP failures, snapshot differences, and stats JSONs, reflecting community use across many languages. ...

Downloads: 0 This Week

Last Update: 2025-10-11
See Project
21

Sparse Attention

"Generating Long Sequences with Sparse Transformers" examples

Sparse Attention is OpenAI’s code release for the Sparse Transformer model, introduced in the paper Generating Long Sequences with Sparse Transformers. It explores how modifying the self-attention mechanism with sparse patterns can reduce the quadratic scaling of standard transformers, making it possible to model much longer sequences efficiently. The repository provides implementations of sparse attention layers, training code, and evaluation scripts for benchmark datasets.

Downloads: 9 This Week

Last Update: 5 days ago
See Project
22

EfficientNet Keras

Implementation of EfficientNet model. Keras and TensorFlow Keras

...Convolutional Neural Networks (ConvNets) are commonly developed at a fixed resource budget, and then scaled up for better accuracy if more resources are available. In this paper, we systematically study model scaling and identify that carefully balancing network depth, width, and resolution can lead to better performance. Based on this observation, we propose a new scaling method that uniformly scales all dimensions of depth/width/resolution using a simple yet highly effective compound coefficient. We demonstrate the effectiveness of this method on scaling up MobileNets and ResNet.

Downloads: 0 This Week

Last Update: 2022-08-10
See Project
23

Multilingual Speech Synthesis

An implementation of Tacotron 2 that supports multilingual experiments

This repository provides synthesized samples, training and evaluation data, source code, and parameters for the paper One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech. It contains an implementation of Tacotron 2 that supports multilingual experiments and that implements different approaches to encoder parameter sharing. It presents a model combining ideas from Learning to speak fluently in a foreign language: Multilingual speech synthesis and cross-language voice cloning, End-to-End Code-Switched TTS with Mix of Monolingual Recordings, and Contextual Parameter Generation for Universal Neural Machine Translation. ...

Downloads: 0 This Week

Last Update: 2023-03-24
See Project
24

Magnitude

A fast, efficient universal vector embedding utility package

...It is primarily intended to be a simpler / faster alternative to Gensim but can be used as a generic key-vector store for domains outside NLP. It offers unique features like out-of-vocabulary lookups and streaming of large models over HTTP. Published in our paper at EMNLP 2018 and available on arXiv.

Downloads: 0 This Week

Last Update: 2024-08-16
See Project
25

Reliable Metrics for Generative Models

Code base for the precision, recall, density, and coverage metrics

...Because it does not differentiate the fidelity and diversity aspects of the generated images, recent papers have introduced variants of precision and recall metrics to diagnose those properties separately. In this paper, we show that even the latest version of the precision and recall (Kynkäänniemi et al., 2019) metrics are not reliable yet. For example, they fail to detect the match between two identical distributions, they are not robust against outliers, and the evaluation hyperparameters are selected arbitrarily. We propose density and coverage metrics that solve the above issues.

Downloads: 0 This Week

Last Update: 2023-03-21
See Project