Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence Software
Search Results

Search Results for "malware-samples" - Page 2

x

Sort By:

Relevance

Clear All Filters

OS

Linux 91
Windows 83
Mac 80
More...
BSD 38
ChromeOS 36
Mobile Operating Systems 4
Desktop Operating Systems 1

Category

Artificial Intelligence 91
Scientific/Engineering 12
Software Development 9
Multimedia 4
Security 4
System 3
Education 2
Business 1

License

OSI-Approved Open Source 70
Creative Commons Attribution License 2
Public Domain 1

Translations

English 6
Arabic 1
Chinese (Traditional) 1
French 1

Programming Language

Python 52
C++ 6
JavaScript 6
TypeScript 5
More...
C 3
C# 3
Go 3
Java 3
PowerShell 3
Rust 2
Unix Shell 2
Julia 1
MATLAB 1
Perl 1

Status

Production/Stable 4
Pre-Alpha 2
Alpha 2
Beta 2

Showing 91 open source projects for "malware-samples"

View related business solutions

Artificial Intelligence Linux Clear Filters & Widen Search

$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
1

LLM Datasets

Curated list of datasets and tools for post-training

...It highlights instruction-tuning and conversation-style corpora while also pointing to code, math, or domain-specific sets for targeted capabilities. Quality is a recurring theme: examples and utilities help filter low-value samples, enforce length limits, and split train/validation consistently so results are comparable. Licensing and provenance are surfaced to encourage compliant usage and to guide dataset selection in commercial settings. For practitioners, the repo is a practical “starting pantry” that accelerates experimentation and helps keep data wrangling from dominating the project timeline.

Downloads: 2 This Week

Last Update: 2026-04-29
See Project
2

StreamSpeech

StreamSpeech is a seamless model for offline speech recognition

StreamSpeech is an “all-in-one” speech model designed to perform offline and simultaneous speech recognition, speech translation, and speech synthesis within a single unified architecture. Developed as part of an ACL 2024 paper, it targets streaming and low-latency scenarios where intermediate results and final translations or synthetic speech must be produced continuously as audio is being received. The model supports eight tasks: offline ASR, speech-to-text translation, speech-to-speech...

Downloads: 1 This Week

Last Update: 2025-11-28
See Project
3

AI Agent Deep Dive

AI Agent Source Code Deep Research Report

...It explores how agents interact with environments, execute tasks, and maintain context over time, highlighting both strengths and limitations of current approaches. The repository likely includes diagrams, annotated code samples, and conceptual walkthroughs that mirror real production systems.

Downloads: 0 This Week

Last Update: 2026-04-12
See Project
4

MobileCLIP

Implementation of "MobileCLIP" CVPR 2024

...Project notes highlight latency/accuracy trade-offs, with MobileCLIP2 variants matching or surpassing larger baselines at notably lower parameter counts and runtime on mobile devices. A companion “mobileclip-dr” repository details large-scale, distributed data-generation pipelines used to reinforce datasets across billions of samples on thousands of GPUs. Overall, MobileCLIP emphasizes end-to-end practicality: scalable training, deployable models, and consumer-grade demos.

Downloads: 0 This Week

Last Update: 2026-04-15
See Project
8 Monitoring Tools in One APM. Install in 5 Minutes.
Errors, performance, logs, uptime, hosts, anomalies, dashboards, and check-ins. One interface.

AppSignal works out of the box for Ruby, Elixir, Node.js, Python, and more. 30-day free trial, no credit card required.

Start Free
5

Generative AI

Sample code and notebooks for Generative AI on Google Cloud

Generative AI is a comprehensive collection of code samples, notebooks, and demo applications designed to help developers build generative-AI workflows on the Vertex AI platform. It spans multiple modalities—text, image, audio, search (RAG/grounding) and more—showing how to integrate foundation models like the Gemini family into cloud projects. The README emphasises getting started with prompts, datasets, environments and sample apps, making it ideal for both experimentation and production-ready usage. ...

Downloads: 1 This Week

Last Update: 2 days ago
See Project
6

Step-Audio-EditX

LLM-based Reinforcement Learning audio edit model

...This allows users to modify not only what is said (the text) but also how it's said: emotion, tone, speaking style, prosody, accent, even paralinguistic cues. Because the model is trained with a “large-margin learning” objective over many synthesized and natural speech samples, it gains robust control over expressive attributes, and can perform iterative editing: e.g. you could record a line, then ask the model to “make it sadder,” “speak slower,” or “change accent to X.”

Downloads: 3 This Week

Last Update: 2026-04-09
See Project
7

Lightly

A python library for self-supervised learning on images

...Our solution can be applied before any data annotation step and the learned representations can be used to visualize and analyze datasets. This allows selecting the best core set of samples for model training through advanced filtering. We provide PyTorch, PyTorch Lightning and PyTorch Lightning distributed examples for each of the models to kickstart your project. Lightly requires Python 3.6+ but we recommend using Python 3.7+. We recommend installing Lightly in a Linux or OSX environment. With lightly, you can use the latest self-supervised learning methods in a modular way using the full power of PyTorch. ...

Downloads: 1 This Week

Last Update: 2026-03-24
See Project
8

Flow Matching

A PyTorch library for implementing flow matching algorithms

flow_matching is a PyTorch library implementing flow matching algorithms in both continuous and discrete settings, enabling generative modeling via matching vector fields rather than diffusion. The underlying idea is to parameterize a flow (a time-dependent vector field) that transports samples from a simple base distribution to a target distribution, and train via matching of flows without requiring score estimation or noisy corruption—this can lead to more efficient or stable generative training. The library supports both continuous-time flows (via differential equations) and discrete-time analogues, giving flexibility in design and tradeoffs. ...

Downloads: 0 This Week

Last Update: 2026-01-05
See Project
9

YData Synthetic

Synthetic data generators for tabular and time-series data

A package to generate synthetic tabular and time-series data leveraging state-of-the-art generative models. Synthetic data is artificially generated data that is not collected from real-world events. It replicates the statistical components of real data without containing any identifiable information, ensuring individuals' privacy. This repository contains material related to Generative Adversarial Networks for synthetic data generation, in particular regular tabular data and time-series. It...

Downloads: 0 This Week

Last Update: 2026-04-23
See Project
Compliant and Reliable File Transfers Backed by Top Security Certifications
Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.

Start Free Trial
10

VideoChat

Real-time voice interactive digital human

VideoChat is a real-time voice-interactive “digital human” system that combines automatic speech recognition, large language models, text-to-speech, and talking-head generation into a single conversational pipeline. It supports both pure end-to-end voice solutions based on multimodal large language models (GLM-4-Voice feeding directly into talking-head generation) and a more traditional cascaded pipeline using ASR → LLM → TTS → talking head. It is built as a Gradio Python demo, exposing a...

Downloads: 1 This Week

Last Update: 2025-12-18
See Project
11

E2B Cookbook

Examples of using E2B

E2B Cookbook is an open-source collection of example projects, guides, and reference implementations demonstrating how to build applications using the E2B platform. The repository acts as a practical learning resource for developers who want to integrate AI agents with secure cloud execution environments that allow large language models to run code and interact with tools. The examples illustrate how developers can build AI workflows capable of performing tasks such as data analysis, code...

Downloads: 0 This Week

Last Update: 2026-04-22
See Project
12

OuteTTS

Interface for OuteTTS models

OuteTTS is an interface library for running OuteTTS text-to-speech models across a range of backends, making it easier to deploy the same model on different hardware and runtimes. It provides a high-level Interface API that wraps model configuration, speaker handling, and audio generation so you can focus on integrating speech into your application rather than wiring up low-level engines. The project supports multiple backends including llama.cpp (Python bindings and server), Hugging Face...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
13

Improved Diffusion

Release for Improved Denoising Diffusion Probabilistic Models

...The repository provides code for training and sampling diffusion models with improved techniques that enhance stability, efficiency, and output fidelity. It includes scripts for setting up training runs, generating samples, and reproducing results from OpenAI’s research on diffusion-based generation. The implementation is intended for researchers and practitioners who want to explore the theoretical and practical aspects of diffusion models in deep learning. By making this code available, OpenAI provides a foundation for further experimentation and development in generative modeling research.

Downloads: 3 This Week

Last Update: 2 days ago
See Project
14

AnimateDiff

Plug-n-play module turning text-to-image models into animation

AnimateDiff is an open-source project designed to enhance text-to-image diffusion models by adding animation capabilities. It allows users to turn static images generated by popular text-to-image models into animated sequences without requiring additional model training. This plug-and-play tool is compatible with a wide range of community models and facilitates the generation of animation directly from pre-existing text-to-image models. It supports various configurations to create animations...

1 Review

Downloads: 25 This Week

Last Update: 2025-03-06
See Project
15

GPT-2 Output Dataset

Dataset of GPT-2 outputs for research in detection, biases, and more

The GPT-2 Output Dataset is a large collection of model-generated text, released by OpenAI alongside the GPT-2 research paper to study the behaviors and limitations of large language models. It contains 250,000 samples of GPT-2 outputs, generated with different sampling strategies such as top-k truncation, to highlight the diversity and quality of model completions. The dataset also includes corresponding human-written text for comparison, enabling researchers to explore methods for distinguishing machine-generated content from human-authored text. ...

Downloads: 2 This Week

Last Update: 10 hours ago
See Project
16

DPM-Solver

Fast ODE Solver for Diffusion Probabilistic Model Sampling

...The project introduces a specialized numerical solver designed to approximate the diffusion process using a small number of high-order integration steps. By reformulating the sampling problem as the solution of a diffusion-related ordinary differential equation, the solver can produce high-quality samples much more efficiently. This approach significantly reduces the computational cost required to generate images while maintaining strong generation quality.

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
17

Consistency Models

Official repo for consistency models

consistency_models is the repository for Consistency Models, a new family of generative models introduced by OpenAI that aim to generate high-quality samples by mapping noise directly into data — circumventing the need for lengthy diffusion chains. It builds on and extends diffusion model frameworks (e.g. based on the guided-diffusion codebase), adding techniques like consistency distillation and consistency training to enable fast, often one-step, sample generation. The repo is implemented in PyTorch and includes support for large-scale experiments on datasets like ImageNet-64 and LSUN variants. ...

Downloads: 0 This Week

Last Update: 2025-10-02
See Project
18

finetuner

Task-oriented finetuning for better embeddings on neural search

...Create high-quality embeddings for semantic search, visual similarity search, cross-modal text image search, recommendation systems, clustering, duplication detection, anomaly detection, or other uses. Bring considerable improvements to model performance, making the most out of as little as a few hundred training samples, and finish fine-tuning in as little as an hour.

Downloads: 0 This Week

Last Update: 2023-08-21
See Project
19

PRM800K

800,000 step-level correctness labels on LLM solutions to MATH problem

PRM800K is a process supervision dataset accompanying the paper Let’s Verify Step by Step, providing 800,000 step-level correctness labels on model-generated solutions to problems from the MATH dataset. The repository releases the raw labels and the labeler instructions used in two project phases, enabling researchers to study how human raters graded intermediate reasoning. Data are stored as newline-delimited JSONL files tracked with Git LFS, where each line is a full solution sample that...

Downloads: 2 This Week

Last Update: 2 days ago
See Project
20

TensorFlow Documentation

TensorFlow documentation

An end-to-end platform for machine learning. TensorFlow makes it easy to create ML models that can run in any environment. Learn how to use the intuitive APIs through interactive code samples.

Downloads: 0 This Week

Last Update: 2024-08-02
See Project
21

ChatGPT Plugins Collection

An unofficial collection of Plugins for ChatGPT

...By centralizing community contributions, the repository highlights practical applications of plugins across domains such as productivity, data access, and automation. The project also serves as a starting point for developers interested in building their own custom plugins, offering inspiration and code samples. With its open structure, it encourages collaboration and knowledge sharing in the growing ecosystem of ChatGPT extensions.

Downloads: 6 This Week

Last Update: 10 hours ago
See Project
22

Minimal text diffusion

A minimal implementation of diffusion models for text generation

A minimal implementation of diffusion models of text: learns a diffusion model of a given text corpus, allowing to generate text samples from the learned model. The main idea was to retain just enough code to allow training a simple diffusion model and generating samples, remove image-related terms, and make it easier to use. To train a model, run scripts/train.sh. By default, this will train a model on the simple corpus. However, you can change this to any text file using the --train_data argument. ...

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
23

Hello AI World

Guide to deploying deep-learning inference networks

...You’ll also get to code your own easy-to-follow recognition program in Python or C++, and train your own DNN models onboard Jetson with PyTorch. Ready to dive into deep learning? It only takes two days. We’ll provide you with all the tools you need, including easy to follow guides, software samples such as TensorRT code, and even pre-trained network models including ImageNet and DetectNet examples. Follow these directions to integrate deep learning into your platform of choice and quickly develop a proof-of-concept design.

Downloads: 1 This Week

Last Update: 2022-08-03
See Project
24

Guided Diffusion

Codebase for Diffusion Models Beat GANS on Image Synthesis

...A key insight is that combining diffusion sampling with classifier gradients allows fine control over the generated images, trading off diversity vs fidelity. The repository includes scripts such as image_train.py, image_sample.py, and classifier_train.py to train diffusion models, generate samples, and train guiding classifiers. It also ships with precomputed evaluation batches and baseline comparisons to support reproducible benchmarking of new models.

Downloads: 0 This Week

Last Update: 2025-10-02
See Project
25

Fashion-MNIST

A MNIST-like fashion product database

Fashion-MNIST is an open-source dataset created by Zalando Research that provides a standardized benchmark for image classification algorithms in machine learning. The dataset contains grayscale images of fashion products such as shirts, shoes, coats, and bags, each labeled according to its clothing category. It was designed as a direct replacement for the original MNIST handwritten digits dataset, maintaining the same structure and image size so that researchers could easily switch datasets...

Downloads: 6 This Week

Last Update: 2026-03-10
See Project

Previous
1
You're on page 2
3
4
Next

Related Searches

llm

ai

deep learning

baunilla

ai apps

Related Categories

Artificial Intelligence

Scientific/Engineering

Software Development

Multimedia

Security

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise