Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence Software
Search Results

Search Results for "source cross reference"

x

Sort By:

Relevance

Clear All Filters

OS

Windows 377
Linux 360
Mac 342
More...
BSD 150
ChromeOS 138
Mobile Operating Systems 17
Desktop Operating Systems 1

Category

Artificial Intelligence 377
Software Development 60
Scientific/Engineering 32
Multimedia 19
Games 11
System 10
Business 9
Internet 9
Communications 5
Education 4
Mobile 3
Formats and Protocols 2
Text Editors 2
Database 1
Desktop Environment 1
Security 1
Social sciences 1

License

OSI-Approved Open Source 366
Other License 1
Public Domain 1

Translations

English 37
Spanish 4
Chinese (Simplified) 3
Arabic 2
More...
French 2
Korean 2
Russian 2
Turkish 2
Chinese (Traditional) 1
Dutch 1
German 1
Greek 1
Italian 1
Japanese 1
Tamil 1

Programming Language

Python 184
TypeScript 57
C++ 49
Java 32
More...
JavaScript 22
C 16
C# 11
Go 8
MATLAB 6
Unix Shell 6
Perl 5
Rust 4
Lua 3
PowerShell 3
Assembly 2
Dart 2
F# 2
PHP 2
R 2
IDL 1
Julia 1
Lisp 1
Prolog 1
S/R 1
Tcl 1
Visual Basic .NET 1

Status

Production/Stable 34
Beta 22
Pre-Alpha 11
Alpha 10
More...
Planning 8
Mature 6
Inactive 1

Showing 377 open source projects for "source cross reference"

View related business solutions

Artificial Intelligence Windows Clear Filters & Widen Search

Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI Studio. Switch between models without switching platforms.

Start Free
Push Code. Get a Production URL. Done.
Cloud Run deploys any language instantly. Scales to zero. Pay only when code runs.

Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.

Try Cloud Run Free
1

OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model

OpenVoice is a versatile instant voice cloning system that can replicate a speaker’s tone color from just a short audio clip and then generate speech in multiple languages. It is designed not only to match the timbre of the reference voice, but also to give granular control over style parameters such as emotion, accent, rhythm, pauses, and intonation. The model supports cross-lingual and even zero-shot cross-lingual voice cloning, so a speaker recorded in one language can be made to speak naturally in others. Architecturally, OpenVoice separates “tone color” cloning from style control, which makes it easier to keep a consistent identity while flexibly changing prosody or language. ...

Downloads: 21 This Week

Last Update: 2025-11-28
See Project
2

HeartMuLa

A Family of Open Sourced Music Foundation Models

HeartMuLa is the open-source library and reference implementation for the HeartMuLa family of music foundation models, designed to support both music generation and music-related understanding tasks in a cohesive stack. At the center is HeartMuLa, a music language model that generates music conditioned on inputs like lyrics and tags, with multilingual support that broadens the range of lyric-driven use cases.

Downloads: 21 This Week

Last Update: 2026-02-13
See Project
3

Lux Desktop

Example client of oagi-python developed with Tauri

Lux Desktop is a cross-platform desktop application that showcases how to build a graphical client on top of the Lux computer-use model using the OAGI Python SDK as its backend. It is implemented with Tauri, combining a TypeScript/React front end with a Rust-powered shell to produce lightweight native executables for macOS and Windows. The project serves as a reference implementation for developers who want to see how Lux-powered agents can be packaged into a user-friendly app, including how to wire up commands, configuration, and UI flows around computer-use automation. ...

Downloads: 3 This Week

Last Update: 2025-12-09
See Project
4

FLUX.2

Official inference repo for FLUX.2 models

FLUX.2 is a state-of-the-art open-weight image generation and editing model released by Black Forest Labs aimed at bridging the gap between research-grade capabilities and production-ready workflows. The model offers both text-to-image generation and powerful image editing, including editing of multiple reference images, with fidelity, consistency, and realism that push the limits of what open-source generative models have achieved. It supports high-resolution output (up to ~4 megapixels), which allows for photography-quality images, detailed product shots, infographics or UI mockups rather than just low-resolution drafts. FLUX.2 is built with a modern architecture (a flow-matching transformer + a revamped VAE + a strong vision-language encoder), enabling strong prompt adherence, correct rendering of text/typography in images, reliable lighting, layout, and physical realism, and consistent style/character/product identity across multiple generations or edits.

Downloads: 32 This Week

Last Update: 2026-02-17
See Project
99.99% Uptime for Your Most Critical Databases
Sub-second maintenance. 2x read/write performance. Built-in vector search for AI apps.

Cloud SQL Enterprise Plus delivers near-zero downtime with 35 days of point-in-time recovery. Supports MySQL, PostgreSQL, and SQL Server.

Try Free
5

OuteTTS

Interface for OuteTTS models

OuteTTS is an interface library for running OuteTTS text-to-speech models across a range of backends, making it easier to deploy the same model on different hardware and runtimes. It provides a high-level Interface API that wraps model configuration, speaker handling, and audio generation so you can focus on integrating speech into your application rather than wiring up low-level engines. The project supports multiple backends including llama.cpp (Python bindings and server), Hugging Face...

Downloads: 1 This Week

Last Update: 2025-11-28
See Project
6

ncnn

High-performance neural network inference framework for mobile

ncnn is a high-performance neural network inference computing framework designed specifically for mobile platforms. It brings artificial intelligence right at your fingertips with no third-party dependencies, and speeds faster than all other known open source frameworks for mobile phone cpu. ncnn allows developers to easily deploy deep learning algorithm models to the mobile platform and create intelligent APPs. It is cross-platform and supports most commonly used CNN networks, including Classical CNN (VGG AlexNet GoogleNet Inception), Face Detection (MTCNN RetinaFace), Segmentation (FCN PSPNet UNet YOLACT), and more. ncnn is currently being used in a number of Tencent applications, namely: QQ, Qzone, WeChat, and Pitu.

Downloads: 28 This Week

Last Update: 2026-01-13
See Project
7

Sopro TTS

A lightweight text-to-speech model with zero-shot voice cloning

Sopro TTS is an open-source text-to-speech (TTS) project that implements a lightweight model capable of producing speech from text with zero-shot voice cloning, meaning it can mimic a speaker’s voice from only a few seconds of reference audio. Built with a 169 million-parameter architecture that uses dilated convolutions and cross-attention layers instead of large Transformer stacks, it achieves relatively fast real-time performance even on CPUs (about a 0.25 real-time factor measured on an M3 base). ...

Downloads: 1 This Week

Last Update: 2026-02-06
See Project
8

HunyuanVideo-Avatar

Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model

HunyuanVideo-Avatar is a multimodal diffusion transformer (MM-DiT) model by Tencent Hunyuan for animating static avatar images into dynamic, emotion-controllable, and multi-character dialogue videos, conditioned on audio. It addresses challenges of motion realism, identity consistency, and emotional alignment. Innovations include a character image injection module, an Audio Emotion Module for transferring emotion cues, and a Face-Aware Audio Adapter to isolate audio effects on faces,...

Downloads: 8 This Week

Last Update: 2025-12-16
See Project
9

MLPerf

Reference implementations of MLPerf™ training benchmarks

This is a repository of reference implementations for the MLPerf training benchmarks. These implementations are valid as starting points for benchmark implementations but are not fully optimized and are not intended to be used for "real" performance measurements of software frameworks or hardware. Benchmarking the performance of training ML models on a wide variety of use cases, software, and hardware drives AI performance across the tech industry. The MLPerf Training working group draws on...

Downloads: 0 This Week

Last Update: 2024-08-16
See Project
AI-generated apps that pass security review
Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.

Try Retool free
10

LatentSync

Taming Stable Diffusion for Lip Sync

LatentSync is an open-source framework from ByteDance that produces high-quality lip-synchronization for video by using an audio-conditioned latent diffusion model, bypassing traditional intermediate motion representations. In effect, given a source video (with masked or reference frames) and an audio track, LatentSync directly generates frames whose lip motions and expressions align with the audio, producing convincing talking-head or animated lip-sync output. ...

Downloads: 1 This Week

Last Update: 2025-12-02
See Project
11

Ultimate Vocal Remover (UVR5)

GUI for a Vocal Remover that uses Deep Neural Networks

This application uses state-of-the-art source separation models to remove vocals from audio files. UVR's core developers trained all of the models provided in this package (except for the Demucs v3 and v4 4-stem models).

Downloads: 562 This Week

Last Update: 2025-01-20
See Project
12

Fish Speech

SOTA Open Source TTS

...Fish Speech emphasizes expressive and controllable voices: it supports a long list of emotion tags, tone markers, and special audio effect markers that can be embedded in the text to drive prosody and vocal style, from basic emotions to nuanced states like sarcastic, conciliative, or hysterical. The system is multilingual and cross-lingual, handling multiple languages in a single input without explicit phoneme markup, and is trained on large-scale datasets.

Downloads: 2 This Week

Last Update: 2025-11-28
See Project
13

ONNX Runtime

ONNX Runtime: cross-platform, high performance ML inferencing

ONNX Runtime is a cross-platform inference and training machine-learning accelerator. ONNX Runtime inference can enable faster customer experiences and lower costs, supporting models from deep learning frameworks such as PyTorch and TensorFlow/Keras as well as classical machine learning libraries such as scikit-learn, LightGBM, XGBoost, etc. ONNX Runtime is compatible with different hardware, drivers, and operating systems, and provides optimal performance by leveraging hardware accelerators...

Downloads: 65 This Week

Last Update: 2026-02-19
See Project
14

Upscayl

Free and Open Source AI Image Upscaler for Linux, MacOS and Windows

Free and Open Source AI Image Upscaler for Linux, MacOS and Windows built with Linux-First philosophy. Upscayl is a cross-platform application built with the Linux-first philosophy. This means that we prioritize Linux builds over others but that doesn't mean we'll break things for other OSes. Upscayl does not work without a GPU, sorry. You'll need a Vulkan-compatible GPU to upscale images.

1 Review

Downloads: 175 This Week

Last Update: 2025-01-15
See Project
15

CosyVoice

Multi-lingual large voice generation model, providing inference

...The model supports multiple languages, including Chinese, English, Japanese, Korean, and a range of Chinese dialects such as Cantonese, Sichuanese, Shanghainese, Tianjinese, and Wuhanese. It is designed for zero-shot voice cloning and cross-lingual or mix-lingual scenarios, so a single reference voice can be used to synthesize speech across languages and in code-switching contexts. CosyVoice 2.0 significantly improves on version 1.0 by boosting accuracy, stability, speed, and overall speech quality, making it more suitable for production environments. The repository contains training recipes, inference pipelines, deployment scripts, and integration examples, positioning it as a comprehensive toolkit rather than just a set of model weights.

Downloads: 0 This Week

Last Update: 2025-11-30
See Project
16

GPT-SoVITS

1 min voice data can also be used to train a good TTS model

GPT‑SoVITS is a state-of-the-art voice conversion and TTS system that enables zero‑shot and few‑shot synthesis based on a short vocal sample (e.g., 5 seconds). It supports cross‑lingual speech synthesis across English, Chinese, Japanese, Korean, Cantonese, and more. It's powered by VITS architecture enhanced for few‑sample adaptation and real‑time usability.

Downloads: 35 This Week

Last Update: 2025-07-29
See Project
17

PicoClaw

Ultra-Efficient AI Assistant in Go

...PicoClaw can run on hardware costing as little as $10 and on resource-constrained environments like RISC-V or ARM boards, with cross-architecture portability achieved through a single self-contained binary. The project’s goals include broad platform support (including Linux, macOS, and multiple CPU architectures), rapid startup times that make the assistant feel responsive, and integration with popular messaging platforms via gateways or bots.

Downloads: 155 This Week

Last Update: 2 days ago
See Project
18

emgucv

Cross platform .Net wrapper to the OpenCV image processing library

Emgu CV is a cross platform .Net wrapper to the OpenCV image processing library. Allowing OpenCV functions to be called from .NET compatible languages. The wrapper can be compiled by Visual Studio and Unity, it can run on Windows, Linux, Mac OS, iOS and Android.

Downloads: 11 This Week

Last Update: 2025-10-09
See Project
19

MARS5

MARS5 speech model (TTS) from CAMB.AI

...To control speaker identity, MARS5 uses a short reference audio clip, typically between 2 and 12 seconds, from which it learns the voice characteristics. It supports two main inference modes: shallow clone, which is faster and only needs the reference audio, and deep clone, which additionally uses the transcript of the reference audio to increase similarity and naturalness at the cost of more computation.

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
20

Open Claude Cowork

Open Source version of Claude Cowork built with Claude Code

...With support for over 500 integrated tools—including Gmail, Slack, GitHub, Google Drive, and more via the Composio Tool Router—Open Claude Cowork lets agents execute complex tasks that span multiple platforms and APIs, effectively acting as a cross-service productivity layer.

Downloads: 73 This Week

Last Update: 2026-01-29
See Project
21

LuxTTS

A high-quality rapid TTS voice cloning model

...The project supports zero-shot voice cloning, meaning it can adapt to a reference speaker’s voice with minimal example data, enabling realistic and personalized synthetic speech. Intended for developers, hobbyists, and creators, the repository includes installation instructions, usage examples, and Python APIs that make it feasible to integrate the model in local workflows, web demos, or production systems.

Downloads: 8 This Week

Last Update: 2026-02-14
See Project
22

StyleTTS 2

Towards Human-Level Text-to-Speech through Style Diffusion

StyleTTS2 is a state-of-the-art text-to-speech system that aims for human-level naturalness by combining style diffusion, adversarial training, and large speech language models. It extends the original StyleTTS idea by introducing a style diffusion model that can sample rich, realistic speaking styles conditioned on reference speech, allowing highly expressive and diverse prosody. The architecture uses a two-stage training process and leverages an auxiliary speech language model to guide...

Downloads: 1 This Week

Last Update: 2025-11-28
See Project
23

Sec-Context

AI Code Security Anti-Patterns distilled from 150+ sources

Sec-Context is a curated security research project that distills common code anti-patterns and vulnerabilities that generative AI tends to produce, presenting them as a comprehensive set of examples and secure alternatives that can be used to train or guide AI assistants and reviewers toward safer code generation. It compiles insights from over 150 industry and academic sources into structured reference documents that outline real-world security problems such as hardcoded secrets, SQL injection, cross-site scripting, command injection, weak password storage, and other frequent issues that occur when code is auto-generated without context of best practices. Each anti-pattern is paired with a secure coding alternative and explanation, offering educational value for both humans and automated review agents designed to flag or correct unsafe patterns.

Downloads: 2 This Week

Last Update: 6 days ago
See Project
24

clone-voice

A sound cloning tool with a web interface, using your voice

Clone-voice is a local voice-cloning tool that lets you synthesize speech in any target voice or convert one recording into another voice using the same timbre. It is built around Coqui’s XTTS-v2 model, so it inherits multilingual support and modern neural TTS quality while wrapping it in a user-friendly desktop workflow. The app is designed to be very easy to use: you download a precompiled package, double-click app.exe, and it launches a browser-based web interface where you control...

Downloads: 18 This Week

Last Update: 2025-11-28
See Project
25

LTX-2

Python inference and LoRA trainer package for the LTX-2 audio–video

LTX-2 is a powerful, open-source toolkit developed by Lightricks that provides a modular, high-performance base for building real-time graphics and visual effects applications. It is architected to give developers low-level control over rendering pipelines, GPU resource management, shader orchestration, and cross-platform abstractions so they can craft visually compelling experiences without starting from scratch.

Downloads: 44 This Week

Last Update: 2026-02-09
See Project

Previous
You're on page 1
2
3
4
5
Next

Related Searches

voice cloning

text to speech

gpt-sovits

emgucv

windows

upscayl

voice cloning app

nokia windows phone 8.1 mobile apps

vocal

stem separation

Related Categories

Artificial Intelligence

Software Development

Scientific/Engineering

Multimedia

Games

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise

Thanks for helping keep SourceForge clean.

X

You seem to have CSS turned off. Please don't fill out this field.

You seem to have CSS turned off. Please don't fill out this field.

Briefly describe the problem (required):

Upload screenshot of ad (required):

Select a file, or drag & drop file here.

✔

✘

Screenshot instructions:

Click URL instructions:
Right-click on the ad, choose "Copy Link", then paste here →
(This may not be possible with some types of ads)

More information about our ad policies

Ad destination/click URL: