Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Search Results

Search Results for "active appearance models" - Page 2

x

Sort By:

Relevance

OS

Windows 139
Linux 133
Mac 120
More...
BSD 59
ChromeOS 54
Mobile Operating Systems 4

Category

Artificial Intelligence 88
Software Development 25
Scientific/Engineering 15
Business 12
Multimedia 11
Games 7
Database 5
Education 2
System 2
Text Editors 2
Communications 1
Desktop Environment 1
Formats and Protocols 1
Internet 1
Productivity 1
Security 1

License

OSI-Approved Open Source 112
Creative Commons Attribution License 1
Other License 1

Translations

English 12
German 3
Spanish 2
French 1
More...
Hungarian 1
Portuguese 1

Programming Language

Python 58
C++ 21
JavaScript 9
TypeScript 9
More...
Java 7
MATLAB 5
C 4
PHP 3
Ruby 3
C# 2
Rust 2
Ada 1
Dart 1
Fortran 1
Julia 1
Objective C 1
Prolog 1
R 1
Visual Basic .NET 1

Status

Production/Stable 13
Beta 6
Alpha 3
Mature 3
More...
Pre-Alpha 1
Inactive 1

Showing 153 open source projects for "active appearance models"

View related business solutions

Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime
General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.

Try Free
1

Norfair

Lightweight Python library for adding real-time multi-object tracking

Norfair is a customizable lightweight Python library for real-time multi-object tracking. Using Norfair, you can add tracking capabilities to any detector with just a few lines of code. Any detector expressing its detections as a series of (x, y) coordinates can be used with Norfair. This includes detectors performing tasks such as object or keypoint detection. It can easily be inserted into complex video processing pipelines to add tracking to existing projects. At the same time, it is...

Downloads: 0 This Week

Last Update: 2025-04-30
See Project
2

Handy STT

A free, open source, and extensible speech-to-text application

...Handy allows users to start transcription instantly using a configurable keyboard shortcut—press to record, release to transcribe—and automatically pastes the resulting text into any active text field. Its backend leverages OpenAI’s Whisper models for GPU-accelerated speech recognition and Parakeet V3 for efficient CPU-only transcription with automatic language detection. To further refine accuracy and responsiveness, Handy integrates Silero’s Voice Activity Detection (VAD) for silence filtering, ensuring only speech segments are processed.

Downloads: 62 This Week

Last Update: 2026-04-02
See Project
3

Tongyi DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

...It’s built to act like a research agent: synthesizing, reasoning, retrieving information via the web and documents, and backing its outputs with evidence. The model is about 30.5 billion parameters in size, though at any given token only ~3.3B parameters are active. It uses a mix of synthetic data generation, fine-tuning and reinforcement learning; supports benchmarks like web search, document understanding, question answering, “agentic” tasks; provides inference tools, evaluation scripts, and “web agent” style interfaces. The aim is to enable more autonomous, agentic models that can perform sustained knowledge gathering, reasoning, and synthesis across multiple modalities (web, files, etc.).

Downloads: 5 This Week

Last Update: 2026-02-27
See Project
4

TRELLIS 2

Native and Compact Structured Latents for 3D Generation

TRELLIS.2 is a cutting-edge open-source model and codebase for high-fidelity 3D asset generation from 2D images, developed to push forward the state of the art in image-to-3D generation. At its core is a novel sparse voxel structure called O-Voxel that jointly encodes both geometry and surface appearance, enabling reconstruction and generation of complex 3D shapes with arbitrary topology, open surfaces, and physically based rendering (PBR) textures. The system leverages a large 4-billion-parameter architecture combining sparse 3D variational autoencoders with flow-matching transformers to produce fully textured 3D models at resolutions up to 1536³ voxels. ...

Downloads: 56 This Week

Last Update: 2026-01-29
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
5

ChatLLM.cpp

Pure C++ implementation of several models for real-time chatting

chatllm.cpp is a pure C++ implementation designed for real-time chatting with Large Language Models (LLMs) on personal computers, supporting both CPU and GPU executions. It enables users to run various LLMs ranging from less than 1 billion to over 300 billion parameters, facilitating responsive and efficient conversational AI experiences without relying on external servers.

Downloads: 0 This Week

Last Update: 2026-03-27
See Project
6

VideoChat

Real-time voice interactive digital human

VideoChat is a real-time voice-interactive “digital human” system that combines automatic speech recognition, large language models, text-to-speech, and talking-head generation into a single conversational pipeline. It supports both pure end-to-end voice solutions based on multimodal large language models (GLM-4-Voice feeding directly into talking-head generation) and a more traditional cascaded pipeline using ASR → LLM → TTS → talking head. It is built as a Gradio Python demo, exposing a web interface where users can talk to an animated avatar that lip-syncs to synthesized speech while responding intelligently. ...

Downloads: 0 This Week

Last Update: 2025-12-18
See Project
7

QuivrHQ

Opiniated RAG for integrating GenAI in your apps

Quivr is an open-source platform that leverages Retrieval-Augmented Generation (RAG) to integrate Generative AI into applications. It serves as a "second brain," enabling users to build powerful AI-driven assistants that can process and retrieve information efficiently. Quivr supports various large language models and vector stores, providing flexibility and customization for developers.

Downloads: 0 This Week

Last Update: 2025-05-30
See Project
8

4M

4M: Massively Multimodal Masked Modeling

...Training/inference configs and issues discuss things like depth tokenizers, input masks for generation, and CUDA build questions, signaling active research iteration. The design leans into flexibility and steerability, so prompts and masks can shape behavior without bespoke heads per task. In short, 4M provides a unified recipe to pretrain large multimodal models that generalize broadly while remaining practical to fine-tune.

Downloads: 0 This Week

Last Update: 2025-10-08
See Project
9

LitGPT

20+ high-performance LLMs with recipes to pretrain, finetune at scale

LitGPT is a collection of over 20 high-performance large language models (LLMs) accompanied by recipes to pretrain, finetune, and deploy them at scale. It provides implementations without abstractions, making it beginner-friendly while offering advanced features like flash attention and support for various precision levels. LitGPT is designed to run efficiently across multiple GPUs or TPUs, catering to both small-scale and large-scale deployments.

Downloads: 2 This Week

Last Update: 2025-12-18
See Project
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
10

DSPy

DSPy: The framework for programming—not prompting—language models

Developed by the Stanford NLP Group, DSPy (Declarative Self-improving Python) is a framework that enables developers to program language models through compositional Python code rather than relying solely on prompt engineering. It facilitates the construction of modular AI systems and provides algorithms for optimizing prompts and weights, enhancing the quality and reliability of language model outputs.

Downloads: 0 This Week

Last Update: 5 days ago
See Project
11

Lemonade

Lemonade helps users run local LLMs with the highest performance

...The project positions itself as a “local LLM server” you can run on laptops and workstations, abstracting away backend differences while giving you a single place to serve and manage models. Its README emphasizes real-world adoption across startups, research groups, and large companies, signaling a focus on practical deployments rather than toy demos. The repository highlights easy onboarding with downloads, docs, and a Discord for support, suggesting an active user community. Messaging centers on squeezing maximum throughput/latency from modern accelerators without users having to hand-tune kernels or flags. ...

Downloads: 10 This Week

Last Update: 2026-04-08
See Project
12

Kaleidoscope-SDK

User toolkit for analyzing and interfacing with Large Language Models

kaleidoscope-sdk is a Python module used to interact with large language models hosted via the Kaleidoscope service available at: https://github.com/VectorInstitute/kaleidoscope. It provides a simple interface to launch LLMs on an HPC cluster, asking them to perform basic features like text generation, but also retrieve intermediate information from inside the model, such as log probabilities and activations. Users must authenticate using their Vector Institute cluster credentials. This can...

Downloads: 0 This Week

Last Update: 2024-07-10
See Project
13

Qwen-Image

Qwen-Image is a powerful image generation foundation model

Qwen-Image is a powerful 20-billion parameter foundation model designed for advanced image generation and precise editing, with a particular strength in complex text rendering across diverse languages, especially Chinese. Built on the MMDiT architecture, it achieves remarkable fidelity in integrating text seamlessly into images while preserving typographic details and layout coherence. The model excels not only in text rendering but also in a wide range of artistic styles, including...

1 Review

Downloads: 11 This Week

Last Update: 2026-02-10
See Project
14

Qwen3.6

Qwen3.6 is the large language model series developed by Qwen team

The Qwen3.6 project is an open-source large language model series developed by Alibaba’s Qwen team, designed to deliver high-performance AI capabilities with a strong emphasis on real-world usability and developer productivity. It builds upon the advancements introduced in Qwen3.5, focusing on improving stability, responsiveness, and practical application in coding and agent-based workflows. The repository serves as a central hub for documentation, community discussion, and access to the...

Downloads: 18 This Week

Last Update: 6 days ago
See Project
15

FlashInfer

FlashInfer: Kernel Library for LLM Serving

FlashInfer is a kernel library designed to enhance the serving of Large Language Models (LLMs) by optimizing inference performance. It provides a high-performance framework that integrates seamlessly with existing systems, aiming to reduce latency and improve efficiency in LLM deployments. FlashInfer supports various hardware architectures and is built to scale with the demands of production environments.

Downloads: 6 This Week

Last Update: 2 days ago
See Project
16

nichenetr

NicheNet: predict active ligand-target links between interacting cells

nichenetr: the R implementation of the NicheNet method. The goal of NicheNet is to study intercellular communication from a computational perspective. NicheNet uses human or mouse gene expression data of interacting cells as input and combines this with a prior model that integrates existing knowledge on ligand-to-target signaling paths. This allows to predict ligand-receptor interactions that might drive gene expression changes in cells of interest. This model of prior information on...

Downloads: 0 This Week

Last Update: 2024-09-05
See Project
17

Qwen Code

Qwen Code is a coding agent that lives in the digital world

Qwen Code is a command-line AI workflow tool designed to enhance developer productivity by leveraging the power of Qwen3-Coder models. Adapted from the Google Gemini CLI, it features an enhanced parser optimized specifically for Qwen-Coder models, enabling deep code understanding and manipulation. The tool supports querying and editing large codebases beyond traditional context limits, making it ideal for modern, complex projects. Qwen Code automates various development workflows, including...

1 Review

Downloads: 20 This Week

Last Update: 2 days ago
See Project
18

Cleanlab

The standard data-centric AI package for data quality and ML

cleanlab helps you clean data and labels by automatically detecting issues in a ML dataset. To facilitate machine learning with messy, real-world data, this data-centric AI package uses your existing models to estimate dataset problems that can be fixed to train even better models. cleanlab cleans your data's labels via state-of-the-art confident learning algorithms, published in this paper and blog. See some of the datasets cleaned with cleanlab at labelerrors.com. This package helps you...

Downloads: 4 This Week

Last Update: 2026-01-13
See Project
19

HunyuanImage-3.0

A Powerful Native Multimodal Model for Image Generation

HunyuanImage-3.0 is a powerful, native multimodal text-to-image generation model released by Tencent’s Hunyuan team. It unifies multimodal understanding and generation in a single autoregressive framework, combining text and image modalities seamlessly rather than relying on separate image-only diffusion components. It uses a Mixture-of-Experts (MoE) architecture with many expert subnetworks to scale efficiently, deploying only a subset of experts per token, which allows large parameter...

1 Review

Downloads: 17 This Week

Last Update: 2026-02-03
See Project
20

Chipper

AI interface for tinkerers (Ollama, Haystack RAG, Python)

Chipper is an AI interface designed for tinkerers and developers, providing a platform to experiment with various AI models and techniques. It offers integration with tools like Ollama and Haystack for Retrieval-Augmented Generation (RAG), enabling users to build and test AI applications efficiently. Chipper supports Python and provides a modular architecture, allowing for customization and extension based on specific project requirements.

Downloads: 0 This Week

Last Update: 2025-06-04
See Project
21

ESPnet

End-to-end speech processing toolkit

ESPnet is a comprehensive end-to-end speech processing toolkit covering a wide spectrum of tasks, including automatic speech recognition (ASR), text-to-speech (TTS), speech translation (ST), speech enhancement, speaker diarization, and spoken language understanding. It uses PyTorch as its deep learning engine and adopts a Kaldi-style data processing pipeline for features, data formats, and experimental recipes. This combination allows researchers to leverage modern neural architectures while...

Downloads: 3 This Week

Last Update: 4 days ago
See Project
22

Gemini Next Chat

Deploy your private Gemini application for free with one click

Gemini Next Chat is an open-source web application that allows you to deploy your own private chat interface powered by Google’s Gemini models (e.g., Gemini 1.5, Gemini 2.0, etc.). It is built with Next.js/TypeScript and targets developers and hobbyists who want a self-hosted solution for interacting with advanced multimodal models (text, image, voice). It supports features like image recognition, voice-based conversation, plugins (web search, ArXiv search, weather, etc.), and client apps (tray app) for greater convenience. ...

Downloads: 3 This Week

Last Update: 2025-11-24
See Project
23

Dillo

Dillo, a multi-platform graphical web browser

Dillo is a lightweight, minimal graphical web browser, designed for speed, low resource usage, and privacy. It is written in C and C++ using the FLTK (Fast Light Toolkit) GUI library. Its goals include enabling web access on old or constrained hardware, using slow or unreliable network connections, minimizing dependencies, and avoiding many of the complexities and overheads of modern full-featured browsers. It omits many modern features (notably JavaScript), instead focusing on rendering...

Downloads: 26 This Week

Last Update: 2025-09-11
See Project
24

PowerSystems.jl

Data structures in Julia to enable power systems analysis

The PowerSystems.jl package provides a rigorous data model using Julia structures to enable power systems analysis and modeling. In addition to stand-alone system analysis tools and data model building, the PowerSystems.jl package is used as the foundational data container for the PowerSimulations.jl and PowerSimulationsDynamics.jl packages. PowerSystems.jl supports a limited number of data file formats for parsing.

Downloads: 2 This Week

Last Update: 4 days ago
See Project
25

Gradient Bang

Gradient Bang is an online multiplayer universe

Gradient Bang is an experimental open-source project developed within the Pipecat ecosystem that reimagines AI interaction as a persistent, multiplayer simulation where users and large language models coexist inside a shared virtual environment. Rather than functioning as a traditional application or API, it is conceptualized as an “online multiplayer universe” in which participants can explore, trade, battle, and collaborate while interacting with AI agents as active entities within the system. The project serves both as a prototype and a conceptual playground for testing how conversational AI systems behave when embedded into dynamic, game-like environments rather than static chat interfaces. ...

Downloads: 2 This Week

Last Update: 4 days ago
See Project

Previous
1
You're on page 2
3
4
5
6
7
Next

Related Searches

speech

handy

dillo

transcribe audio to srt

transcribe

program

ai

institute

text to image generator

qwen

Related Categories

Artificial Intelligence

Software Development

Scientific/Engineering

Business

Multimedia

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise