Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Search Results

Search Results for "language processing" - Page 10

x

Sort By:

Relevance

Clear All Filters

OS

Linux 960
Windows 893
Mac 832
More...
BSD 437
ChromeOS 363
Desktop Operating Systems 31
Mobile Operating Systems 14
Server Operating Systems 9
Embedded Operating Systems 1
Game Consoles 1

Category

Artificial Intelligence 551
Software Development 219
Scientific/Engineering 142
Text Editors 110
Multimedia 64
Business 50
Formats and Protocols 42
Internet 40
Education 38
Database 23
System 21
Games 14
Communications 13
Desktop Environment 7
Printing 7
Security 7
Productivity 3
Social sciences 3
Mobile 1
Religion and Philosophy 1

License

OSI-Approved Open Source 793
Creative Commons Attribution License 17
Public Domain 12
Other License 8
More...
GNU Free Documentation License 5

Translations

Programming Language

Status

Production/Stable 129
Beta 111
Alpha 59
Pre-Alpha 44
More...
Planning 28
Inactive 9
Mature 8

Showing 960 open source projects for "language processing"

View related business solutions

Linux Clear Filters & Widen Search

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
1

Handy STT

A free, open source, and extensible speech-to-text application

Handy is a free, open-source, offline speech-to-text application built for privacy, accessibility, and extensibility. Developed using Tauri (Rust + React/TypeScript), it runs natively across Windows, macOS, and Linux while performing local speech recognition without sending any audio to cloud servers. Handy allows users to start transcription instantly using a configurable keyboard shortcut—press to record, release to transcribe—and automatically pastes the resulting text into any active...

Downloads: 40 This Week

Last Update: 2026-04-27
See Project
2

kg-gen

Knowledge Graph Generation from Any Text

kg-gen is an open-source framework developed by the STAIR Lab that automatically generates knowledge graphs from unstructured text using large language models. The system is designed to transform plain text sources such as documents, articles, or conversation transcripts into structured graphs composed of entities and relationships. Instead of relying on traditional rule-based extraction techniques, KG-Gen uses language models to identify entities and their relationships, producing...

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
3

NVIDIA Generative AI Examples

Generative AI reference workflows

...The project is designed to help developers accelerate the development of AI applications by providing ready-to-run pipelines, notebooks, and tools that demonstrate how to integrate large language models into real-world systems. The repository includes examples covering topics such as retrieval-augmented generation pipelines, agent-based workflows, and multimodal AI applications that combine text, vision, and data processing. Many of the examples show how to deploy AI services using containerized environments, GPU acceleration, and microservices that can scale across modern infrastructure. ...

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
4

Deep Research

Use any LLMs (Large Language Models) for Deep Research

Deep Research is a local-first research agent that orchestrates multiple LLMs to generate in-depth reports in minutes. It combines “thinking” and “task” model roles with live internet access to plan, search, read, and synthesize findings into structured outputs. The project emphasizes privacy: processing and storage happen locally, avoiding server-side retention of your queries and notes. A simple web UI lets you enter topics and configure models, while the backend streams progress as...

Downloads: 1 This Week

Last Update: 2026-02-10
See Project
Atera - an All-in-one platform for IT management
Ideal for IT departments and MSPs (managed service providers)

Your IT essentials, integrated & elevated. Take your IT management from automated to autonomous, download Atera's agent to start your free trial!

Try Atera now
5

Webpack Encore

A simple but powerful API for processing and compiling assets

...But it can easily be used in any application in any language!

Downloads: 1 This Week

Last Update: 2026-03-13
See Project
6

Pipecat

Framework for building real-time voice and multimodal AI agents

Pipecat is an open source Python framework designed for building real-time voice and multimodal conversational AI agents. It provides developers with tools to orchestrate complex pipelines that combine speech recognition, language models, audio processing, and speech synthesis into a cohesive conversational system. Pipecat focuses on low-latency interactions so voice conversations with AI feel natural and responsive during live use. Pipecat allows applications to integrate multiple AI services and transports, enabling flexible deployment across different environments and communication channels. ...

Downloads: 4 This Week

Last Update: 2026-05-30
See Project
7

Paperless-AI

AI-powered document analysis and tagging for Paperless-ngx

Paperless-AI is an AI-powered extension designed to enhance document management within Paperless-ngx by automating analysis, classification, and organization tasks. It continuously monitors incoming documents and processes them using various AI backends, enabling automatic assignment of titles, tags, document types, and correspondents. It integrates with multiple OpenAI-compatible services as well as local models, giving users flexibility in how document intelligence is handled. A key...

Downloads: 2 This Week

Last Update: 2026-03-17
See Project
8

AudioMuse-AI

AudioMuse-AI is an Open Source Dockerized environment

AudioMuse-AI is an open-source system designed to automatically generate playlists and analyze music libraries using artificial intelligence and audio signal processing techniques. The platform runs locally in a Dockerized environment and performs detailed sonic analysis on audio files to understand characteristics such as tempo, mood, and acoustic similarity. By analyzing the underlying audio content rather than relying on external metadata services, the system can organize large personal...

Downloads: 6 This Week

Last Update: 10 hours ago
See Project
9

ESPnet

End-to-end speech processing toolkit

ESPnet is a comprehensive end-to-end speech processing toolkit covering a wide spectrum of tasks, including automatic speech recognition (ASR), text-to-speech (TTS), speech translation (ST), speech enhancement, speaker diarization, and spoken language understanding. It uses PyTorch as its deep learning engine and adopts a Kaldi-style data processing pipeline for features, data formats, and experimental recipes.

Downloads: 0 This Week

Last Update: 2026-04-22
See Project
Streamline Azure Security with Palo Alto Networks VM-Series
Centrally manage physical and virtualized firewalls with Panorama

Improve your security posture and reduce incident response time. Use the VM-Series to natively analyze Azure traffic and dynamically drive policy updates based on workload changes.

Learn more
10

TDengine

Open-source time-series database with high-performance and scalability

...TDengine’s native distributed architecture powers out-of-the-box scalability and high availability. Nodes can be added through simple configuration to achieve greater data processing power. In addition, this feature is open source. TDengine uses SQL as the query language, thereby reducing learning and migration costs, while adding SQL extensions to handle time-series data better, and supporting convenient and flexible schemaless data ingestion.

Downloads: 0 This Week

Last Update: 2026-04-26
See Project
11

SuperCollider

Audio server, programming language, and IDE for sound synthesis

...It is free and open source software available for Windows, macOS, and Linux. scsynth, a real-time audio server, forms the core of the platform. It features 400+ unit generators (“UGens”) for analysis, synthesis, and processing. Its granularity allows the fluid combination of many known and unknown audio techniques, moving between additive and subtractive synthesis, FM, granular synthesis, FFT, and physical modeling. You can write your own UGens in C++, and users have already contributed several hundred more to the sc3-plugins repository. sclang, an interpreted programming language. ...

Downloads: 1 This Week

Last Update: 2025-11-24
See Project
12

GalTransl

Automated translation solution for visual novels

GalTransl is an automated translation system specifically designed for visual novels, particularly those in the “galgame” genre, leveraging large language models to streamline and enhance the translation process. It integrates support for multiple advanced LLM providers such as GPT-4, Claude, DeepSeek, and other models, enabling high-quality, context-aware translations that go beyond traditional machine translation approaches. The platform is built to handle the unique structure of visual...

Downloads: 3 This Week

Last Update: 2026-05-22
See Project
13

Agent SOP

Natural language workflows for AI agents

Agent SOP is a framework that implements structured operational procedures (SOPs) for autonomous agents so that they can carry out complex multi-step tasks reliably and in a defined order. Instead of relying solely on broad language model reasoning, this project enforces explicit step sequences with checkpoints, conditional transitions, and rollback logic, making agent workflows more predictable and auditable. It defines reusable SOP templates that agents can instantiate with context-specific parameters, allowing organizations to codify best practices for customer support, data processing, document workflows, or incident response. ...

Downloads: 3 This Week

Last Update: 2026-04-10
See Project
14

Claude Code Video Vision

Give Claude the ability to watch and understand videos

...The system dynamically adapts how much data it extracts based on the user’s query, adjusting frame rate, resolution, and time windows to optimize both performance and token efficiency. It supports multiple backends for audio processing, including local and cloud-based options, enabling flexible deployment depending on privacy or performance requirements.

Downloads: 3 This Week

Last Update: 2026-05-18
See Project
15

Resume-Matcher

Improve your resumes with Resume Matcher

Resume-Matcher is a command-line application that compares resumes against job descriptions using natural language processing. It provides a compatibility score based on keyword relevance and highlights areas where the resume aligns—or doesn't—with the target role. Designed for job seekers and HR professionals, it helps improve resume tailoring and streamlines candidate screening.

Downloads: 1 This Week

Last Update: 2026-04-02
See Project
16

Cocur Slugify

Converts a string to a slug. Includes integrations for Symfony

Slugify is a PHP library that converts strings into URL-friendly slugs. It replaces spaces and special characters with hyphens or other specified separators, making it ideal for generating SEO-friendly URLs. Slugify is lightweight, fast, and highly configurable, supporting custom rules and language-specific transliterations for accurate slug creation.

Downloads: 1 This Week

Last Update: 2025-11-27
See Project
17

TEN Framework

TEN, a voice agent framework to create conversational AI.

TEN (Transformative Extensions Network) is a voice agent framework for creating conversational AI applications, focusing on high performance and modularity.

Downloads: 1 This Week

Last Update: 2026-05-26
See Project
18

AI Powered Knowledge Graph Generator

AI Powered Knowledge Graph Generator

...Knowledge graphs organize information as networks of nodes and relationships, allowing applications to analyze connections between concepts, datasets, or real-world entities. By incorporating AI techniques such as natural language processing and semantic reasoning, the project enables systems to automatically extract relationships and insights from large volumes of data. These capabilities make knowledge graph platforms particularly useful for applications such as recommendation engines, enterprise knowledge management, and research data exploration. The system emphasizes structured data modeling and graph-based queries that allow users to explore relationships that would be difficult to identify using traditional relational databases.

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
19

Fulling

Full-stack Engineer Agent. Built with Next.js, Claude, shadcn/ui

...Instead of manually configuring development environments, the system automatically provisions the required infrastructure including a Linux environment, database services, and development tools. It integrates an AI pair programmer that can generate code, implement features, and assist with debugging tasks through natural language instructions. The environment also includes web-based terminals, file management tools, and version control capabilities to support collaborative software development workflows. Developers can connect external services by simply providing API credentials, allowing the AI system to automatically integrate features such as authentication or payment processing.

Downloads: 0 This Week

Last Update: 2026-05-11
See Project
20

Transformer Explainer

Learn How LLM Transformer Models Work with Interactive Visualization

Transformer Explainer is an interactive visualization tool created to help users understand how transformer-based language models operate internally. The platform runs a lightweight GPT-2 model directly in the user’s browser and allows users to experiment with text prompts while observing the model’s internal operations. Through visual diagrams and interactive interfaces, the tool reveals how tokens are processed through layers such as embeddings, attention mechanisms, and feed-forward...

Downloads: 0 This Week

Last Update: 2026-03-04
See Project
21

Step3-VL-10B

Multimodal model achieving SOTA performance

...It achieves this efficiency and strong performance through unified pre-training on a massive 1.2 trillion-token multimodal corpus that jointly optimizes a language-aligned perception encoder with a powerful decoder, creating deep synergy between image processing and text understanding.

Downloads: 0 This Week

Last Update: 2026-01-22
See Project
22

Readest

Readest is a modern, feature-rich ebook reader

Readest is a project meant to facilitate reading, studying, or consuming content by integrating reading tools with AI-powered assistance. Although the repository is not as widely documented or popular as some, the idea is that Readest supports features to help with reading comprehension — likely combining OCR / text retrieval, translation, note-taking, or summarization for reading materials (eBooks, articles, PDFs). The goal appears to be to let users feed in arbitrary reading material and...

Downloads: 25 This Week

Last Update: 6 days ago
See Project
23

fooyin

A customisable music player

...It provides a modular interface that can be built from scratch or adapted from preset layouts, allowing users to tailor the experience to their workflow. The player supports a wide range of audio formats and includes advanced playback features such as gapless playback, ReplayGain, and DSP processing. It integrates a powerful plugin system that enables extensions for widgets, decoders, metadata handling, and external services. fooyin also includes a scripting language called FooScript, which allows users to customize interface behavior, automate playlists, and control display logic. Its library management tools offer advanced filtering, tagging, and playlist organization for large music collections. ...

Downloads: 15 This Week

Last Update: 2026-05-21
See Project
24

RAG Web UI

RAG Web UI is an intelligent dialogue system based on RAG

RAG Web UI is an open-source intelligent dialogue system built on retrieval-augmented generation technology, designed to enable users to create AI-powered question answering systems grounded in their own knowledge bases. It combines document retrieval with large language models to provide accurate, context-aware responses based on indexed data rather than generic model knowledge. The platform supports ingestion of multiple document formats, including PDFs, Word files, Markdown, and plain text, automatically processing and vectorizing them for efficient retrieval. It features a multi-turn conversational interface that maintains context across interactions, allowing users to engage in more natural and continuous dialogues with their data. ...

Downloads: 1 This Week

Last Update: 2026-04-06
See Project
25

SimpleMem

SimpleMem: Efficient Lifelong Memory for LLM Agents

SimpleMem is a lightweight memory-augmented model framework that helps developers build AI applications that retain long-term context and recall relevant information without overloading model context windows. It provides easy-to-use APIs for storing structured memory entries, querying those memories using semantic search, and retrieving context to augment prompt inputs for downstream processing. Unlike monolithic systems where memory management is ad-hoc, SimpleMem formalizes a memory...

Downloads: 1 This Week

Last Update: 2026-05-21
See Project

Previous
6
7
8
9
You're on page 10
11
12
13
14
Next

Related Searches

speech

handy

readest

transcribe audio to srt

transcribe

speech to text

audio plugins

resume

ebook

vosk

Related Categories

Artificial Intelligence

Software Development

Scientific/Engineering

Text Editors

Multimedia

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise