Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Search Results

Search Results for "language processing" - Page 8

x

Sort By:

Relevance

Clear All Filters

OS

Mac 842
Linux 832
Windows 829
More...
BSD 377
ChromeOS 363
Desktop Operating Systems 17
Mobile Operating Systems 16
Server Operating Systems 5
Embedded Operating Systems 1
Game Consoles 1

Category

Artificial Intelligence 516
Software Development 177
Scientific/Engineering 106
Text Editors 84
Multimedia 51
Business 42
Formats and Protocols 34
Internet 32
Education 28
Database 19
Games 10
System 9
Communications 8
Security 6
Printing 5
Desktop Environment 4
Productivity 3
Social sciences 2
Religion and Philosophy 1

License

OSI-Approved Open Source 686
Creative Commons Attribution License 13
Public Domain 8
Other License 6
More...
GNU Free Documentation License 5

Translations

Programming Language

Status

Production/Stable 98
Beta 79
Alpha 41
Pre-Alpha 26
More...
Planning 21
Mature 8
Inactive 8

Showing 842 open source projects for "language processing"

View related business solutions

Mac Clear Filters & Widen Search

Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.

Try it Free
Train ML Models With SQL You Already Know
BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.

Try Free
1

nndeploy

An Easy-to-Use and High-Performance AI Deployment Framework

nndeploy is an open-source framework designed to simplify the deployment of artificial intelligence models across multiple hardware platforms and devices. The framework focuses on making it easier to transform trained AI models into production-ready applications that can run efficiently on desktops, mobile devices, servers, and edge computing hardware. Developers can use visual workflows to design and configure AI processing pipelines by connecting modular nodes that represent different...

Downloads: 7 This Week

Last Update: 2026-04-04
See Project
2

Open Generative AI

Uncensored, open-source alternative to Higgsfield AI

...The repository organizes information about models, libraries, datasets, and learning materials, making it easier for developers to navigate the rapidly evolving AI landscape. It includes references to tools for natural language processing, computer vision, and multimodal systems. The project is designed as a knowledge hub, helping users discover technologies and best practices for building generative AI applications. It is particularly useful for beginners who need a structured overview as well as for experienced developers looking for new tools. The repository is continuously updated to reflect the latest developments in the field. ...

Downloads: 41 This Week

Last Update: 2026-05-23
See Project
3

LocalAI

The free, Open Source alternative to OpenAI, Claude and others

LocalAI is an open-source platform that allows users to run large language models and other AI systems locally on their own hardware. It acts as a drop-in replacement for APIs such as OpenAI, enabling developers to build AI-powered applications without relying on external cloud services. The platform supports a wide range of model types, including text generation, image creation, speech processing, and embeddings.

Downloads: 11 This Week

Last Update: 2026-05-30
See Project
4

GoAWK

A POSIX-compliant AWK interpreter written in Go, with CSV support

GoAWK now uses a bytecode compiler and includes native support for CSV files. AWK is a fascinating text processing language, and The AWK Programming Language is a wonderfully concise book describing it. The A, W, and K in AWK stand for the surnames of the three original creators: Alfred Aho, Peter Weinberger, and Brian Kernighan. Kernighan is also an author of The C Programming Language (“K&R”), and the two books have that same each-page-packs-a-punch feel. ...

Downloads: 4 This Week

Last Update: 2025-12-23
See Project
$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
5

Exq

Job processing library for Elixir - compatible with Resque / Sidekiq

Exq is a job processing library compatible with Resque / Sidekiq for the Elixir language. Exq uses Redis as a store for background processing jobs. Exq handles concurrency, job persistence, job retries, reliable queueing and tracking so you don't have to. Jobs are persistent so they would survive across node restarts. You can use multiple Erlang nodes to process from the same pool of jobs.

Downloads: 2 This Week

Last Update: 2025-11-04
See Project
6

Ultravox

Fast multimodal LLM for real-time voice interaction and AI apps

Ultravox is an open source multimodal large language model designed specifically for real-time voice-based interactions. It is built to process both text and spoken audio directly, eliminating the need for a separate speech recognition stage and enabling more seamless conversational experiences. Ultravox works by combining text prompts with encoded audio inputs, allowing it to understand spoken language alongside written instructions in a unified pipeline. Internally, it leverages pretrained...

Downloads: 2 This Week

Last Update: 2026-03-18
See Project
7

PowerInfer

High-speed Large Language Model Serving for Local Deployment

PowerInfer is a high-performance inference engine designed to run large language models efficiently on personal computers equipped with consumer-grade GPUs. The project focuses on improving the performance of local AI inference by optimizing how neural network computations are distributed between CPU and GPU resources. Its architecture exploits the observation that only a subset of neurons in large models are frequently activated, allowing the system to preload frequently used neurons into GPU memory while processing less common activations on the CPU. ...

Downloads: 0 This Week

Last Update: 2026-05-11
See Project
8

AI App Lab

Implementing large models into scenario-based applications

AI App Lab is an open-source platform developed by Volcengine that provides tools, SDKs, and example applications for building real-world AI applications powered by large language models. The project focuses on helping developers bridge the gap between AI models and practical business use cases by offering a structured environment for creating production-ready AI systems. It includes a high-level SDK called Arkitect, which provides workflows and tools for integrating models, plugins, and multimodal capabilities such as text, image, and voice processing.

Downloads: 1 This Week

Last Update: 2026-05-26
See Project
9

AI-Media2Doc

AI tool converting video/audio into structured documents instantly

AI-Media2Doc is a web-based application that uses large language models to convert video and audio content into structured, readable documents in a single workflow. It is designed to transform multimedia inputs into formats such as knowledge notes, summaries, mind maps, and social-style articles, making content easier to review and reuse. AI-Media2Doc emphasizes privacy by processing media locally in the browser using WebAssembly-based ffmpeg, ensuring that original video files are not uploaded externally. ...

Downloads: 2 This Week

Last Update: 2026-03-18
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
10

WhisperJAV

Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD

WhisperJAV is an open-source speech transcription pipeline designed specifically for generating subtitles for Japanese adult video content. The project addresses challenges that standard speech recognition models face when transcribing this type of audio, which often includes low signal-to-noise ratios and large numbers of non-verbal vocalizations. Traditional automatic speech recognition systems can misinterpret these sounds as words, leading to inaccurate transcripts. WhisperJAV introduces...

Downloads: 19 This Week

Last Update: 2026-05-11
See Project
11

docext

An on-premises, OCR-free unstructured data extraction

docext is a document intelligence toolkit that uses vision-language models to extract structured information from documents such as PDFs, forms, and scanned images. The system is designed to operate entirely on-premises, allowing organizations to process sensitive documents without relying on external cloud services. Unlike traditional document processing pipelines that rely heavily on optical character recognition, docext leverages multimodal AI models capable of understanding both visual and textual information directly from document images. ...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
12

Moonshine Voice

Fast and accurate automatic speech recognition (ASR) for edge devices

moonshine is an open-source automatic speech recognition toolkit optimized for fast and accurate transcription on edge devices and local environments. The project is designed to enable real-time voice applications such as live transcription, voice commands, and embedded speech interfaces without requiring heavy cloud infrastructure. Its architecture emphasizes low latency and flexible input handling, allowing audio streams of varying durations rather than relying on fixed processing windows....

Downloads: 6 This Week

Last Update: 6 days ago
See Project
13

DeepSeek-OCR

Contexts Optical Compression

...It supports local deployment, enabling organizations concerned about privacy or latency to run the pipeline on-premises rather than send sensitive documents to third-party cloud services. The codebase is written in Python with a focus on modularity: you can swap preprocessing, recognition, and post-processing components as needed for custom workflows.

Downloads: 16 This Week

Last Update: 2026-01-27
See Project
14

Bedrock Chat

AWS-native chatbot using Bedrock

Bedrock Chat is a mirrored version of an open-source project that provides a conversational interface for interacting with large language models and AI services through a chat-style application. The project typically focuses on delivering a user interface that allows individuals or teams to communicate with AI models, manage conversations, and experiment with prompts and responses. Implementations like Bedrock Chat often integrate with model hosting platforms or APIs that provide access to...

Downloads: 1 This Week

Last Update: 2026-04-09
See Project
15

dataline

AI data analysis and visualization on CSV, Postgres, MySQL, Snowflake

...The platform is designed with a privacy-first architecture that stores data locally on the user’s device rather than sending it to external cloud services by default. It can also hide sensitive data from language models during processing, ensuring that only necessary metadata is used for query generation.

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
16

Kotlin JDSL

Kotlin library that makes it easy to build and execute queries

Kotlin JDSL is a Kotlin library that makes it easy to build a query without a generated metamodel. There are many libraries that use an annotation processing tool (APT) to do the job. However, with the APT, you have to recompile whenever the name or type of field in an entity or a table class is changed. Kotlin JDSL provides a domain-specific language (DSL) based on KClass and KProperty to help you easily build queries without such inconveniences from the APT. Kotlin JDSL does not provide an executor or a wrapper class as it is designed to help you build and execute queries with the library you are using.

Downloads: 3 This Week

Last Update: 2026-05-11
See Project
17

AppWorld

World of apps for benchmarking interactive coding agent

AppWorld is a framework developed by Stony Brook University's NLP group to simulate environments for training and evaluating dialogue agents in task-oriented applications.

Downloads: 0 This Week

Last Update: 2026-02-17
See Project
18

Scrimage

JVM - Java, Kotlin, Scala image processing library

Scrimage is an immutable, functional, and performant JVM library for the manipulation of images. The aim of this library is to provide a simple and concise way to do common image operations, such as resizing to fit a required width and height, converting between formats, applying filters, and so on. It is easy to use from any language on the JVM. A typical use case for this library would be creating thumbnails of images uploaded by users in a web app, bounding a set of product images so that...

Downloads: 4 This Week

Last Update: 6 days ago
See Project
19

repo2txt

Web-based tool converts GitHub repository contents

repo2txt is an open-source developer tool that converts the contents of a code repository into a single structured text file that can be easily consumed by large language models. The tool is designed to address the challenge of analyzing entire codebases with AI assistants, where code is normally distributed across many files and directories. By collecting repository contents and formatting them into a single text document, repo2txt allows developers to feed complete projects into AI systems...

Downloads: 0 This Week

Last Update: 2026-03-08
See Project
20

NeMo Curator

Scalable data pre processing and curation toolkit for LLMs

NeMo Curator is a Python library specifically designed for fast and scalable dataset preparation and curation for large language model (LLM) use-cases such as foundation model pretraining, domain-adaptive pretraining (DAPT), supervised fine-tuning (SFT) and paramter-efficient fine-tuning (PEFT). It greatly accelerates data curation by leveraging GPUs with Dask and RAPIDS, resulting in significant time savings. The library provides a customizable and modular interface, simplifying pipeline...

Downloads: 0 This Week

Last Update: 2026-05-12
See Project
21

MiniMax-01

Large-language-model & vision-language-model based on Linear Attention

MiniMax-01 is the official repository for two flagship models: MiniMax-Text-01, a long-context language model, and MiniMax-VL-01, a vision-language model built on top of it. MiniMax-Text-01 uses a hybrid attention architecture that blends Lightning Attention, standard softmax attention, and Mixture-of-Experts (MoE) routing to achieve both high throughput and long-context reasoning. It has 456 billion total parameters with 45.9 billion activated per token and is trained with advanced parallel...

Downloads: 0 This Week

Last Update: 2025-12-01
See Project
22

Ai-Learn

The artificial intelligence learning roadmap compiles 200 cases

...The repository was created to help learners start self-study programs in artificial intelligence without getting overwhelmed by the large number of available resources. It organizes topics such as Python programming, mathematics for machine learning, data analysis, deep learning, computer vision, and natural language processing into a structured learning path. The project also provides a large collection of practical exercises and case studies that allow learners to apply theoretical knowledge through real projects. According to the repository description, it includes nearly two hundred hands-on AI examples developed through years of teaching experience.

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
23

Medeo Video Generator

AI-powered video generation skill for OpenClaw

Medeo Video Generator is an AI-driven project designed to enable advanced video processing and generation capabilities within agent-based or automation systems. It provides a “skill” module that can be integrated into AI agents, allowing them to create, edit, and manipulate video content programmatically. The project focuses on bridging the gap between language-based AI systems and multimedia outputs by enabling models to produce structured video content as part of their workflows. ...

Downloads: 6 This Week

Last Update: 2026-04-08
See Project
24

Short Video Factory

AI tool for automatic batch short video creation and editing

Short Video Factory is an open source desktop application designed to simplify the creation of short-form videos using AI-driven automation. It enables users to generate product marketing clips and general content videos by combining simple prompt-based input with pre-prepared media assets. Short Video Factory integrates multiple stages of video production, including script generation, voice synthesis, video editing, and subtitle effects, into a single streamlined workflow. By leveraging AI...

Downloads: 6 This Week

Last Update: 2026-04-07
See Project
25

LLM Scraper

Extract structured data from webpages using LLM-powered scraping

...LLM Scraper integrates browser automation through Playwright, allowing it to load webpages and process their content before sending it to a language model for interpretation. Multiple content processing modes are supported, including raw HTML, cleaned HTML, Markdown, extracted text, screenshots, and custom inputs, making it adaptable to a wide range of scraping scenarios. LLM Scraper also provides streaming output and code generation capabilities that help developers build reusable scraping workflows.

Downloads: 1 This Week

Last Update: 4 days ago
See Project

Previous
4
5
6
7
You're on page 8
9
10
11
12
Next

Related Searches

ocr

uncensored ai

gpt4all

ai

gawk

whisperjav

uncensored ai image

ai applications

run locally

open source ai website builder for ubuntu

Related Categories

Artificial Intelligence

Software Development

Scientific/Engineering

Text Editors

Multimedia

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise