Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence Software
Search Results

Search Results for "language all" - Page 2

x

Sort By:

Relevance

Clear All Filters

OS

Windows 75
Linux 73
Mac 67
More...
BSD 15
ChromeOS 15

Category

Artificial Intelligence 79
Software Development 4
Business 3
Scientific/Engineering 3
Desktop Environment 1
Education 1
Internet 1

License

OSI-Approved Open Source 71
Creative Commons Attribution License 1
Other License 1

Programming Language

Python 79
JavaScript 2
TypeScript 2
C++ 1
Go 1
More...
Rust 1
Unix Shell 1

Status

Production/Stable 4
Pre-Alpha 1
Beta 1

Showing 79 open source projects for "language all"

View related business solutions

Artificial Intelligence Python Clear Filters & Widen Search

Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.

Start Free
$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
1

LOTUS

AI-Powered Data Processing: Use LOTUS to process all of your datasets

LOTUS is an open-source framework and query engine designed to enable efficient processing of structured and unstructured datasets using large language models. The system provides a declarative programming model that allows developers to express complex AI data operations using high-level commands rather than manually orchestrating model calls. It offers a Python interface with a Pandas-like API, making it familiar for data scientists and engineers already working with data analysis...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
2

BertViz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

...It is based on the excellent Tensor2Tensor visualization tool. The model view shows a bird's-eye view of attention across all layers and heads. The neuron view visualizes individual neurons in the query and key vectors and shows how they are used to compute attention.

Downloads: 1 This Week

Last Update: 2025-06-01
See Project
3

Codeflash

Optimize your code automatically with AI

Codeflash is a general-purpose optimizer for Python that uses advanced large language models (LLMs) to automatically generate, test, and benchmark multiple optimization ideas, then creates merge-ready pull requests with the best improvements for your code. Optimize an entire existing codebase by running codeflash --all. Automate optimizing all future code you will write by installing Codeflash as a GitHub action.

Downloads: 0 This Week

Last Update: 2026-04-02
See Project
4

OpenRecall

OpenRecall is a fully open-source, privacy-first alternative

...It works by taking periodic screenshots of a user’s screen and applying local AI processing, including OCR and semantic analysis, to extract and structure information from both text and images. This data is then indexed into a searchable database, allowing users to retrieve past information quickly using natural language queries. Unlike proprietary alternatives, OpenRecall operates entirely locally, ensuring that all captured data remains on the user’s device and is never transmitted to external servers. The platform supports multiple operating systems, including Windows, macOS, and Linux, making it widely accessible across different environments.

Downloads: 2 This Week

Last Update: 2026-03-18
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
5

Ludwig AI

Low-code framework for building custom LLMs, neural networks

Declarative deep learning framework built for scale and efficiency. Ludwig is a low-code framework for building custom AI models like LLMs and other deep neural networks. Declarative YAML configuration file is all you need to train a state-of-the-art LLM on your data. Support for multi-task and multi-modality learning. Comprehensive config validation detects invalid parameter combinations and prevents runtime failures. Automatic batch size selection, distributed training (DDP, DeepSpeed),...

Downloads: 0 This Week

Last Update: 2 days ago
See Project
6

UI-TARS

UI-TARS-desktop version that can operate on your local personal device

UI-TARS is an open-source multimodal “GUI agent” created by ByteDance: a model designed to perceive raw screenshots (or rendered UI frames), reason about what needs to be done, and then perform real interactions with graphical user interfaces (GUIs) — like clicking, typing, navigating menus — across desktop, browser, mobile, or game environments. Rather than relying on rigid, manually scripted UI automation, UI-TARS uses a unified vision-language model (VLM) that integrates perception, reasoning, grounding, and action into one end-to-end framework: it “thinks before acting,” enabling flexible, general-purpose automation. This allows it to perform complex, multi-step tasks such as filling forms, downloading files, navigating applications, and even controlling in-game actions — all by understanding the UI as a human would. ...

Downloads: 13 This Week

Last Update: 2025-12-01
See Project
7

TapeAgents

A framework that facilitates all stages of LLM development

TapeAgents is a framework that facilitates all stages of the Large Language Model (LLM) agent development lifecycle, providing tools for building, testing, and deploying AI agents.

Downloads: 0 This Week

Last Update: 2025-08-19
See Project
8

VibeVoice

Open-source multi-speaker long-form text-to-speech model

...A key innovation is its use of continuous acoustic and semantic speech tokenizers operating at an ultra-low frame rate of 7.5 Hz, enabling high audio fidelity with efficient processing of long sequences. The model integrates a Qwen2.5-based large language model with a diffusion head to produce realistic acoustic details and capture conversational context. Training involved curriculum learning with increasing sequence lengths up to 65K tokens, allowing VibeVoice to handle very long dialogues effectively. Safety mechanisms include an audible disclaimer and imperceptible watermarking in all generated audio to mitigate misuse risks.

Downloads: 8 This Week

Last Update: 2026-05-06
See Project
9

refinery

Open-source choice to scale, assess and maintain natural language data

...Also, the makers of refinery currently work on integrations to other labeling tools, such that you can easily switch between different choices. refinery is a multi-repository project, you can find all integrated services in the architecture below. The app builds on top of Hugging Face and spaCy to leverage pre-built language models for your NLP tasks, as well as qdrant for neural search.

Downloads: 0 This Week

Last Update: 2024-06-13
See Project
Stop Storing Third-Party Tokens in Your Database
Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.

Try Auth0 for Free
10

Datasets

Hub of ready-to-use datasets for ML models

Datasets is a library for easily accessing and sharing datasets, and evaluation metrics for Natural Language Processing (NLP), computer vision, and audio tasks. Load a dataset in a single line of code, and use our powerful data processing methods to quickly get your dataset ready for training in a deep learning model. Backed by the Apache Arrow format, process large datasets with zero-copy reads without any memory constraints for optimal speed and efficiency.

Downloads: 1 This Week

Last Update: 2026-04-27
See Project
11

Artificial Intelligence for Beginners

12 Weeks, 24 Lessons, AI for All

AI-For-Beginners is a comprehensive open-source educational curriculum designed to introduce learners to the foundations of artificial intelligence through structured lessons and hands-on practice. The repository provides a 12-week program composed of 24 lessons that combine theory, code examples, quizzes, and laboratory exercises. It covers a broad range of topics including neural networks, computer vision, natural language processing, and AI ethics. The curriculum is intentionally...

Downloads: 0 This Week

Last Update: 2026-04-24
See Project
12

Mistral Vibe CLI

Minimal CLI coding agent by Mistral

Mistral Vibe is an AI-powered “vibe-coding” command-line interface (CLI) and coding-assistant framework built by Mistral AI to let developers write, refactor, search, and manage code through natural language and context-aware automation, rather than manual typing only. It aims to take developers out of repetitive boilerplate and let them stay “in the flow”: you can ask the tool to generate functions, refactor code, search across the codebase, manipulate files, commit changes via Git, or run commands — all from a unified CLI interface. ...

Downloads: 7 This Week

Last Update: 15 hours ago
See Project
13

TaskingAI

Open platform for building, deploying, and managing LLM agents

TaskingAI is an open source platform designed to simplify the development and deployment of applications powered by large language models. It follows a Backend as a Service approach, allowing developers to separate AI logic from frontend product development while maintaining a structured and scalable workflow. TaskingAI integrates hundreds of language models from multiple providers into a unified system, enabling developers to switch models or combine capabilities without major reconfiguration. ...

Downloads: 1 This Week

Last Update: 2026-03-17
See Project
14

OmniBox

Collect, organize, use, and share, all in OmniBox

Omnibox (mirror) is a SourceForge mirror of the Omnibox open-source project, which provides a software interface designed to simplify interaction with multiple tools and services through a unified command or search interface. The project focuses on creating a centralized input field where users can enter commands, queries, or shortcuts that trigger actions across different applications or services. Inspired by the omnibox concept used in modern browsers, the system combines search...

Downloads: 1 This Week

Last Update: 2 days ago
See Project
15

BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics

BERTopic is a topic modeling technique that leverages transformers and c-TF-IDF to create dense clusters allowing for easily interpretable topics whilst keeping important words in the topic descriptions. BERTopic supports guided, supervised, semi-supervised, manual, long-document, hierarchical, class-based, dynamic, and online topic modeling. It even supports visualizations similar to LDAvis! Corresponding medium posts can be found here, here and here. For a more detailed overview, you can...

Downloads: 1 This Week

Last Update: 2025-12-03
See Project
16

cognee

Deterministic LLMs Outputs for AI Applications and AI Agents

Cognee implements scalable, modular data pipelines that allow for creating the LLM-enriched data layer using graph and vector stores. Cognee acts a semantic memory layer, unveiling hidden connections within your data and infusing it with your company's language and principles. This self-optimizing process ensures ultra-relevant, personalized, and contextually aware LLM retrievals. Any kind of data works; unstructured text or raw media files, PDFs, tables, presentations, JSON files, and so many more. Add small or large files, or many files at once. We map out a knowledge graph from all the facts and relationships we extract from your data. ...

Downloads: 5 This Week

Last Update: 4 days ago
See Project
17

Infinity

Low-latency REST API for serving text-embeddings

Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting all sentence-transformer models and frameworks. Infinity is developed under MIT License. Infinity powers inference behind Gradient.ai and other Embedding API providers.

Downloads: 0 This Week

Last Update: 2025-08-22
See Project
18

Swirl

Swirl queries any number of data sources with APIs

Swirl queries any number of data sources with APIs and uses spaCy and NLTK to re-rank the unified results without extracting and indexing anything! Includes zero-code configs for Apache Solr, ChatGPT, Elastic Search, OpenSearch, PostgreSQL, Google BigQuery, RequestsGet, Google PSE, NLResearch.com, Miro & more! SWIRL adapts and distributes queries to anything with a search API - search engines, databases, noSQL engines, cloud/SaaS services etc - and uses AI (Large Language Models) to re-rank...

Downloads: 0 This Week

Last Update: 2 days ago
See Project
19

gensim

Topic Modelling for Humans

Gensim is a Python library for topic modeling, document indexing, and similarity retrieval with large corpora. The target audience is the natural language processing (NLP) and information retrieval (IR) community.

Downloads: 0 This Week

Last Update: 2025-10-16
See Project
20

DeepEval

DeepEval is a simple-to-use, open-source LLM evaluation framework, for evaluating and testing large-language model systems. It is similar to Pytest but specialized for unit testing LLM outputs. DeepEval incorporates the latest research to evaluate LLM outputs based on metrics such as G-Eval, hallucination, answer relevancy, RAGAS, etc., which uses LLMs and various other NLP models that run locally on your machine for evaluation. Whether your application is implemented via RAG or fine-tuning,...

Downloads: 3 This Week

Last Update: 2026-05-13
See Project
21

Evo 2

Genome modeling and design across all domains of life

Evo 2 is a DNA language model system designed for long-context genome modeling and biological sequence design across all domains of life. The project models DNA at single-nucleotide resolution and supports context windows of up to one million base pairs, which places it in a class of models built for very large genomic reasoning tasks. According to the repository, it uses the StripedHyena 2 architecture, was pretrained with Savanna, and was trained autoregressively on the OpenGenome2 dataset containing 8.8 trillion tokens. ...

Downloads: 0 This Week

Last Update: 2026-04-20
See Project
22

Deep Lake

Data Lake for Deep Learning. Build, manage, and query datasets

Deep Lake (formerly known as Activeloop Hub) is a data lake for deep learning applications. Our open-source dataset format is optimized for rapid streaming and querying of data while training models at scale, and it includes a simple API for creating, storing, and collaborating on AI datasets of any size. It can be deployed locally or in the cloud, and it enables you to store all of your data in one place, ranging from simple annotations to large videos. Deep Lake is used by Google, Waymo,...

Downloads: 0 This Week

Last Update: 2026-02-12
See Project
23

SHAP

A game theoretic approach to explain the output of ml models

SHAP (SHapley Additive exPlanations) is a game theoretic approach to explain the output of any machine learning model. It connects optimal credit allocation with local explanations using the classic Shapley values from game theory and their related extensions. While SHAP can explain the output of any machine learning model, we have developed a high-speed exact algorithm for tree ensemble methods. Fast C++ implementations are supported for XGBoost, LightGBM, CatBoost, scikit-learn and pyspark...

Downloads: 0 This Week

Last Update: 2026-03-03
See Project
24

DocTR

Library for OCR-related tasks powered by Deep Learning

...Easy integration (available templates for browser demo & API deployment). End-to-End OCR is achieved in docTR using a two-stage approach: text detection (localizing words), then text recognition (identify all characters in the word). As such, you can select the architecture used for text detection, and the one for text recognition from the list of available implementations.

Downloads: 4 This Week

Last Update: 2026-05-05
See Project
25

Qwen3-Omni

Qwen3-omni is a natively end-to-end, omni-modal LLM

Qwen3-Omni is a natively end-to-end multilingual omni-modal foundation model that processes text, images, audio, and video and delivers real-time streaming responses in text and natural speech. It uses a Thinker-Talker architecture with a Mixture-of-Experts (MoE) design, early text-first pretraining, and mixed multimodal training to support strong performance across all modalities without sacrificing text or image quality. The model supports 119 text languages, 19 speech input languages, and...

Downloads: 1 This Week

Last Update: 2026-04-23
See Project

Previous
1
You're on page 2
3
4
Next

Related Searches

craxsrat-7.5

artificial intelligence projects

ai

dataset

offline artificial intelligence\

mistral

ocr

train ai

python ai

•mobile phone forensics tools

Related Categories

Artificial Intelligence

Software Development

Business

Scientific/Engineering

Desktop Environment

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise