Search Results for "open source png text"

Showing 99 open source projects for "open source png text"

1

Open Semantic Search

Open source semantic search and text analytics for large document sets

Open Semantic Search is an open source research and analytics platform designed for searching, analyzing, and exploring large collections of documents using semantic search technologies. It provides an integrated search server combined with a document processing pipeline that supports crawling, text extraction, and automated analysis of content from many different sources.

Downloads: 5 This Week

Last Update: 4 hours ago
2

Open-LLM-VTuber

Open source AI VTuber platform with voice chat and Live2D avatars

Open-LLM-VTuber is an open source platform designed to create AI-powered VTuber characters that can interact with users through voice and animated avatars. It enables hands-free conversations with large language models by combining speech recognition, language processing, and text-to-speech synthesis into a single system. Users can speak directly to the AI character, and the system can respond with a generated voice while animating a Live2D avatar to simulate a talking virtual personality. ...

Downloads: 4 This Week

Last Update: 2026-03-17
3

Text-to-image Playground

A playground to generate images from any text prompt using Stable Diffusion

dalle-playground is an open-source web application that allows users to generate images from natural language text prompts using modern text-to-image generative models. Originally built around DALL-E Mini, the project later transitioned to using Stable Diffusion, enabling more detailed and higher-quality image synthesis. The system combines a backend machine learning service with a browser-based frontend interface that lets users experiment interactively with prompt engineering and generative AI. ...

Downloads: 0 This Week

Last Update: 2026-03-11
4

Text Embeddings Inference

High-performance inference server and API layer for text embedding models

Text Embeddings Inference is a high-performance server designed to serve text embedding models efficiently in production environments. It focuses on delivering fast and scalable embedding generation by leveraging optimized inference techniques and modern hardware acceleration. It is built to support transformer-based embedding models, making it suitable for tasks such as semantic search, clustering, and retrieval-augmented systems. It provides an API interface that allows developers to...

Downloads: 0 This Week

Last Update: 2026-03-23
5

Tesseract.js

Pure JavaScript multilingual OCR

Tesseract.js is a pure JavaScript port of the popular Tesseract OCR engine. The library supports more than 100 languages, automatic text orientation and script detection, and a simple interface for reading paragraph, word, and character bounding boxes. Tesseract.js can run either in a browser or on a server with Node.js. It gets words in almost any spoken language out of images. The main Tesseract.js functions (e.g. recognize, detect) take an image...

Downloads: 10 This Week

Last Update: 2025-12-15
6

Scribe.js

JavaScript OCR and text extraction for images and PDFs

Scribe.js is a JavaScript library that provides Optical Character Recognition (OCR) and text extraction capabilities for both images and PDF documents, aimed at developers who want to build OCR features directly into their applications. The library can take image files (such as PNG or JPEG) and recognize the text they contain, and it can also extract text from PDF files that either already contain text or are image-based scans, using modern web standards and WebAssembly under the hood. In...

Downloads: 4 This Week

Last Update: 2026-05-27
7

Pot Desktop

Cross-platform software for text translation and recognition

Pot-Desktop is a cross-platform productivity tool aimed at helping users quickly translate, perform OCR (optical character recognition), and synthesize speech for selected text or images — all with minimal friction. It supports picking text via mouse selection (“highlight-and-translate”), clipboard listening, or screenshot-based OCR; this makes it ideal for reading webpages, documents, images — or any on-screen text — and instantly getting translations or text extraction. The tool supports...

Downloads: 19 This Week

Last Update: 2025-11-28
8

compromise

Modest natural-language processing

Language is complicated and there's a gazillion words. Compromise is a JavaScript library that interprets and pre-parses text and makes some reasonable decisions so things are way easier. Compromise tries its best to parse text. It is small, quick, and often good enough. It is not as smart as you'd think. Conjugate and negate verbs in any tense. Play between plural, singular and possessive forms. Interpret plain-text numbers. Handle implicit terms. Use it on the client-side or as an...

Downloads: 7 This Week

Last Update: 7 days ago
9

Easy Diffusion

An easy 1-click way to create beautiful artwork on your PC using AI

Easy Diffusion is a widely used community-driven repository offering a simple, one-click way to install and use Stable Diffusion-based generative AI on a personal computer without advanced technical skills or prior setup. It provides a browser-based user interface that runs locally, allowing users to type text prompts and immediately generate images directly within their web browser, democratizing access to powerful text-to-image models for artists and hobbyists alike. The project abstracts...

Downloads: 39 This Week

Last Update: 2026-03-31
10

SillyTavern

LLM Frontend for Power Users

Mobile-friendly, Multi-API (KoboldAI/CPP, Horde, NovelAI, Ooba, OpenAI, OpenRouter, Claude, Scale), VN-like Waifu Mode, Horde SD, System TTS, WorldInfo (lorebooks), customizable UI, auto-translate, and more prompt options than you'd ever want or need. Optional Extras server for more SD/TTS options + ChromaDB/Summarize. SillyTavern is a user interface you can install on your computer (and Android phones) that allows you to interact with text generation AIs and chat/roleplay with characters...

Downloads: 722 This Week

Last Update: 2026-05-03
11

Diffusion Bee

Diffusion Bee is the easiest way to run Stable Diffusion locally

Diffusion Bee is a user-friendly local application designed to make running the Stable Diffusion text-to-image generative model as simple as possible on macOS machines, including both Intel and Apple Silicon. It wraps Stable Diffusion and its dependencies into a one-click installer so users don’t need to manually install Python, drivers, or machine-learning frameworks to generate images. The app runs entirely on the local machine so images are created offline and no user data is sent to...

Downloads: 23 This Week

Last Update: 2026-02-03
12

Supertonic

Lightning-fast, on-device TTS, running natively via ONNX

Supertonic is a lightning-fast, on-device text-to-speech system built around ONNX Runtime for maximum speed and portability. It focuses on running entirely locally, eliminating the need for cloud APIs and providing low latency and strong privacy guarantees, even on constrained devices like Raspberry Pi boards and e-readers. The core model is highly compact at around 66 million parameters, yet benchmarks show it can generate speech up to 167× faster than real time on modern consumer hardware...

Downloads: 5 This Week

Last Update: 2026-01-06
13

Search-Index

A persistent, network-resilient, full-text search library

Search-Index is a lightweight and fast JavaScript-based search engine that enables full-text search indexing and retrieval for web applications.

Downloads: 0 This Week

Last Update: 2025-03-12
14

canvas-editor

Canvas-based WYSIWYG rich text editor with advanced layout tools

canvas-editor is a browser-based rich text editor that renders content using HTML5 Canvas and SVG instead of traditional DOM-based approaches. It is designed to provide a WYSIWYG editing experience similar to word processors, enabling precise control over layout, rendering, and document structure. canvas-editor supports a wide range of formatting and document features, including text styling, tables, images, and embedded elements, all managed through a structured data model. Its architecture...

Downloads: 2 This Week

Last Update: 4 hours ago
15

natural

General natural language facilities for node

"Natural" is a general natural language facility for Node.js. It offers a broad range of functionalities for natural language processing. Tokenizing, stemming, classification, phonetics, tf-idf, WordNet, string similarity, and some inflections are currently supported. It’s still in the early stages, so we’re very interested in bug reports, contributions and the like. Note that many algorithms from Rob Ellis’s node-nltools are being merged into this project and will be maintained from here...

Downloads: 0 This Week

Last Update: 2026-02-18
16

Agili Hacker Podcast

AI tool that turns Hacker News posts into daily podcast updates

...As an open-source tool, it also encourages community contributions and customization for developers who want to adapt or extend its workflow for similar AI-driven content pipelines.

Downloads: 3 This Week

Last Update: 4 days ago
17

FAY

Framework for building AI-powered interactive digital humans and agents

Fay is an open source framework designed to build and deploy interactive digital humans powered by large language models. It acts as a middleware layer that connects digital character technologies with conversational AI systems and business applications. Fay supports various types of digital humans, including 2.5D and 3D avatars, and can be integrated with applications running on mobile devices, PCs, web platforms, and embedded systems.

Downloads: 4 This Week

Last Update: 3 days ago
18

AUTOMATIC1111 Stable Diffusion web UI

Stable Diffusion web UI

AUTOMATIC1111's stable-diffusion-webui is a powerful, user-friendly web interface built on the Gradio library that allows users to easily interact with Stable Diffusion models for AI-powered image generation. Supporting both text-to-image (txt2img) and image-to-image (img2img) generation, this open-source UI offers a rich feature set including inpainting, outpainting, attention control, and multiple advanced upscaling options. With a flexible installation process across Windows, Linux, and Apple Silicon, plus support for GPUs and CPUs, it caters to a wide range of users—from hobbyists to professionals. ...

1 Review

Downloads: 168 This Week

Last Update: 2025-06-02
19

Node.js Client For NLP Cloud

NLP Cloud serves high performance pre-trained or custom models

This is the Node.js client (with TypeScript types) for the NLP Cloud API. NLP Cloud serves high-performance pre-trained or custom models for NER, sentiment analysis, classification, summarization, dialogue summarization, paraphrasing, intent classification, product description and ad generation, chatbot, grammar and spelling correction, keywords and keyphrases extraction, text generation, image generation, blog post generation, question answering, automatic speech...

Downloads: 0 This Week

Last Update: 2024-11-27
20

PyGPT

Open source personal AI Assistant for Linux, Windows and Mac

PyGPT is a desktop application that lets you talk to OpenAI's LLMs such as GPT-4 and GPT-3 from your own computer through the OpenAI API. It supports chat mode and completion mode, and can generate images using DALL-E 2. PyGPT also gives GPT access to the Internet via the Google Custom Search API and the Wikipedia API, and includes voice synthesis using the Microsoft Azure Text-to-Speech API. Moreover, the application implements context memory support, context storage,...

Downloads: 11 This Week

Last Update: 2026-02-06
21

Vectorize MCP Server

Official Vectorize MCP Server

The Vectorize MCP Server is a Model Context Protocol server that integrates with Vectorize, offering advanced vector retrieval and text extraction capabilities.

Downloads: 0 This Week

Last Update: 2025-04-08
22

ALLWEONE

AI tool that generates custom presentations with real-time editing

...Presentation AI by ALLWEONE includes image generation, rich text editing, and drag-and-drop functionality for easy adjustments. It also supports presentation mode, so you can present directly within the app. Built with modern technologies like Next.js, React, and Tailwind CSS, it integrates AI services such as OpenAI for content generation. It is fully open source under the MIT licence, making it suitable for developers who want to customise or extend its capabilities.

Downloads: 5 This Week

Last Update: 4 days ago
23

Generative AI for Beginners (Version 3)

21 Lessons, Get Started Building with Generative AI

...Lessons are split into “Learn” modules for core concepts and “Build” modules with hands-on code in Python and TypeScript, so you can jump in at any point that matches your goals. The course covers everything from model selection, prompt engineering, and chat/text/image app patterns to secure development practices and UX for AI. It also walks through modern application techniques such as function calling, RAG with vector databases, working with open source models, agents, fine-tuning, and using SLMs. Each lesson includes a short video, a written guide, runnable samples for Azure OpenAI, the GitHub Marketplace Model Catalog, and the OpenAI API, plus a “Keep Learning” section for deeper study.

Downloads: 5 This Week

Last Update: 3 days ago
24

FastRTC

The Python library for real-time communication

FastRTC is a Python library designed to simplify real-time communication (RTC), especially for audio and video streaming applications. It abstracts away much of the complexity that typically comes with implementing WebRTC by providing a simple interface — e.g. a Stream class — that can be mounted within a web backend (for example a FastAPI application). This makes it particularly well suited for building real-time voice (or video) interfaces for applications such as AI assistants, live chat,...

Downloads: 0 This Week

Last Update: 2025-11-28
25

Stable Diffusion web UI for AMDGPUs

Stable Diffusion WebUI optimized for AMD GPUs with editing tools

Stable Diffusion WebUI AMDGPU is a browser-based interface for generating images using Stable Diffusion, built with Gradio and adapted for AMD graphics hardware. It provides both text-to-image and image-to-image workflows, allowing users to create, refine, and upscale visuals within a single interface. It includes tools such as inpainting and outpainting for editing specific areas of an image, along with features like prompt matrix generation and attention controls to fine-tune outputs....

Downloads: 8 This Week

Last Update: 2026-03-19
