Search Results for "open source png text" - Page 13

Showing 1604 open source projects for "open source png text"

Active filter: Python

1

DeepSeek-OCR 2

Visual Causal Flow

DeepSeek-OCR-2 is the second-generation optical character recognition system developed to improve document understanding by introducing a “visual causal flow” mechanism, enabling the encoder to reorder visual tokens in a way that better reflects semantic structure rather than strict raster scan order. It is designed to handle complex layouts and noisy documents by giving the model causal reasoning capabilities that mimic human visual scanning behavior, enhancing OCR performance on documents...

Downloads: 6 This Week

Last Update: 2026-02-03
2

OuteTTS

Interface for OuteTTS models

OuteTTS is an interface library for running OuteTTS text-to-speech models across a range of backends, making it easier to deploy the same model on different hardware and runtimes. It provides a high-level Interface API that wraps model configuration, speaker handling, and audio generation so you can focus on integrating speech into your application rather than wiring up low-level engines. The project supports multiple backends including llama.cpp (Python bindings and server), Hugging Face...

Downloads: 0 This Week

Last Update: 2025-11-28
3

LangCheck

Simple, Pythonic building blocks to evaluate LLM applications

Simple, Pythonic building blocks to evaluate LLM applications.

Downloads: 0 This Week

Last Update: 2024-12-12
4

marqo

Tensor search for humans

A tensor-based search and analytics engine that seamlessly integrates with your applications, websites, and workflows. Marqo is a versatile and robust search and analytics engine that can be integrated into any website or application. Due to horizontal scalability, Marqo provides lightning-fast query times, even with millions of documents. Marqo helps you configure deep-learning models like CLIP to pull semantic meaning from images. It can seamlessly handle image-to-image, image-to-text and...

Downloads: 0 This Week

Last Update: 2026-04-02
5

BettaFish

Public opinion analysis system

BettaFish is an open-source, multi-agent public opinion analysis system built to automate the collection, deep analysis, and reporting of social media data at scale through conversational queries. It uses a modular architecture of specialized agents that collaborate to crawl mainstream platforms, extract multimodal content like text and short video, and synthesize insights through both statistical and large language model techniques.

Downloads: 0 This Week

Last Update: 2026-02-17
6

Qwen3-ASR

Qwen3-ASR is an open-source series of ASR models

Qwen3-ASR is an automatic speech recognition system in the QwenLM family, developed to convert spoken language into text with strong accuracy and real-time performance. As a specialized ASR variant of the broader Qwen language model ecosystem, it focuses on capturing reliable transcriptions from audio sources such as recordings, live streams, or conversational inputs while supporting low latency use cases. The architecture combines advanced neural acoustic modeling with context-aware...

Downloads: 0 This Week

Last Update: 2026-02-09
7

MiniMind-V

"Big Model" trains a visual multimodal VLM with 26M parameters

MiniMind-V is an experimental open-source project that aims to train a very small multimodal vision–language model (VLM) from scratch with extremely low compute and cost, making research and experimentation accessible to more people. The repository showcases training workflows and code designed to produce a 26-million parameter model—including both image and text capabilities—using minimal resources in very little time, reflecting a trend toward democratizing AI research. ...

Downloads: 0 This Week

Last Update: 2026-01-21
8

jrnl

Collect your thoughts and notes without leaving the command line

Collect your thoughts and notes without leaving the command line. jrnl has a natural-language interface so you don't have to remember cryptic shortcuts when you're writing down your thoughts. Your journals are stored in plain-text files that will still be readable in 50 years when all your fancy iPad apps will have gone the way of the Dodo. Encrypt your journals with industry-strength AES encryption. The NSA won't be able to read your dirty secrets. Sync your journals with Dropbox and...

Downloads: 0 This Week

Last Update: 2024-11-17
9

Bolna

Conversational voice AI agents

Bolna is an end-to-end open-source platform for building conversational voice AI agents, enabling developers to create voice-first conversational assistants efficiently.

Downloads: 1 This Week

Last Update: 3 hours ago
10

Google Antigravity SDK

Python library for building agents that leverages Google Antigravity

Google Antigravity SDK for Python is a Python library for building AI agents powered by Antigravity and Gemini. It provides a secure, scalable, and stateful infrastructure layer so developers can focus on agent behavior instead of manually implementing the full agent loop. The SDK includes a high-level Agent class for quick setup, as well as lower-level conversation and connection abstractions for more controlled workflows. It supports streaming responses, stateful sessions, custom Python...

Downloads: 11 This Week

Last Update: 3 days ago
11

cognee

Deterministic LLM Outputs for AI Applications and AI Agents

Cognee implements scalable, modular data pipelines that allow for creating the LLM-enriched data layer using graph and vector stores. Cognee acts as a semantic memory layer, unveiling hidden connections within your data and infusing it with your company's language and principles. This self-optimizing process ensures ultra-relevant, personalized, and contextually aware LLM retrievals. Any kind of data works: unstructured text or raw media files, PDFs, tables, presentations, JSON files, and so...

Downloads: 8 This Week

Last Update: 4 days ago
12

Databend

Cloud-native open source data warehouse for analytics and AI queries

Databend is an open source cloud-native data warehouse designed for large-scale analytics and modern data workloads. Built in Rust, the system focuses on high performance, scalability, and efficient data processing for analytical queries. It is designed with a separation of compute and storage, allowing compute nodes to scale independently while storing data in object storage systems.

Downloads: 0 This Week

Last Update: 2026-04-17
13

NVIDIA NeMo Framework

Scalable generative AI framework built for researchers and developers

NVIDIA NeMo is a scalable, cloud-native generative AI framework aimed at researchers and PyTorch developers working on large language models, multimodal models, and speech AI (ASR and TTS), with growing support for computer vision. It provides collections of domain-specific modules and reference implementations that make it easier to pre-train, fine-tune, and deploy very large models on multi-GPU and multi-node infrastructure. NeMo 2.0 introduces a Python-based configuration system,...

Downloads: 2 This Week

Last Update: 2026-04-22
14

iTerm2 Color Schemes

Over 425 terminal color schemes/themes for iTerm/iTerm2

This project curates a large collection of terminal color schemes and makes them available in formats for many terminal emulators, not just iTerm2. You’ll find well-known palettes like Solarized, Dracula, Nord, and hundreds more, each with previews that showcase how code, prompts, and text look under the theme. The repository includes export files for multiple terminals—such as iTerm2, Apple Terminal, Alacritty, Kitty, Windows Terminal, and others—so you can apply the same aesthetic...

Downloads: 14 This Week

Last Update: 2026-05-25
15

Phi-3-MLX

Phi-3.5 for Mac: Locally-run Vision and Language Models

Phi-3-Vision-MLX is an Apple MLX (machine learning on Apple silicon) implementation of Phi-3 Vision, a lightweight multi-modal model designed for vision and language tasks. It focuses on running vision-language AI efficiently on Apple hardware like M1 and M2 chips.

Downloads: 3 This Week

Last Update: 2025-03-13
16

refinery

Open-source choice to scale, assess and maintain natural language data

The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact. You are one of the people we've built refinery for. refinery helps you to build better NLP models in a data-centric approach. Semi-automate your labeling, find low-quality subsets in your training data, and monitor your data in one place. refinery doesn't get rid of manual labeling, but it makes sure that your valuable time is spent well. Also, the makers...

Downloads: 0 This Week

Last Update: 2024-06-13
17

Argilla

The open-source data curation platform for LLMs

Argilla is a production-ready framework for building and improving datasets for NLP projects. Deploy your own Argilla Server on Spaces with a few clicks. Use embeddings to find the most similar records with the UI. This feature uses vector search combined with traditional search (keyword and filter based). Argilla is free, open-source, and 100% compatible with major NLP libraries (Hugging Face transformers, spaCy, Stanford Stanza, Flair, etc.). In fact, you can use and combine your preferred...

Downloads: 0 This Week

Last Update: 2025-03-10
18

ChatterBot

Machine learning, conversational dialog engine for creating chat bots

ChatterBot is a Python library that makes it easy to generate automated responses to a user’s input. ChatterBot uses a selection of machine learning algorithms to produce different types of responses. This makes it easy for developers to create chat bots and automate conversations with users. For more details about the ideas and concepts behind ChatterBot see the process flow diagram. The language independent design of ChatterBot allows it to be trained to speak any language. Additionally,...

Downloads: 2 This Week

Last Update: 2026-03-24
19

GraphRAG

A modular graph-based Retrieval-Augmented Generation (RAG) system

The GraphRAG project is a data pipeline and transformation suite that is designed to extract meaningful, structured data from unstructured text using the power of LLMs.

Downloads: 0 This Week

Last Update: 6 days ago
20

MOSS-TTS-Nano

MOSS-TTS-Nano is an open-source multilingual tiny speech generation

MOSS-TTS-Nano is a lightweight text-to-speech model designed for real-time voice generation in resource-constrained environments. It is part of the broader MOSS-TTS family and focuses on delivering high-quality speech synthesis with a compact architecture. The model operates efficiently on CPU-only systems, enabling deployment without specialized hardware. It supports multilingual voice cloning and produces high-fidelity audio with low latency. The system uses an autoregressive audio...

Downloads: 5 This Week

Last Update: 1 day ago
21

NetworkX

Network analysis in Python

...Many standard graph algorithms. Network structure and analysis measures. Generators for classic graphs, random graphs, and synthetic networks. Nodes can be "anything" (e.g., text, images, XML records). Edges can hold arbitrary data (e.g., weights, time-series). Open source 3-clause BSD license. Well tested with over 90% code coverage. Additional benefits from Python include fast prototyping, easy to teach, and multi-platform. Find the shortest path between two nodes in an undirected graph. Python’s None object is not allowed to be used as a node. ...

Downloads: 5 This Week

Last Update: 2025-12-08
22

GLM-4.6V

GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

GLM-4.6V represents the latest generation of the GLM-V family and marks a major step forward in multimodal AI by combining advanced vision-language understanding with native “tool-call” capabilities, long-context reasoning, and strong generalization across domains. Unlike many vision-language models that treat images and text separately or require intermediate conversions, GLM-4.6V allows inputs such as images, screenshots or document pages directly as part of its reasoning pipeline — and...

Downloads: 0 This Week

Last Update: 2026-05-16
23

Semantra

Multi-tool for semantic search

Semantra is an open-source semantic search tool designed to help users explore large collections of documents by meaning rather than simple keyword matching. The software analyzes text and PDF documents stored locally and creates embeddings that allow queries to retrieve results based on conceptual similarity. It is primarily intended for individuals who need to extract insights from large document collections, including researchers, journalists, students, and historians. ...

Downloads: 0 This Week

Last Update: 2026-03-11
24

Vidi2

Large Multimodal Models for Video Understanding and Editing

Vidi is a family of large multimodal models developed for deep video understanding and editing tasks, integrating vision, audio, and language to allow sophisticated querying and manipulation of video content. It’s designed to process long-form, real-world videos and answer complex queries such as “when in this clip does X happen?” or “where in the frame is object Y during that moment?” — offering temporal retrieval, spatio-temporal grounding (i.e. locating objects over time + space), and...

Downloads: 1 This Week

Last Update: 2026-03-04
25

TexText

Re-editable LaTeX/typst graphics for Inkscape

Re-editable LaTeX and typst graphics for Inkscape. TexText is a Python extension for the vector graphics editor Inkscape providing the possibility to add and re-edit LaTeX and typst generated SVG elements to your drawing.

Downloads: 3 This Week

Last Update: 2026-01-06

© 2026 Slashdot Media. All Rights Reserved.
