Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Scientific/Engineering
Information Analysis Software
Search Results

Search Results for "open document"

x

Sort By:

Relevance

Clear All Filters

OS

Linux 25
Windows 24
Mac 22
More...
BSD 18
ChromeOS 15
Desktop Operating Systems 2
Mobile Operating Systems 1

Category

Scientific/Engineering 25
- Information Analysis 25
- Linguistics 2
Artificial Intelligence 10
Business 6
Software Development 5
Internet 4
Multimedia 4
System 3
Formats and Protocols 2
Text Editors 2
Database 1
Education 1

License

OSI-Approved Open Source 23
Creative Commons Attribution License 2

Translations

English 10
French 2
Indonesian 2
German 1
More...
Japanese 1
Russian 1

Programming Language

Java 14
C++ 4
PHP 4
Python 4
More...
C 3
Prolog 2
Perl 1
PL/SQL 1
S/R 1
XSL (XSLT/XPath/XSL-FO) 1

Status

Production/Stable 10
Beta 8
Alpha 3
Planning 2

Showing 25 open source projects for "open document"

View related business solutions

Information Analysis Linux Clear Filters & Widen Search

Train ML Models With SQL You Already Know
BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.

Try Free
Go from Code to Production URL in Seconds
Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.

Try it free
1

elasticsearc-php

PHP low-level client for Elasticsearch

Introducing Elasticsearch DSL library to provide objective query builder for Elasticsearch bundle and elasticsearch-php client. You can easily build any Elasticsearch query and transform it to an array. This agnostic package is a lightweight wrapper on top of the Elasticsearch PHP client. Its main goal is to allow for easier structuring of queries and indices in your application. It does not want to hide or replace the functionality of the Elasticsearch PHP client. Feature complete, object...

Downloads: 0 This Week

Last Update: 5 days ago
See Project
2

DynaQ

Innovative text document search. http://dynaq.opendfki.de for details.

The goal of DynaQ is to develop an inquiry system to explore the personal information space, supporting you with the searching paradigm 'orienteering'. DynaQ is a (desktop)search engine with enhanced functionality for file, email and blog search. Look at our GitLab homepage for sourcecode and documentation: http://dynaq.opendfki.de

Downloads: 0 This Week

Last Update: 2021-08-05
See Project
3

AI learning

AiLearning, data analysis plus machine learning practice

We actively respond to the Research Open Source Initiative (DOCX) . Open source today is not just open source, but datasets, models, tutorials, and experimental records. We are also exploring other categories of open source solutions and protocols. I hope you will understand this initiative, combine this initiative with your own interests, and do what you can. Everyone's tiny contributions, together, are the entire open source ecosystem.

Downloads: 0 This Week

Last Update: 2022-02-18
See Project
4

Gamera

Gamera is a framework for the creation of structured document analysis applications by domain experts. It combines a programming library with GUI tools for the training and interactive development of recognition systems.

Downloads: 0 This Week

Last Update: 2016-05-11
See Project
Forever Free Full-Stack Observability | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
5

libcrn

libcrn is document image processing library written in C++11 for Linux, Windows, Mac OsX and Google Android. It is a toolbox that allows to create easily software such as OCRs and layout analysis tools.

Downloads: 0 This Week

Last Update: 2016-10-23
See Project
6

jLDADMM

A Java package for the LDA and DMM topic models

The Java package jLDADMM is released to provide alternative choices for topic modeling on normal or short texts. It provides implementations of the Latent Dirichlet Allocation topic model and the one-topic-per-document Dirichlet Multinomial Mixture model (i.e. mixture of unigrams), using collapsed Gibbs sampling. In addition, jLDADMM supplies a document clustering evaluation to compare topic models. See the usage of jLDADMM in its website at http://jldadmm.sourceforge.net/

1 Review

Downloads: 0 This Week

Last Update: 2016-03-13
See Project
7

SCAN

SCAN (Smart Content Aggregation and Navigation) is a universal semantic content aggregator. It combines search, text analysis, tagging and metadata functions to provide new user experience of desktop navigation and document management.

3 Reviews

Downloads: 3 This Week

Last Update: 2014-06-19
See Project
8

Texalyzer

Text analyzer

Analyzes text document using TF-IDF and optionally stopword list, and extracts important keywords.

Downloads: 0 This Week

Last Update: 2017-04-04
See Project
9

MyNook

A machine learning system for supervised document classification

An open source system for supervised document classification based on statistical machine learning techniques. On the contrary of the state of art classification techniques, MyNook just requires the title of the document, not the content itself.

Downloads: 0 This Week

Last Update: 2016-10-31
See Project
Add Two Lines of Code. Get Full APM.
AppSignal installs in minutes and auto-configures dashboards, alerts, and error tracking.

Works out of the box for Rails, Django, Express, Phoenix, and more. Monitoring exceptions and performance in no time.

Start Free
10

XmlView

GUI utility in pure Java for viewing and editing XML content; example of application built with Superficial http://superficial.sourceforge.net

Downloads: 0 This Week

Last Update: 2012-05-22
See Project
11

SIDoBI

SIDoBI is an automatic summarization system for documents in Indonesian language. It is an acronym for Sistem Ikhtisar Dokumen untuk Bahasa Indonesia. SIDoBI is built based on MEAD, a public domain portable multi-document summarization system.

Downloads: 0 This Week

Last Update: 2019-02-14
See Project
12

OpenSHORE

OpenSHORE is an XML based Semantic Document Repository (SDR) with a free definable meta model that builds up a semantic network from sections and relations in documents. The acronym SHORE means Semantic Hypertext Object Repository.

Downloads: 0 This Week

Last Update: 2013-04-15
See Project
13

Maui Topic Indexer

Maui is a multi-purpose automatic topic indexing algorithm. Given a document, Maui automatically identifies its topics. Depending on the task topics are tags, keywords, keyphrases, vocabulary terms, descriptors or Wikipedia titles.

Downloads: 0 This Week

Last Update: 2014-04-25
See Project
14

RDF Document Manager

RDF-DocMan is a document manager based on a Sesame (RDF repository) backend. Documents are stored in the filesystem and their metadata in a Sesame repository. It was developed for porQual web content generator (also in sf.net).

Downloads: 0 This Week

Last Update: 2013-04-23
See Project
15

Trainable Relation Extraction framework

T-Rex (Trainable Relation Extraction) is a highly configurable machine learning-based Information Extraction from Text framework, which includes tools for document classification, entity extraction and relation extraction.

Downloads: 0 This Week

Last Update: 2013-05-02
See Project
16

iDocs

iDocs is a intellectual document work flow with text mining options project.

Downloads: 0 This Week

Last Update: 2014-04-08
See Project
17

Irudiko

Irudiko is a library written in C++ for generating Locality Sensitive Hashing sketches from any textual and web document. Mainly designed to work with HTML pages, it has also an optimization support for English or Italian documents.

Downloads: 0 This Week

Last Update: 2013-03-13
See Project
18

Graphist

Graphist uses PHP's GD library to produce data plots, in real time, served up as standard images for consumption by web pages (though such images could be saved for use in other document types).

Downloads: 0 This Week

Last Update: 2013-03-26
See Project
19

Flesh

Flesh is a Java application designed to analyze a document (plain text, rich text, Word documents, and PDFs) and display the difficulty associated with comprehending using the Flesch-Kincaid Grade Level and the Flesch Reading Ease Score.

2 Reviews

Downloads: 13 This Week

Last Update: 2013-04-03
See Project
20

Qualiweb

Qualiweb aims at providing semantic web metrics for modeling a website visitors needs according to a given taxonomy or document classification. Web metrics provided by Qualiweb give an indication of how successful each of the website topics have been.

Downloads: 0 This Week

Last Update: 2013-03-19
See Project
21

Kriterion

Kriterion is a document retrieval and categorization engine capable of full text searching. There is no need for keyword or context-based information.

Downloads: 0 This Week

Last Update: 2013-04-19
See Project
22

Phoenix Information Extraction

Phoenix is an information extraction engine written in java. Controlled by rules (declared in xml), it extracts information form any XML document (unstructured XHTML/OpenOffice documents). Supports XPath, additional conditions and top-down decomposit

Downloads: 0 This Week

Last Update: 2013-03-14
See Project
23

SimpleRDF/XSL+PHP5

SimpleRDF/XSL template simplifies RDF/XML sources as much as possible to allow easy processing. SimpleRDF/PHP5 parser takes advantage of SimpleRDF/XSL. It has extremly simple API. You can parse any RDF/XML compatible document (incl. RSS) and much more...

Downloads: 1 This Week

Last Update: 2013-03-19
See Project
24

Judge

JUDGE (Java Utility for Document Genre Eduction) features automatic classification and clustering of documents, optionally as a webservice. The program is written entirely in Java and makes use of the Weka machine learning toolkit.

Downloads: 0 This Week

Last Update: 2015-12-01
See Project
25

db-docit

This browser-based tool is a flexible solution for documenting both logical and physical database schema designs. It supports simple version tracking concepts to document schema changes in varying stages of planning and implementation.

Downloads: 0 This Week

Last Update: 2016-09-22
See Project

Previous
You're on page 1
Next

Related Searches

task management php

document classification

ai

gamera

linux kodachi 32

latent dirichlet allocation

document management

sidobi

xml merge

rdf

Related Categories

Scientific/Engineering

Artificial Intelligence

Business

Software Development

Internet

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise