MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
Start Free
No-Nonsense Code-to-Cloud Security for Devs | Aikido
Connect your GitHub, GitLab, Bitbucket, or Azure DevOps account to start scanning your repos for free.
Aikido provides a unified security platform for developers, combining 12 powerful scans like SAST, DAST, and CSPM. AI-driven AutoFix and AutoTriage streamline vulnerability management, while runtime protection blocks attacks.
TF-IDF.jar is a Java Archive file to measure TF-IDF of each document
in a document collection (corpus).
The jar can be used to
(a) get all the terms in the corpus
(b) get the document frequency (DF) and inverse document frequency (IDF) of
all the terms in the corpus
(c) get the TF-IDF of each document in the corpus
(d) get each term with their frequency (no. of presence), term frequency (TF) and TF-IDF in every document
CoGrOO (A LibreOffice & OpenOffice.org Grammar Checker), the only open source Portuguese grammar checker that can be used with LibreOffice & OpenOffice.org.
dbacl is a general purpose digramic Bayesian text classifier. It can learn text documents you provide, and then compare new input with the learned categories. It can be used for spam filtering, or within your own shell scripts. Sometimes it plays che
Secure remote access solution to your private network, in the cloud or on-prem.
Deliver secure remote access with OpenVPN.
OpenVPN is here to bring simple, flexible, and cost-effective secure remote access to companies of all sizes, regardless of where their resources are located.
OpenPoplog is a multi-language, incrementally-compiled virtual machine-based development environment with its own extensive teaching and support material, supporting the languages Pop-11, Prolog, Common Lisp and ML on Unix, VMS, Linux and Windows.
The Information Extraction Plugin allows the use of information extraction techniques within RapidMiner.
It can be seen as an interface between natural language and IE- or datamining-methods, by extracting interesting information out of documents.
TextMarker is now developed and hosted at Apache UIMA (http://uima.apache.org/textmarker.html). TextMarker is a UIMA-based tool for information extraction and more. The full featured editor of the rule language and the build process of UIMA descriptors are complemented with components for visualization, explanation, testing and rule learning.
Spirit, a different way to write! You guide your mouse through streams of predicted words. As the intelligent agent learns your writing style, its predictions become more accurate. You can prime Spirit with the prose and poetry of any author!
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.
Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
UCR is a project name for the development of an handwritten characters in Korean language. The goal is to create a UCR Library for handwriting as well as OCR from off-line, on-line data. And we have a plan to build a UCR library for mobile.
SemNotes is a semantic note taking tool for KDE4, built on top of Nepomuk-KDE. The tool is still under development, but it is already usable, provided that KDE4 is installed and the Nepomuk running.
MediaGlyphs: an international language based on multimedia ideograms. It allows to read, think or type in your own language sentences written with the shared mediaglyphs: a common writing system for the world, Simple Unambiguous Neutral & Universal.
OCR c++ library. Include: contour recognition; vectorisation; matrix letter feature recognition; auto page segmentation and detect rotation; SS3 ASM core; XML base; web-based GUI; 99,6% printed Unicode text recognition; letter base up to 1200 letters.
Concrete Voice is a text to speech program. It can read the time, anounce weather, read text file, save text files to audio files, open any text file (supports all text encoding formats) and many more advance stuff!
"Java ArtificialIntelligence Markup Language PAD" is a tool that manages ProgramD AI (on local or remote machines) and AIML files with real-time previews and it provides a network support to test AI capabilities over many network protocols.
Auvai is a Java API and Java Swing based application for Text to Speech conversion of Unicode Tamil. Future direction of this API and application is to support Text to Speech conversion for all "Indic" languages.
PDF Annot is a piece of software that enables you to add audio and text annotation to a PDF. It uses JPedal SimpleViewer and iText library. Annotations are supported by Adobe'sofficial PDF Reader. Report any bug here: krakosia[at]gmail.com
Akshara Malayalam OCR is a project for the development of an OCR for printed and handwritten documents in Malayalam language. The inspiration is from similar OCR softwares in other languages etc.
Got any emails with obnoxious inline text? Long text stories with bad formatting? Files that an OCR didn't quite translate right? RTF format files and no easy way to read or modify them? Then eBookFormatter is for you!
gTamillOCR- Free, Cross-platform Optical Character Recognition for Printed Tamil text released under GNU GPL .This Free Software uses IPL98 and wxWidgets Cross platform GUI Libraries..
When translating becomes a game ! Text to translate can be graphically selected. Several dictionnaries can be sorted according to the context. A large choice of matching strategies is available. The OCR engine is tunable.