Transform your applications and workflows into powerful agentic systems at global scale.
Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
Get Started Free
Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.
Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
Fast and accurate automatic speech recognition (ASR) for edge devices
moonshine is an open-source automatic speech recognition toolkit optimized for fast and accurate transcription on edge devices and local environments. The project is designed to enable real-time voice applications such as live transcription, voice commands, and embedded speech interfaces without requiring heavy cloud infrastructure. Its architecture emphasizes low latency and flexible input handling, allowing audio streams of varying durations rather than relying on fixed processing windows. ...
MozJPEG improves JPEG compression efficiency achieving higher visual quality and smaller file sizes at the same time. It is compatible with the JPEG standard, and the vast majority of the world's deployed JPEG decoders. MozJPEG is compatible with the libjpeg API and ABI. It is intended to be a drop-in replacement for libjpeg. MozJPEG is a strict superset of libjpeg-turbo's functionality. All MozJPEG's improvements can be disabled at run time, and in that case it behaves exactly like...
Fork of OCR software cuneiform
Original software see:
https://launchpad.net/cuneiform-linux
by Cognitive Technologies and Jussi Pakkanen
Other Open Source OCR stuff see
* Tesseract by Ray Smith (using the Leptonica image library)
* GOCR
* OCRAD
Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
OpenPR stands for Open Pattern Recognition project and is intended to be an open source library for algorithms of image processing, computer vision, natural language processing, pattern recognition, machine learning and the related fields.
Animal is AN IMAging Library written in C. Its simple API supports over 80 image formats, and is intended to make massive use of other image processing libraries. Animal aims at image analysis and recognition. It is mainly the C basis of the SIP toolbox.
Simplified interface to Leap Motion designed for art and music apps
LeapInto provides a simplified interface to the Leap Motion hand sensor input device. Multiple hand recognition is simplified to several stable categories and coordinates are normalised. The interface comes two flavours at present, an open broadcast system using the OSC protocol and a plugin for the Csound audio/music programming language.
Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.
Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
Real time face tracking and recognition refers to the task of locating human faces in a video stream and identifying the faces by matching them against the database of known faces.
This system provides printed and cursive handwriting recognition service as a shared library on GNU/Linux/Unix systems.
Until someone writes another client, try the S/HIP:
https://sourceforge.net/projects/ship-project/
Arabisc is speaker independent large vocabulary continuous speech recognizer for Arabic language released under GNU license.It is also a collection of open source tools that allows researchers and developers to build speech recognition systems for Arab
1.) Investigation with cosine transform, and anti transform algorithm, with some voice recognition code. 2.) Translator: Croatian, English. 3.) 2D to 3D picture algorithm (principle) and new 2Dto3D video conversion code with AviSynth video scripting
Obsolete for most. Use the Ink2Text project instead - no Windows needed! A Linux/Unix library providing full cursive & printed handwriting recognition by connecting to a Windows XP Tablet back end. Need Service & client files.
Scalable Language API (SLAPI) The most comprehensive architecture for conversational natural-language applications including speech recognition/synthesis, semantics, & machine translation. Used on Android & other mobile app platforms.
The Common OCR Service Interface. COSI is an API that allows developpers to easily bring OCR (Optical Character Recognition) capabilities to image processing applications. COSI supports existing OCR tools such as Tesseract, GOCR or GNU Ocrad.
VoxForge collects user-submitted speech audio files for the creation of Acoustic Models for Free and Open Source Speech Recognition Engines such as HTK, Julius, ISIP and Sphinx.
A project being an introduction to fuzzy pattern recognition. This is only fuzzy image segmentation, together with an image processing pipeline to operate on colour images.