Showing 166 open source projects for "artificial intelligence python"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    VMZ (Video Model Zoo)

    VMZ (Video Model Zoo)

    VMZ: Model Zoo for Video Modeling

    The codebase was designed to help researchers and practitioners quickly reproduce FAIR’s results and leverage robust pre-trained backbones for downstream tasks. It also integrates Gradient Blending, an audio-visual modeling method that fuses modalities effectively (available in the Caffe2 implementation). Although VMZ is now archived and no longer actively maintained, it remains a valuable reference for understanding early large-scale video model training, transfer learning, and multimodal...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    ArrayFire

    ArrayFire

    ArrayFire, a general purpose GPU library

    ArrayFire is a general-purpose tensor library that simplifies the process of software development for the parallel architectures found in CPUs, GPUs, and other hardware acceleration devices. The library serves users in every technical computing market. Data structures in ArrayFire are smartly managed to avoid costly memory transfers and to take advantage of each performance feature provided by the underlying hardware. The community of ArrayFire developers invites you to build with us if...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    OpenCV

    OpenCV

    Open Source Computer Vision Library

    The Open Source Computer Vision Library has >2500 algorithms, extensive documentation and sample code for real-time computer vision. It works on Windows, Linux, Mac OS X, Android, iOS in your browser through JavaScript. Languages: C++, Python, Julia, Javascript Homepage: https://opencv.org Q&A forum: https://forum.opencv.org/ Documentation: https://docs.opencv.org Source code: https://github.com/opencv Please pay special attention to our tutorials!...
    Leader badge
    Downloads: 3,461 This Week
    Last Update:
    See Project
  • 4
    Librum

    Librum

    The Librum client application

    A clean and powerful way to read and manage your e-books on any device. Create your own personalized online library that you can access from any device, anytime, anywhere. Librum automatically saves everything you need to the cloud. Focus on what actually matters, using a simple and straight forward interface. Librum works out of the box, everywhere and at any time. Organize your library the way you want it to be using Librum's organization features. Add your books to collections, tag them,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Keep company data safe with Chrome Enterprise Icon
    Keep company data safe with Chrome Enterprise

    Protect your business with AI policies and data loss prevention in the browser

    Make AI work your way with Chrome Enterprise. Block unapproved sites and set custom data controls that align with your company's policies.
    Download Chrome
  • 5
    ViewBots-V2

    ViewBots-V2

    Free Streaming Bot: Compatible with Twitch, YouTube and Facebook

    "Maximize Your Stream's Impact on Twitch, Facebook Live, and YouTube with Our Advanced Free Viewer Bot" Elevate your streaming game on key platforms like Twitch, Facebook Live, and YouTube. Our cutting-edge viewer bot is expertly designed to boost your channel's visibility and engagement, making your content more accessible to a broader audience. Streamline your growth and increase your impact with ease.
    Leader badge
    Downloads: 1,929 This Week
    Last Update:
    See Project
  • 6
    eGuideDog free software for the blind
    eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.
    Leader badge
    Downloads: 250 This Week
    Last Update:
    See Project
  • 7
    Source code from the Research Institute for Signals, Systems and Computational Intelligence http://fich.unl.edu.ar/sinc
    Leader badge
    Downloads: 14 This Week
    Last Update:
    See Project
  • 8

    Russian Text-to-speech programs

    читание, чтение, говорение

    For Windows (on Linux trought Wine can work) 3 russian text-to-speech programs (Chitanie, Chtenie and Govorenie). If you want donate. paypal.me/alkbab Читание, Чтение, Говорение есть программы пробующие преобразовать русский текст в русскую речь . Для Windows. На Linux через Wine... Кто хочет может пожертвовать paypal.me/alkbab
    Leader badge
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    SMILI

    SMILI

    Scientific Visualisation Made Easy

    The Simple Medical Imaging Library Interface (SMILI), pronounced 'smilie', is an open-source, light-weight and easy-to-use medical imaging viewer and library for all major operating systems. The main sMILX application features for viewing n-D images, vector images, DICOMs, anonymizing, shape analysis and models/surfaces with easy drag and drop functions. It also features a number of standard processing algorithms for smoothing, thresholding, masking etc. images and models, both with...
    Leader badge
    Downloads: 7 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Learn More
  • 10
    RemoteTTS

    RemoteTTS

    Tool to remotely activate Text-To-Speech (TTS) on a server

    The tool provides a simple TCP/UDP interface to let a remote machine perform TTS outputs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Dead Deer 3.14.56.2025

    Dead Deer 3.14.56.2025

    3D modeler, 3D game maker, 3D demo maker

    3D modeler, 3D game maker, 3D demo maker. to model and create games, demos. Scripting language allows you to code interactions in pseudo-C with the animation and synthesize your own rendering with own-made shaders. Import FBX, BLEND, GLTF, OBJ, 3DS, DAE, X, XML, STL, PCB, ASC, PLY, GSPLATS. Cross-platform project WINDOWS 32/64 /MACOSX 10.6/ 10.8+/APPLSilicon /LINUX/iOS/ANDROID/WINDOWS PHONE/GOOGLE VR/OPEN VR/OCULUS VR/WEBASM/UWP8/10/OPENXR, PIs (ARM32/64), RISCV Players and...
    Leader badge
    Downloads: 17 This Week
    Last Update:
    See Project
  • 12
    OCR Manga Reader for Android

    OCR Manga Reader for Android

    Android Manga reader with Japanese OCR and dictionary capabilities

    OCR Manga Reader is a free and open source Android app that allows you to quickly OCR and lookup Japanese words in real-time. It does not have ads or telemetry/spyware and does not require an Internet connection. Supports both EDICT and EPWING dictionaries. Requires Android 4.0 (Ice Cream Sandwich) or higher. See http://ocrmangareaderforandroid.sourceforge.net/ for details.
    Leader badge
    Downloads: 49 This Week
    Last Update:
    See Project
  • 13

    EngraverAI

    This first-world Photoshop's plug-in features artificial intelligence

    The AlphaPlugins EngraverAI is a newly released Photoshop plug-in that utilizes artificial intelligence to make routine technical work easy for you. This product replicates pictures resulting in high-quality products comparable to handmade engraving art. The usage of artificial intelligence makes the creative process easier and more time efficient while maintaining overall quality. The Engraver AI is the next level for the AlphaPlugins series of tools for engraving and etching. ...
    Downloads: 17 This Week
    Last Update:
    See Project
  • 14
    My MP3 Player

    My MP3 Player

    Put your music in it then play it forever. Free MP3 player for Windows

    Put your .mp3 files in a folder then click +Add Music button then click play button and listen it forever. (You have to put .mp3 files in a folder then choose the folder unless the application won't work) This application is free forever no ad. Made by HorizonPublish ⚠️Windows will show unknown application error, click "More info" then "Run anyway".
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    VideoSubFinder
    The main purpose of this program is to provide functionality for extract hardcoded subtitles (hardsub) from video. It provides two main features: 1) Autodetection of frames with hardcoded text (hardsub) on video with saving info about timing positions. 2) Generation of cleared from background text images, which allows with usage of OCR programs (like FineReader, Subtitle Edit, Google Drive) to generate complete subtitles with original text and timing. For working of this program on...
    Leader badge
    Downloads: 348 This Week
    Last Update:
    See Project
  • 16
    Coqui STT

    Coqui STT

    The deep learning toolkit for speech-to-text

    Coqui STT is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. Coqui STT is battle-tested in both production and research. Multiple possible transcripts, each with an associated confidence score. Experience the immediacy of script-to-performance. With Coqui text-to-speech, production times go from months to minutes. With Coqui, the post is a pleasure. Effortlessly clone the voices of your talent and have the clone handle the problems...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    Capture2Text

    Capture2Text

    Quickly OCR part of the screen and save resulting text to clipboard

    Capture2Text enables users to quickly OCR a portion of the screen using a keyboard shortcut. The resulting text will be saved to the clipboard by default. Supports 90+ languages including Chinese, English, French, German, Japanese, Korean, Russian, and Spanish. Portable and does not require installation. See http://capture2text.sourceforge.net for details.
    Leader badge
    Downloads: 3,316 This Week
    Last Update:
    See Project
  • 18

    AhoTTS - TTS for Basque and Spanish

    Text-to-Speech for Basque and Spanish

    Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    Physics Simulation Software based on user sketchs running a pattern recognition agent, this app is able to animate a physics sketch, from a blackboard
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Screen Translator

    Screen Translator

    Screen capture, OCR and translation tool

    This software allows you to translate any text on screen. Basically it is a combination of screen capture, OCR and translation tools. More info and the latest release on the homepage (https://github.com/OneMoreGres/ScreenTranslator)
    Leader badge
    Downloads: 1,402 This Week
    Last Update:
    See Project
  • 21
    gImageReader

    gImageReader

    A graphical frontend to tesseract-ocr

    gImageReader is a simple Gtk/Qt front-end to tesseract. Features include: - Import PDF documents and images from disk, scanning devices, clipboard and screenshots - Process multiple images and documents in one go - Manual or automatic recognition area definition - Recognize to plain text or to hOCR documents - Recognized text displayed directly next to the image - Post-process the recognized text, including spellchecking - Generate PDF documents from hOCR documents **Note**:...
    Leader badge
    Downloads: 328 This Week
    Last Update:
    See Project
  • 22
    DeepSpeech

    DeepSpeech

    Open source embedded speech-to-text engine

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 23

    cuneiformplus

    Fork of OCR software cuneiform

    Fork of OCR software cuneiform Original software see: https://launchpad.net/cuneiform-linux by Cognitive Technologies and Jussi Pakkanen Other Open Source OCR stuff see * Tesseract by Ray Smith (using the Leptonica image library) * GOCR * OCRAD
    Downloads: 4 This Week
    Last Update:
    See Project
  • 24
    LTI-Lib is an object oriented computer vision library written in C++ for Windows/MS-VC++ and Linux/gcc. It provides lots of functionality to solve mathematical problems, many image processing algorithms, some classification tools and much more...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    FastoCloud PRO

    IPTV/NVR/CCTV/Video cloud https://fastocloud.com

    IPTV/Video cloud Features: Cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian) GPU/CPU Encode/Decode/Post Processing Stream statistics CCTV Adaptive hls streams Load balancing Temporary urls HLS push EPG scanning Subtitles to text conversions AD insertion Logo overlay Video effects Relays Timeshifts Catchups Playlists Restream/Transcode from online streaming services like Youtube, Twitch ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next