Showing 554 open source projects for "language processing"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    rust-bert

    rust-bert

    Rust native ready-to-use NLP pipelines and transformer-based models

    rust-bert is a Rust-based implementation of transformer-based natural language processing models that provides ready-to-use pipelines for tasks such as text classification, summarization, and question answering. The project ports many capabilities of the Hugging Face Transformers ecosystem into the Rust programming language. It allows developers to run state-of-the-art NLP models like BERT, GPT-2, and DistilBERT directly within Rust applications while maintaining high performance and memory efficiency. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    AiLearning-Theory-Applying

    AiLearning-Theory-Applying

    Quickly get started with AI theory and practical applications

    ...Advanced sections explore modern AI topics including transformers, BERT-based natural language processing systems, and practical competition-style machine learning workflows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    VibeVoice

    VibeVoice

    Open-source multi-speaker long-form text-to-speech model

    ...A key innovation is its use of continuous acoustic and semantic speech tokenizers operating at an ultra-low frame rate of 7.5 Hz, enabling high audio fidelity with efficient processing of long sequences. The model integrates a Qwen2.5-based large language model with a diffusion head to produce realistic acoustic details and capture conversational context. Training involved curriculum learning with increasing sequence lengths up to 65K tokens, allowing VibeVoice to handle very long dialogues effectively. Safety mechanisms include an audible disclaimer and imperceptible watermarking in all generated audio to mitigate misuse risks.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 4
    Mem0

    Mem0

    The Memory layer for AI Agents

    Mem0 is a self-improving memory layer designed for Large Language Model (LLM) applications, enabling personalized AI experiences that save costs and delight users. It remembers user preferences, adapts to individual needs, and continuously improves over time. Key features include enhancing future conversations by building smarter AI that learns from every interaction, reducing LLM costs by up to 80% through intelligent data filtering, delivering more accurate and personalized AI outputs by...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 5
    React Native ExecuTorch

    React Native ExecuTorch

    Declarative way to run AI models in React Native on device

    React Native ExecuTorch is a library for running AI models directly on mobile devices from React Native. It is powered by ExecuTorch and provides a declarative approach to on-device model execution. The project supports a range of AI use cases, including large language models, computer vision, OCR, object detection, speech processing, segmentation, and embeddings. It helps React Native developers use local AI capabilities without needing deep native programming or machine learning infrastructure expertise. The library is especially relevant for privacy-first apps, offline experiences, and mobile products that need low-latency inference. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Mods

    Mods

    AI on the command line

    ...Mods fits naturally into Unix-style workflows because it can read from standard input and produce output that other commands can continue processing. Its main value is bringing practical AI assistance into everyday terminal automation.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    StoryGen Atelier

    StoryGen Atelier

    AI-assisted storyboard and video generation tool

    StoryGen Atelier is an advanced creative tool that blends AI with visual storytelling, making it possible to generate fully structured storyboards and stitched videos from text prompts without requiring manual art or animation skills. Users begin with natural language descriptions of their story or scene, and the system uses state-of-the-art large models to generate both the script and corresponding frames. Once individual frames are created, a second AI model generates transition clips that smoothly link the frames into a coherent short video sequence, and the tool then assembles everything into a finished video using standard video processing tools. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Swirl

    Swirl

    Swirl queries any number of data sources with APIs

    Swirl queries any number of data sources with APIs and uses spaCy and NLTK to re-rank the unified results without extracting and indexing anything! Includes zero-code configs for Apache Solr, ChatGPT, Elastic Search, OpenSearch, PostgreSQL, Google BigQuery, RequestsGet, Google PSE, NLResearch.com, Miro & more! SWIRL adapts and distributes queries to anything with a search API - search engines, databases, noSQL engines, cloud/SaaS services etc - and uses AI (Large Language Models) to re-rank...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    SALMONN family

    SALMONN family

    A suite of advanced multi-modal LLMs

    SALMONN is a family of advanced multi-modal large language models (LLMs) developed by ByteDance — designed to handle and integrate multiple data modalities (e.g. text, audio, video) rather than just plain text. The repository bundles different branches targeting specialized tasks (e.g. video-SALMONN, speech-quality assessment, general multimodal tasks), suggesting that the project is modular and extensible across domains. SALMONN aims to push the frontier of multi-modal AI by allowing models...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    Matrix

    Matrix

    Multi-Agent daTa geneRation Infra and eXperimentation framework

    Matrix is a distributed, large-scale engine for multi-agent synthetic data generation and experiments: it provides the infrastructure to run thousands of “agentic” workflows concurrently (e.g. multiple LLMs interacting, reasoning, generating content, data-processing pipelines) by leveraging distributed computing (like Ray + cluster management). The idea is to treat data generation as a “data-to-data” transformation: each input item defines a task, and the runtime orchestrates asynchronous, peer-to-peer agent workflows, avoiding global synchronization bottlenecks. That design makes Matrix particularly well-suited for large-batch inference, model benchmarking, data curation, augmentation, or generation — whether for language, code, dialogue, or multimodal tasks. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    TorchRL

    TorchRL

    A modular, primitive-first, python-first PyTorch library

    TorchRL is an open-source Reinforcement Learning (RL) library for PyTorch. TorchRL provides PyTorch and python-first, low and high-level abstractions for RL that are intended to be efficient, modular, documented, and properly tested. The code is aimed at supporting research in RL. Most of it is written in Python in a highly modular way, such that researchers can easily swap components, transform them, or write new ones with little effort.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    IMS Toucan

    IMS Toucan

    Controllable and fast Text-to-Speech for over 7000 languages

    IMS-Toucan is a toolkit for training, using, and teaching state-of-the-art text-to-speech systems, built at the Institute for Natural Language Processing (IMS), University of Stuttgart. It is the official home of ToucanTTS, a massively multilingual TTS system designed to support over 7,000 languages with a single unified framework. The toolkit focuses on being fast and controllable while not requiring huge amounts of compute, making it practical for research labs and smaller teams. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Alan AI

    Alan AI

    In-App assistant SDK to build a multimodal conversational UX websites

    ...A powerful web-based IDE where you can write, test and debug dialog scenarios for your voice assistant or chatbot. Alan's AI-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. To voice enable your app, you only need to get the Alan Client SDK and drop it to your app. No need to plan for, deploy and maintain any infrastructure or speech components - the Alan Platform does the bulk of the work.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 14
    OpenClaw Medical Skills

    OpenClaw Medical Skills

    The largest open-source medical AI skills library for OpenClaw

    ...Each skill is packaged as a modular component that can be integrated into an OpenClaw-based AI assistant, allowing the agent to perform expert-level reasoning and workflows in medical contexts. Instead of relying on general-purpose language model responses, the repository equips AI agents with structured instructions and tools tailored to medical knowledge and datasets. This modular design allows developers and researchers to build AI systems that can access specialized medical reasoning processes, retrieve relevant biomedical information, and generate structured outputs suitable for analysis or downstream processing.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    The Machine & Deep Learning Compendium

    The Machine & Deep Learning Compendium

    List of references in my private & single document

    ...Originally created as a personal knowledge base, the repository evolved into a public educational resource designed to help learners explore the rapidly expanding machine learning ecosystem. The compendium includes explanations of concepts across multiple domains such as natural language processing, computer vision, time-series analysis, anomaly detection, and graph learning. In addition to technical algorithms, the project also covers practical topics related to data science workflows, engineering practices, and product development in AI systems.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Open Vision Agents by Stream

    Open Vision Agents by Stream

    Build Vision Agents quickly with any model or video provider

    Open Vision Agents by Stream is an open source framework from Stream for building real time, multimodal AI agents that watch, listen, and respond to live video streams. It focuses on combining video understanding models, such as YOLO and Roboflow based detectors, with real time large language models like OpenAI Realtime and Gemini Live to create interactive experiences. The framework uses Stream’s ultra low latency edge network so agents can join sessions quickly and maintain very low audio and video latency while processing frames and generating responses. Developers work with an agent abstraction that connects video edge providers, LLMs, and processors into pipelines, making it easier to orchestrate tasks like object detection, pose estimation, and conversational guidance. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17
    AlphaTree

    AlphaTree

    DNN && GAN && NLP && BIG DATA

    ...The repository organizes these architectures into a structured learning path that helps learners understand how deep learning models improved over time through changes in depth, architectural complexity, and training techniques. In addition to neural networks used for image classification, the project also references broader AI fields such as generative adversarial networks, natural language processing, and graph neural networks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    course.fast.ai

    course.fast.ai

    The fast.ai course notebooks

    ...The repository includes lesson notebooks, slide presentations, spreadsheets, and supplementary materials that help students understand neural networks, computer vision, and natural language processing tasks. The materials are designed to work alongside the fast.ai book and video lectures so learners can follow a structured learning pathway through modern deep learning techniques.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    machine_learning_examples

    machine_learning_examples

    A collection of machine learning examples and tutorials

    ...It includes implementations of many machine learning algorithms and neural network architectures using Python and popular libraries such as TensorFlow and NumPy. The repository covers a wide range of topics including supervised learning, unsupervised learning, reinforcement learning, and natural language processing. Many of the examples are accompanied by tutorials and educational materials that explain how the algorithms work and how they can be applied in real-world projects. The code is organized into small independent experiments so that learners can explore specific algorithms or techniques without needing to understand the entire codebase.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Auto-Deep-Research

    Auto-Deep-Research

    Your Fully-Automated Personal AI Assistant

    Auto-Deep-Research is a system designed to fully automate deep research workflows using language models, retrieval, planning, and multi-stage reasoning to produce structured research artifacts such as surveys, benchmarks, reports, and even prototypes without heavy human intervention. Users provide a research topic or multifaceted goal, and the system autonomously breaks the objective down into subtasks like literature collection, critical summarization, cross-comparison, citation extraction,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    GLM-4.5V

    GLM-4.5V

    GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

    GLM-4.5V is the preceding iteration in the GLM-V series that laid much of the groundwork for general multimodal reasoning and vision-language understanding. It embodies the design philosophy of mixing visual and textual modalities into a unified model capable of general-purpose reasoning, content understanding, and generation, while already supporting a wide variety of tasks: from image captioning and visual question answering to content recognition, GUI-based agents, video understanding,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Step-Audio 2

    Step-Audio 2

    Multi-modal large language model designed for audio understanding

    Step-Audio2 is an advanced, end-to-end multimodal large language model designed for high-fidelity audio understanding and natural speech conversation: unlike many pipelines that separate speech recognition, processing, and synthesis, Step-Audio2 processes raw audio, reasons about semantic and paralinguistic content (like emotion, speaker characteristics, non-verbal cues), and can generate contextually appropriate responses — including potentially generating or transforming audio output. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    AI_Tutorial

    AI_Tutorial

    A selection of learning materials, search, recommendation, advertising

    AI_Tutorial is a large curated repository that aggregates high-quality learning resources related to artificial intelligence, machine learning, deep learning, natural language processing, and data engineering. The project functions as a centralized knowledge base designed to help engineers and researchers discover tutorials, technical articles, algorithm explanations, and architecture discussions from across the AI ecosystem. Rather than focusing on a single framework or course, the repository collects materials from many sources such as open-source projects, technical blogs, research papers, and industry engineering posts. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    AI Deadlines

    AI Deadlines

    AI conference deadline countdowns

    ...The repository powers a website that displays countdown timers and structured information for top research conferences across subfields such as computer vision, natural language processing, machine learning, and robotics. The project maintains a curated dataset of conferences that includes metadata such as submission deadlines, abstract deadlines, event dates, conference locations, and related information. Researchers and students use the platform to plan their paper submissions and manage academic schedules without manually tracking multiple conference announcements. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Supertonic

    Supertonic

    Lightning-fast, on-device TTS, running natively via ONNX

    Supertonic is a lightning-fast, on-device text-to-speech system built around ONNX Runtime for maximum speed and portability. It focuses on running entirely locally, eliminating the need for cloud APIs and providing low latency and strong privacy guarantees, even on constrained devices like Raspberry Pi boards and e-readers. The core model is highly compact at around 66 million parameters, yet benchmarks show it can generate speech up to 167× faster than real time on modern consumer hardware...
    Downloads: 2 This Week
    Last Update:
    See Project