Search Results for "language processing" - Page 7

Showing 842 open source projects for "language processing"

  • 1
    Docspell

    Assist in organizing your piles of documents

    Docspell is a personal document organizer, sometimes called a "Document Management System" (DMS). You'll need a scanner to convert your papers into files. Docspell can then assist in organizing the resulting mess. It can unify your files from scanners, emails, and other sources. It is targeted at home use, i.e. families and households, and also at smaller groups and companies. You can associate tags, set correspondents, and add lots of other predefined and custom metadata. If your documents are...
    Downloads: 3 This Week
  • 2
    Botonic

    Build chatbots and conversational experiences using React

    Botonic is a full-stack JavaScript framework for creating chatbots and modern conversational apps that work on multiple platforms: web, mobile, and messaging apps (Messenger, WhatsApp, Telegram, etc.). Building modern applications on top of messaging apps like WhatsApp or Messenger is much more than creating simple text-based chatbots. Botonic is a full-stack serverless framework that combines the power of React and TensorFlow.js to create amazing experiences at the intersection of text and...
    Downloads: 3 This Week
  • 3
    SimpleITK

    A layer built on top of the Insight Toolkit (ITK)

    SimpleITK is an image analysis toolkit with a large number of components supporting general filtering operations, image segmentation, and registration. It is built on top of the Insight Segmentation and Registration Toolkit (ITK) with the intent of providing a simplified interface to ITK. SimpleITK itself is written in C++ but is available for a large number of programming languages. Wrapping of the C++ code is accomplished through SWIG; in principle, any language wrapped by SWIG should be...
    Downloads: 5 This Week
  • 4
    TigerBot

    TigerBot: A multi-language multi-task LLM

    TigerBot is an open-source family of large language models designed to support multilingual and multi-task natural language processing applications. The project focuses on building high-performance models capable of handling both English and Chinese tasks while maintaining strong reasoning and conversational abilities. TigerBot models are based on modern transformer architectures and are trained on large datasets that cover multiple domains and languages.
    Downloads: 0 This Week
  • 5
    HuixiangDou

    Overcoming Group Chat Scenarios with LLM-based Technical Assistance

    HuixiangDou is an open-source large language model assistant designed specifically for technical question answering in group chat environments. The project addresses a common problem in developer communities where discussion channels become overwhelmed by repeated or irrelevant questions. To solve this issue, HuixiangDou implements a multi-stage pipeline that analyzes incoming messages, filters irrelevant conversations, and selectively generates responses when the assistant determines it can...
    Downloads: 2 This Week
  • 6
    LangChain for Java

    LangChain4j is an open-source Java library

    LangChain for Java is an open-source Java framework designed to simplify the development of applications powered by large language models. The library provides a unified API that allows developers to connect Java applications to multiple AI providers and embedding databases without having to implement separate integrations for each service. Its architecture includes abstractions for prompts, chat interactions, document processing, embeddings, and vector storage, enabling developers to build complex AI workflows with minimal boilerplate code. ...
    Downloads: 2 This Week
  • 7
    Dict UK

    Project to generate POS tag dictionary for Ukrainian language

    A Java-based tool for generating full morphological dictionaries for Ukrainian, applying affix rules to base lexemes to produce all inflected forms with part-of-speech tags—used for natural language processing and spell-checking.
    Downloads: 2 This Week
  • 8
    Google Calendar MCP

    Google Calendar MCP server for Claude Desktop integration

    A Model Context Protocol server that allows AI assistants like Claude to interact with Google Calendar, enabling seamless calendar management through natural language conversations.
    Downloads: 1 This Week
  • 9
    SwiftFFmpeg

    A Swift wrapper for the FFmpeg API

    SwiftFFmpeg is a Swift wrapper for FFmpeg that provides direct access to multimedia processing capabilities within Apple development environments. It exposes FFmpeg’s libraries through Swift-friendly APIs, enabling developers to perform encoding, decoding, and stream manipulation tasks. The project is designed to maintain performance while improving usability through modern language features. It supports working with media containers, codecs, and frames at a low level, offering fine control over processing pipelines. ...
    Downloads: 1 This Week
  • 10
    Stripe AI

    One-stop shop for building AI-powered products and businesses

    ...The project acts as a centralized repository containing resources, libraries, and examples that simplify the process of incorporating payments, billing, and financial workflows into AI applications. It enables developers to connect large language models and AI agents with Stripe APIs so that automated systems can perform actions such as handling transactions, managing subscriptions, or processing financial events. The platform is particularly relevant for companies building AI-driven products that require monetization, usage-based billing, or programmable financial interactions. ...
    Downloads: 4 This Week
  • 11
    SemTools

    Semantic search and document parsing tools for the command line

    ...The project focuses on enabling developers and AI agents to process large document collections and extract meaningful semantic representations that can be searched efficiently. Built with Rust for performance and reliability, the toolchain provides fast processing of text and structured documents while maintaining low system overhead. SemTools can parse documents, build semantic embeddings, and perform similarity searches across datasets, making it useful for research, knowledge management, and AI-assisted coding workflows. The toolkit is designed to work well with modern AI pipelines, particularly those involving large language models that require structured knowledge retrieval.
    Downloads: 1 This Week
  • 12
    OpenPlanter

    Language-model investigation agent with a terminal UI

    OpenPlanter is an open-source Python project focused on building an intelligent automated planting or gardening system powered by software control and data processing. The repository is designed to help developers and hobbyists create programmable plant management workflows that can monitor, schedule, and optimize growing conditions. It emphasizes automation and extensibility, allowing integration with sensors, environmental data, and control logic for smart cultivation setups. The system is...
    Downloads: 3 This Week
  • 13
    SAG

    SQL-Driven RAG Engine

    SAG is an open-source SQL-driven retrieval-augmented generation engine that dynamically constructs knowledge graphs during query processing. Instead of relying on a static knowledge graph prepared in advance, the system automatically builds relational structures between entities while processing user queries. Documents are first decomposed into atomic semantic events, which are then represented using multidimensional natural language vectors. These vectors allow the system to identify relationships between concepts and construct a graph representation of knowledge at runtime. ...
    Downloads: 0 This Week
  • 14
    Robin

    AI-powered tool for dark web OSINT search and investigation

    Robin is an AI-powered open source tool designed to assist investigators and researchers in conducting dark web OSINT (Open Source Intelligence) investigations. It combines automated dark web search capabilities with large language models (LLMs) to analyze and summarize information discovered across hidden services and Tor-based search engines. The tool helps refine investigative queries, collect results from multiple dark web sources, and filter relevant intelligence using AI-driven processing. Robin also performs scraping of discovered pages through Tor sessions, allowing users to gather additional context from dark web sites while maintaining the required network routing. ...
    Downloads: 34 This Week
  • 15
    Perl 5

    The Perl programming language

    This repository contains the reference implementation of the Perl 5 programming language, including the interpreter, core modules, build system, and an extensive test suite. Perl 5 is a multi-paradigm language renowned for powerful text processing, rich regular expressions, and pragmatic glue code across systems. The core distribution is highly portable, building on Unix, Linux, Windows, and many other platforms, with stable release cycles and careful back-compatibility. ...
    Downloads: 6 This Week
  • 16
    ai-renamer

    A Node.js CLI that uses Ollama and LM Studio models

    ...The tool supports batch processing, making it particularly useful for organizing large collections of files quickly and efficiently. It also provides customization options such as naming conventions, language preferences, and prompt modifications to tailor the output to specific use cases. By leveraging AI for semantic understanding, it significantly reduces the time spent on manual file organization and improves discoverability.
    Downloads: 2 This Week
  • 17
    ML-NLP

    Common knowledge points and code implementations for machine learning and NLP

    ...The repository also includes example implementations and explanatory materials that help readers understand the mechanics behind machine learning and NLP algorithms. In addition to technical explanations, the project organizes content into topic areas such as deep learning fundamentals, natural language processing techniques, and algorithm engineering practices.
    Downloads: 0 This Week
  • 18
    OpenAI PHP Client

    OpenAI PHP is a supercharged community-maintained PHP API client

    The openai-php/client library is a PHP client for interacting with OpenAI’s API, enabling developers to integrate OpenAI models into their PHP applications. It simplifies sending API requests to access language models, generating text, processing natural language, and utilizing OpenAI’s advanced features. The library is designed to be easy to use, allowing developers to quickly set up and use OpenAI’s capabilities within their PHP projects.
    Downloads: 1 This Week
  • 19
    TorchDistill

    A coding-free framework built on PyTorch

    torchdistill (formerly kdkit) offers various state-of-the-art knowledge distillation methods and enables you to design (new) experiments simply by editing a declarative YAML config file instead of Python code. Even when you need to extract intermediate representations in teacher/student models, you will NOT need to reimplement the models, which often changes the interface of the forward method; instead, you specify the module path(s) in the YAML file. In addition to knowledge distillation, this...
    Downloads: 0 This Week
  • 20
    flair

    A very simple framework for state-of-the-art NLP

    A very simple framework for state-of-the-art NLP. Developed by Humboldt University of Berlin and friends. A powerful NLP library. Flair allows you to apply our state-of-the-art natural language processing (NLP) models to your text, such as named entity recognition (NER), sentiment analysis, part-of-speech tagging (PoS), special support for biomedical texts, sense disambiguation and classification, with support for a rapidly growing number of languages. A text embedding library. Flair has simple interfaces that allow you to use and combine different word and document embeddings, including our proposed Flair embeddings and various transformers. ...
    Downloads: 0 This Week
  • 21
    llms-from-scratch-cn

    Build a large language model from scratch with only a Python foundation

    llms-from-scratch-cn is an educational open-source project designed to teach developers how to build large language models step by step using practical code and conceptual explanations. The repository provides a hands-on learning path that begins with the fundamentals of natural language processing and gradually progresses toward implementing full GPT-style architectures from the ground up. Rather than focusing on using pre-trained models through APIs, the project emphasizes understanding the internal mechanisms of modern language models, including tokenization, attention mechanisms, transformer architecture, and training workflows. ...
    Downloads: 0 This Week
  • 22
    MCP Neo4j

    Model Context Protocol with Neo4j

    An implementation of the Model Context Protocol with Neo4j, enabling natural language interactions with Neo4j databases and facilitating operations such as schema retrieval and Cypher query execution.
    Downloads: 0 This Week
  • 23
    toxic-repos

    We are against discrediting open source software

    toxic-repos is a tool that analyzes GitHub repositories to detect signs of hostile or problematic behavior in open-source communities. It helps developers and contributors identify repositories with negative engagement patterns.
    Downloads: 0 This Week
  • 24
    Open Generative AI

    Uncensored, open-source alternative to Higgsfield AI

    ...The repository organizes information about models, libraries, datasets, and learning materials, making it easier for developers to navigate the rapidly evolving AI landscape. It includes references to tools for natural language processing, computer vision, and multimodal systems. The project is designed as a knowledge hub, helping users discover technologies and best practices for building generative AI applications. It is particularly useful for beginners who need a structured overview as well as for experienced developers looking for new tools. The repository is continuously updated to reflect the latest developments in the field. ...
    Downloads: 41 This Week
  • 25
    xLSTM

    Neural Network architecture based on ideas of the original LSTM

    xLSTM is an open-source machine learning architecture that reimagines the classic Long Short-Term Memory (LSTM) network for modern large-scale language modeling and sequence processing tasks. The project introduces a new recurrent neural network design that incorporates exponential gating mechanisms and enhanced memory structures to overcome limitations of traditional LSTM models. By introducing innovations such as matrix-based memory and improved normalization techniques, xLSTM improves the ability of recurrent networks to capture long-range dependencies in sequential data. ...
    Downloads: 1 This Week