Showing 521 open source projects for "learning language"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 1
    AI Deadlines

    AI Deadlines

    AI conference deadline countdowns

    AI Deadlines is an open-source project that provides a centralized system for tracking important submission deadlines for major artificial intelligence and machine learning conferences. The repository powers a website that displays countdown timers and structured information for top research conferences across subfields such as computer vision, natural language processing, machine learning, and robotics. The project maintains a curated dataset of conferences that includes metadata such as submission deadlines, abstract deadlines, event dates, conference locations, and related information. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Sapiens

    Sapiens

    High-resolution models for human tasks

    Sapiens is a research framework from Meta AI focused on embodied intelligence and human-like multimodal learning, aiming to train agents that can perceive, reason, and act in complex environments. It integrates sensory inputs such as vision, audio, and proprioception into a unified learning architecture that allows agents to understand and adapt to their surroundings dynamically. The project emphasizes long-horizon reasoning and cross-modal grounding—connecting language, perception, and action into a single agentic model capable of following abstract goals. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    how-to-optim-algorithm-in-cuda

    how-to-optim-algorithm-in-cuda

    How to optimize some algorithm in cuda

    how-to-optim-algorithm-in-cuda is an open educational repository focused on teaching developers how to optimize algorithms for high-performance execution on GPUs using CUDA. The project combines technical notes, code examples, and practical experiments that demonstrate how common computational kernels can be optimized to improve speed and memory efficiency. Instead of presenting only theoretical explanations, the repository includes hand-written CUDA implementations of fundamental operations...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Apache OpenNLP

    Apache OpenNLP

    Apache OpenNLP

    Apache OpenNLP is a machine learning-based NLP library that provides tools for text-processing tasks such as tokenization, sentence segmentation, and named entity recognition.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5
    Tongyi DeepResearch

    Tongyi DeepResearch

    Tongyi Deep Research, the Leading Open-source Deep Research Agent

    DeepResearch (Tongyi DeepResearch) is an open-source “deep research agent” developed by Alibaba’s Tongyi Lab designed for long-horizon, information-seeking tasks. It’s built to act like a research agent: synthesizing, reasoning, retrieving information via the web and documents, and backing its outputs with evidence. The model is about 30.5 billion parameters in size, though at any given token only ~3.3B parameters are active. It uses a mix of synthetic data generation, fine-tuning and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    AI Engineering from Scratch

    AI Engineering from Scratch

    Learn it. Build it. Ship it for others

    AI Engineering from Scratch is a comprehensive open-source curriculum designed to teach artificial intelligence by building every component from first principles rather than relying on prebuilt frameworks. The project is structured into more than 20 phases and hundreds of lessons, covering topics that range from foundational mathematics to advanced systems such as large language models, retrieval pipelines, and multi-agent architectures. Each lesson emphasizes hands-on implementation,...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Chinese-XLNet

    Chinese-XLNet

    Chinese XLNet pre-trained model

    Chinese-XLNet is a Chinese language pre-trained model based on the XLNet architecture, providing an advanced foundation for natural language processing tasks in Mandarin and other Chinese dialects. Unlike traditional masked language modeling, XLNet uses a permutation language modeling objective that captures bidirectional context more effectively by training over all possible token orderings, yielding richer contextual representations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    DoWhy

    DoWhy

    DoWhy is a Python library for causal inference

    DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks. Much like machine learning libraries have done for prediction, DoWhy is a Python library that aims to spark causal thinking and analysis. DoWhy provides a wide variety of algorithms for effect estimation, causal structure learning, diagnosis of causal structures, root cause analysis, interventions and counterfactuals. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    rust-bert

    rust-bert

    Rust native ready-to-use NLP pipelines and transformer-based models

    rust-bert is a Rust-based implementation of transformer-based natural language processing models that provides ready-to-use pipelines for tasks such as text classification, summarization, and question answering. The project ports many capabilities of the Hugging Face Transformers ecosystem into the Rust programming language. It allows developers to run state-of-the-art NLP models like BERT, GPT-2, and DistilBERT directly within Rust applications while maintaining high performance and memory efficiency. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 10
    Llama Stack

    Llama Stack

    Composable building blocks to build Llama Apps

    Llama-Stack is an open-source framework designed to facilitate the deployment and fine-tuning of large language models (LLMs) for various natural language processing tasks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Open Generative AI

    Open Generative AI

    Uncensored, open-source alternative to Higgsfield AI

    Open Generative AI is a curated collection of resources, tools, and frameworks related to generative AI, covering a wide range of topics from foundational concepts to advanced applications. The repository organizes information about models, libraries, datasets, and learning materials, making it easier for developers to navigate the rapidly evolving AI landscape. It includes references to tools for natural language processing, computer vision, and multimodal systems. The project is designed as a knowledge hub, helping users discover technologies and best practices for building generative AI applications. ...
    Downloads: 66 This Week
    Last Update:
    See Project
  • 12
    Transformers.jl

    Transformers.jl

    Julia Implementation of Transformer models

    Transformers.jl is a Julia library that implements Transformer models for natural language processing tasks. Inspired by architectures like BERT, GPT, and T5, the library offers a modular and flexible interface for building, training, and using transformer-based deep learning models. It supports training from scratch and fine-tuning pretrained models, and integrates with Flux.jl for automatic differentiation and optimization.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Resume-Matcher

    Resume-Matcher

    Improve your resumes with Resume Matcher

    Resume-Matcher is a command-line application that compares resumes against job descriptions using natural language processing. It provides a compatibility score based on keyword relevance and highlights areas where the resume aligns—or doesn't—with the target role. Designed for job seekers and HR professionals, it helps improve resume tailoring and streamlines candidate screening.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    RAG from Scratch

    RAG from Scratch

    Demystify RAG by building it from scratch

    ...The project walks through key concepts such as generating embeddings, building vector databases, retrieving relevant documents, and integrating the retrieved context into language model prompts. Each example is written with detailed explanations so that developers can understand the internal mechanics of semantic search and context-aware language generation. The repository emphasizes learning through direct implementation, allowing users to see how each component of the RAG architecture functions independently.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    Understand Anything

    Understand Anything

    Turn any codebase into an interactive knowledge graph

    Understand-Anything is an AI-driven tool designed to help users deeply understand any topic by generating structured explanations, summaries, and breakdowns. It focuses on transforming complex or unfamiliar subjects into clear, step-by-step explanations that are easier to grasp. The system leverages language models to provide layered insights, allowing users to explore topics at different levels of detail. It is particularly useful for learning, research, and quick comprehension of new concepts across various domains. The project emphasizes accessibility, making advanced knowledge more approachable for a wider audience. It also supports iterative questioning, enabling users to refine their understanding through follow-up queries.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 16
    The Algorithms - C++ #

    The Algorithms - C++ #

    Collection of various algorithms in mathematics, machine learning

    TheAlgorithms/C-Plus-Plus is a large open-source repository that collects implementations of many classic algorithms and data structures written in the C++ programming language. The project is part of the broader “The Algorithms” initiative, which maintains algorithm implementations in several programming languages to support education and knowledge sharing. Within the C++ repository, contributors implement algorithms across a wide range of fields including sorting, graph theory, number theory, machine learning, cryptography, and data structures. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    VibeTensor

    VibeTensor

    Our first fully AI generated deep learning system

    ...What makes VibeTensor remarkable is that every major component, from core libraries and dispatch systems to CUDA runtime support, caching allocators, and language bindings, was created and validated by coding agents using automated builds and tests rather than manual line-by-line human coding. The system includes both a Python frontend via a torch-like API and an experimental Node.js/TypeScript interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Zero to Mastery Deep Learning TensorFlow

    Zero to Mastery Deep Learning TensorFlow

    All course materials for the Zero to Mastery Deep Learning with TF

    ...The material covers core machine learning workflows including regression, classification, computer vision, natural language processing, and time series forecasting, allowing users to build a well-rounded understanding of modern AI tasks. It also integrates milestone projects that simulate real-world scenarios, helping users translate abstract concepts into deployable solutions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Lobe Icons

    Lobe Icons

    Brings AI/LLM brand logos to your React & React Native apps

    Lobe Icons is an open-source icon library designed to provide developers with a comprehensive collection of logos and visual assets representing popular artificial intelligence platforms, language models, and related technologies. The project focuses on making it easy for developers to include recognizable AI brand icons in applications such as dashboards, AI tools, documentation sites, or developer portals. The library includes icons for a wide range of AI providers and models, allowing developers to visually represent integrations with tools such as large language models, AI APIs, and machine learning platforms. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    AI Notes

    AI Notes

    Curated AI engineering notes on LLMs, generative models, and tools

    ...These notes include observations, references, experiments, and summaries of important research and industry developments in AI. ai-notes also contains collections of prompts, curated learning materials, and categorized resources intended to help developers explore AI capabilities and practical applications.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    MLX Engine

    MLX Engine

    LM Studio Apple MLX engine

    MLX Engine is the Apple MLX-based inference backend used by LM Studio to run large language models efficiently on Apple Silicon hardware. Built on top of the mlx-lm and mlx-vlm ecosystems, the engine provides a unified architecture capable of supporting both text-only and multimodal models. Its design focuses on high-performance on-device inference, leveraging Apple’s MLX stack to accelerate computation on M-series chips. The project introduces modular VisionAddOn components that allow image...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    Synthetic Data Generator

    Synthetic Data Generator

    SDG is a specialized framework

    ...The platform enables developers and data scientists to create artificial datasets that preserve important relationships between variables without containing sensitive personal information. This makes the generated data suitable for tasks such as machine learning model training, testing software systems, sharing datasets across organizations, and conducting research without violating privacy regulations. The system supports multiple generation methods including statistical models, generative adversarial networks, and large language model–based synthesis. It also includes a data processing module capable of handling different data types, preprocessing columns, managing missing values, and converting formats automatically before model training.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    slime LLM

    slime LLM

    slime is an LLM post-training framework for RL Scaling

    slime is an open-source large language model (LLM) post-training framework developed to support reinforcement learning (RL)-based scaling and high-performance training workflows for advanced LLMs, blending training and rollout modules into an extensible system. It offers a flexible architecture that connects high-throughput training (e.g., via Megatron-LM) with a customizable data generation pipeline, enabling researchers and engineers to iterate on new RL training paradigms effectively. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    AI Engineering Hub

    AI Engineering Hub

    In-depth tutorials on LLMs, RAGs and real-world AI agent applications

    The AI Engineering Hub repository is a large open-source collection of hands-on projects, tutorials, and real-world AI engineering resources designed to help developers learn and build with modern AI technologies, especially large language models (LLMs), retrieval-augmented generation (RAG), and agent-based systems. It includes more than 90 production-ready projects across skill levels, organized into beginner, intermediate, and advanced categories to guide users progressively from simple experiments to complex AI workflows. Projects range from OCR applications and local chatbot UIs to multimodal RAG systems and multi-agent automation pipelines, making the hub valuable both as a learning resource and as a practical reference. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Context Engineering

    Context Engineering

    A frontier, first-principles handbook

    ...Moving beyond traditional prompt engineering, this repository defines and explores how to craft and provide complete context payloads — not just single prompts — to large language models so they can perform tasks more reliably and intelligently. It takes inspiration from thought leaders like Andrej Karpathy and bridges theory with practical examples, offering structured guidance on context orchestration, memory, retrieval, and state control within AI workflows. With extensive materials drawn from research, surveys, and visual explanations, the project acts as both a learning resource and a reference for practitioners looking to improve model behavior by engineering richer inputs.
    Downloads: 5 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB