Showing 205 open source projects for "knowledge"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    LlamaIndex

    LlamaIndex

    Central interface to connect your LLM's with external data

    LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLM's with external data. LlamaIndex is a simple, flexible interface between your external data and LLMs. It provides the following tools in an easy-to-use fashion. Provides indices over your unstructured and structured data for use with LLM's. These indices help to abstract away common boilerplate and pain points for in-context learning. Dealing with prompt limitations (e.g. 4096 tokens for Davinci) when...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    MathCode

    MathCode

    A Frontier Mathematical Coding Agent

    ...It supports an agentic proving workflow where the system behaves more like an interactive mathematical engineer than a one-shot text generator. MathCode also includes visualization-oriented tooling such as theorem graph generation for Obsidian knowledge workflows. Its main value is bridging natural-language mathematics with formal verification systems in a way that is more automated, inspectable, and iterative than traditional theorem-proving pipelines.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Data Science Articles from CodeCut

    Data Science Articles from CodeCut

    Collection of useful data science topics along with articles

    The Data-science repository from CodeCutTech is a curated collection of educational content focused on practical tools and workflows used in modern data science projects. Instead of providing a single software package, the repository aggregates articles, tutorials, and examples covering many topics within the data science ecosystem. The materials address areas such as MLOps, data management, project organization, testing practices, visualization techniques, and productivity tools used by...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    All-in-RAG

    All-in-RAG

    Big Model Application Development Practice 1

    ...The repository provides a structured learning path that covers both theoretical foundations and practical implementation steps for RAG systems. It explains the full development pipeline required to create knowledge-aware AI assistants, including data preparation, document indexing, vector embedding generation, and retrieval strategies. The project also explores advanced topics such as hybrid retrieval methods, query optimization, and evaluation techniques for improving system accuracy. Alongside theoretical explanations, the repository includes hands-on exercises and example projects that demonstrate how to build production-ready RAG systems. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • 5
    MemMachine

    MemMachine

    Universal memory layer for AI Agents

    MemMachine is a universal memory layer designed for AI agents that provides persistent, rich memory storage and retrieval capabilities so autonomous agent systems can recall context, personal preferences, and long-term interaction history across sessions, models, and use cases. Unlike ephemeral LLM prompt state, MemMachine supports distinct memory types—short-term conversational context, long-term persistent knowledge, and profile memory for personalized facts—persisted in optimized stores (e.g., graph databases for episodic lines of reasoning and SQL for user facts) to support robust, context-aware intelligence in agents. It offers flexible APIs, a Python SDK, REST interfaces, and MCP (Model Context Protocol) connectivity to integrate seamlessly with agent frameworks receiving and storing memories over time, effectively boosting relevance, continuity, and tailored behavior.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    AI-Researcher

    AI-Researcher

    AI-Researcher: Autonomous Scientific Innovation

    ...It lets users input high-level research goals or questions in natural language and then automatically plans, decomposes, and executes tasks such as literature surveying, summarization, synthesis, experiment design, and draft generation. The system integrates retrieval mechanisms to pull in external knowledge sources, contextually analyze documents and papers, and build structured representations of ideas and arguments that can later be turned into coherent reports or drafts. Rather than simply generating text from prompts, AI-Researcher orchestrates sequences of subtasks — such as extracting definitions, identifying key experiments, and tracking citations — and uses self-refinement loops to iteratively improve outputs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    MemClaw

    MemClaw

    Persistent memory for AI agent fleets (OSS)

    MemClaw is an open-source governed shared memory platform for AI agent fleets. It is designed to help agents remember information across sessions, teams, tools, and models instead of keeping knowledge trapped inside isolated conversations. The project emphasizes enterprise-style governance, including permissions, tenant isolation, audit trails, visibility scopes, and agent trust tiers. It also supports agent integrations through MCP and OpenClaw-style workflows, making it useful for multi-agent systems that need persistent recall. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Anything to NotebookLM

    Anything to NotebookLM

    Multi-source content processor for NotebookLM

    ...The tool can process files locally, extract or transcribe content when needed, and hand the cleaned material to NotebookLM for generation. It is best suited for researchers, students, content curators, and knowledge workers who regularly turn scattered information into organized learning assets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    autoMate

    autoMate

    AI tool for automating desktop tasks via natural language input

    autoMate is an AI-powered local automation tool designed to enable users to control and automate their computers using natural language instructions instead of traditional scripting or rule-based systems. It combines large language models with computer vision techniques to interpret user intent and understand on-screen content, allowing it to interact with graphical interfaces similarly to a human user. autoMate follows an observe-decide-act workflow, where it analyzes the screen, plans...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 10
    TypeAgent Python

    TypeAgent Python

    Structured RAG: ingest, index, query

    TypeAgent Python is an experimental Python implementation of Microsoft’s TypeAgent architecture designed to explore how large language models can interact with structured software systems. The project focuses on implementing structured Retrieval-Augmented Generation workflows that allow agents to ingest information, index it in structured form, and answer queries using language models. Instead of relying solely on free-form prompts, the architecture emphasizes converting natural language...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    NVIDIA PhysicsNeMo

    NVIDIA PhysicsNeMo

    Open-source deep-learning framework for building and training

    NVIDIA PhysicsNeMo is an open-source deep learning framework designed for building artificial intelligence models that incorporate physical laws and scientific knowledge into machine learning workflows. The framework focuses on the emerging field of physics-informed machine learning, where neural networks are used alongside physical equations to model complex scientific systems. PhysicsNeMo provides modular Python components that allow developers to create scalable training and inference pipelines for models that combine data-driven learning with physics-based constraints. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Data Science Interviews

    Data Science Interviews

    Data science interview questions and answers

    Data Science Interviews is an open-source repository that collects common data science interview questions along with community-provided answers and explanations. The project serves as a preparation resource for students, job seekers, and professionals who want to review the technical knowledge required for data science roles. The repository organizes questions into different categories including theoretical machine learning concepts, technical programming questions, and probability or statistics problems. Many of the questions cover fundamental machine learning topics such as linear models, decision trees, neural networks, and evaluation metrics. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    MiroThinker

    MiroThinker

    MiroThinker is an open source deep research agent

    ...Rather than simply generating responses from a single prompt, the agent performs structured multi-step reasoning processes that involve searching for information, analyzing evidence, and synthesizing conclusions. The platform is optimized for research tasks such as financial forecasting, knowledge discovery, and large-scale information synthesis. MiroThinker has been evaluated on several agent benchmarks and has demonstrated strong performance on tests designed to measure deep research capabilities.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    RecAI

    RecAI

    Bridging LLM and Recommender System

    RecAI is an open-source research platform developed by Microsoft to explore how large language models can be integrated into modern recommender systems. Traditional recommender systems rely on structured behavioral data such as user interactions and item embeddings, while large language models excel at understanding language and reasoning about user preferences. RecAI aims to bridge these two domains by creating architectures and training methods that allow LLMs to function as intelligent...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Llama-Chinese

    Llama-Chinese

    Llama Chinese community, real-time aggregation

    ...In addition to model development, the project collects learning resources and open research contributions related to LLM technology in Chinese environments. Overall, Llama-Chinese acts as both a technical ecosystem and knowledge hub dedicated to advancing Chinese-language large model development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    ReMe

    ReMe

    Memory Management Kit for Agents

    ...The toolkit provides APIs to offload large, ephemeral outputs to external storage and reload them on demand, which reduces memory bloat and keeps active context concise. By combining embeddings, vector search, and summarization workflows, ReMe lets developers build agent systems that can recall and apply past knowledge in future reasoning tasks. The project fits into the broader agent-oriented programming ecosystem by supplying a standardized memory layer that integrates with agent frameworks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    MiniRAG

    MiniRAG

    Making RAG Simpler with Small and Open-Sourced Language Models

    ...When a query is issued, MiniRAG retrieves the most relevant contexts and feeds them into a generative model to produce an answer that is grounded in the source material rather than hallucinated. Its minimal footprint makes it suitable for local research assistants, chatbots, help desks, or knowledge bases embedded in applications with limited resources. Despite its simplicity, it includes features such as chunking logic, configurable embedding models, and optional caching to balance performance and accuracy.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Anthropic's Original Performance

    Anthropic's Original Performance

    Anthropic's original performance take-home, now open for you to try

    ...This take-home includes starter code, tests, and tools to debug performance, aiming to measure how effectively one can apply algorithmic improvements and optimizations. Because it’s framed around beating baseline scores — and even outperforming previous automated systems — it encourages both deep knowledge of Python and creative problem-solving.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    AtomAI

    AtomAI

    Deep and Machine Learning for Microscopy

    AtomAI is a Pytorch-based package for deep and machine-learning analysis of microscopy data that doesn't require any advanced knowledge of Python or machine learning. The intended audience is domain scientists with a basic understanding of how to use NumPy and Matplotlib. It was developed by Maxim Ziatdinov at Oak Ridge National Lab. The purpose of the AtomAI is to provide an environment that bridges the instrument-specific libraries and general physical analysis by enabling the seamless deployment of machine learning algorithms including deep convolutional neural networks, invariant variational autoencoders, and decomposition/unmixing techniques for image and hyperspectral data analysis. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    RecBole

    RecBole

    A unified, comprehensive and efficient recommendation library

    ...We have implemented more than 100 recommender system models, covering four common recommender system categories in RecBole and eight toolkits of RecBole2.0, including General Recommendation, Sequential Recommendation, Context-aware Recommendation, and Knowledge-based Recommendation and sub-packages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    AI Engineer Headquarters

    AI Engineer Headquarters

    A collection of scientific methods, processes, algorithms

    ...The project serves as a curated collection of resources, methodologies, and tools covering topics across the entire artificial intelligence development lifecycle. Rather than focusing only on theoretical knowledge, the repository emphasizes applied learning and encourages engineers to build real systems that incorporate machine learning, large language models, data pipelines, and AI infrastructure. The curriculum includes a progression of topics such as foundational AI engineering skills, machine learning systems design, large language model usage, retrieval-augmented generation systems, model fine-tuning, and autonomous AI agents. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Kaggle Solutions

    Kaggle Solutions

    Collection of Kaggle Solutions and Ideas

    Kaggle Solutions is an open-source repository that compiles winning solutions, insights, and educational resources from hundreds of Kaggle data science competitions. The repository acts as a knowledge base for competitive machine learning by collecting solution write-ups, discussion threads, code notebooks, and tutorial resources shared by top Kaggle participants. Each competition entry typically includes information about the dataset, evaluation metrics, modeling strategies, and techniques used by high-ranking competitors. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Transfer Learning Repo

    Transfer Learning Repo

    Transfer learning / domain adaptation / domain generalization

    Transfer Learning Repo is an open-source repository that compiles resources, code implementations, and academic references related to transfer learning and its related research areas. The project functions as a large knowledge hub that organizes papers, tutorials, datasets, and software implementations across topics such as domain adaptation, domain generalization, multi-task learning, and few-shot learning. The repository includes surveys and theoretical explanations that help readers understand how transfer learning methods allow models trained in one domain to adapt to new tasks or datasets. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    MiroFlow

    MiroFlow

    Agent framework that enables tool-use agent tasks

    ...One of the core innovations of MiroFlow is its use of agent graphs, which enable flexible orchestration of multiple sub-agents and tools in order to complete complex workflows. This architecture allows agents to perform advanced reasoning tasks such as deep research, future event prediction, and multi-step knowledge analysis. The framework emphasizes reliability and scalability by incorporating robust workflow execution, concurrency management, and fault-tolerant design to handle unstable APIs or network conditions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Dynamiq

    Dynamiq

    An orchestration framework for agentic AI and LLM applications

    ...The framework supports the creation of multi-agent systems where different AI agents collaborate to solve tasks such as information retrieval, document analysis, or automated decision making. Dynamiq also includes built-in support for retrieval-augmented generation pipelines that allow models to access external documents and knowledge bases during inference.
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo