Showing 42 open source projects for "highlight"

View related business solutions
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    AutoClip

    AutoClip

    AI-powered video clipping and highlight generation

    AutoClip is an open-source, AI-powered video processing system designed to automate the extraction of “highlight” segments from full-length videos — ideal for creators who want to generate bite-sized clips, compilations, or highlight reels without manually sifting through hours of footage. The system supports downloading videos from major platforms (e.g. YouTube, Bilibili), or accepting local uploads, and then applies AI analysis to identify segments worth clipping based on content (e.g. high energy moments, speech, or other heuristics). ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 2
    diff2html

    diff2html

    Pretty diff to html javascript library (diff2html)

    ...The AI community building the future. Build, train and deploy state of the art models powered by the reference open source in natural language processing. Wrapper and helper adding syntax highlight, synchronized scroll, and other nice features. You can use it without syntax highlight or by passing your own implementation with the languages you prefer. Diff2Html can be used in various ways as listed in the distributions section.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Pot Desktop

    Pot Desktop

    A cross-platform software for text translation and recognition

    Pot-Desktop is a cross-platform productivity tool aimed at helping users quickly translate, perform OCR (optical character recognition), and synthesize speech for selected text or images — all with minimal friction. It supports picking text via mouse selection (“highlight-and-translate”), clipboard listening, or screenshot-based OCR; this makes it ideal for reading webpages, documents, images — or any on-screen text — and instantly getting translations or text extraction. The tool supports external plugin extensions, which means its functionality can be expanded far beyond the built-in options: you can add translation engines, OCR backends, TTS engines, vocabulary export (e.g. for language learning), and more. ...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 4
    AI YouTube Shorts Generator

    AI YouTube Shorts Generator

    A python tool that uses GPT-4, FFmpeg, and OpenCV

    ...It analyzes input video (whether a local file or a YouTube URL), transcribes audio (with optional GPU-accelerated speech-to-text), uses an AI model to identify the most compelling or engaging segments, and then crops/resizes the video and applies subtitle overlays, producing a polished short video without manual editing. The tool streamlines multiple steps of the tedious short-form video workflow: highlight detection, clipping, subtitle generation, cropping to vertical 9:16 format, and final rendering — reducing hours of editing to a mostly automated pipeline. Because it supports both local and online video sources, it's flexible whether you're working with your own recorded content or repurposing existing longer-form videos.
    Downloads: 8 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Claude Code Haha

    Claude Code Haha

    Claude Code leaked source - locally runnable version

    Claude Code Haha is an experimental and often humorous adaptation of Claude-style coding agents, designed to explore and demonstrate how agentic coding systems behave under different configurations and prompts. While it retains the core functionality of analyzing and modifying codebases, the project introduces variations that highlight both the strengths and quirks of autonomous coding assistants. It serves as a sandbox for testing how agents interpret instructions, manage context, and execute development tasks in a less formal or more exploratory setting. The repository likely includes playful modifications, custom prompts, or unconventional workflows that reveal edge cases in agent behavior. ...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 6
    ChatALL

    ChatALL

    Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vicuna, etc.

    Large Language Models (LLMs) based AI bots are amazing. However, their behavior can be random, and different bots excel at different tasks. If you want the best experience, don't try them one by one. ChatALL (Chinese name: 齐叨) can send prompts to several AI bots concurrently, helping you to discover the best results. All you need to do is download, install, and ask.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 7
    ChatALL

    ChatALL

    Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vincuna, etc.

    Concurrently chat with ChatGPT, Bing Chat, bard, Alpaca, Vincuna, Claude, ChatGLM, MOSS, iFlytek Spark, ERNIE and more, discover the best answers. Large Language Models (LLMs) based AI bots are amazing. However, their behavior can be random and different bots excel at different tasks. If you want the best experience, don't try them one by one. ChatALL (Chinese name: 齐叨) can send prompt to several AI bots concurrently, help you to discover the best results.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    CS-Ebook

    CS-Ebook

    Curated list of classic, high-quality computer science books

    ...Its organized structure allows users to navigate topics efficiently and follow a progressive learning path. Contributions are encouraged, ensuring the list evolves with community input and continues to highlight valuable resources.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    Embedding Atlas

    Embedding Atlas

    Tool that provides interactive visualizations for large embeddings

    Embedding Atlas is an open-source tool by Apple that provides scalable, interactive visualizations for large embedding datasets. It enables users to visualize, cross-filter, and search through embeddings alongside rich metadata, all in real time using modern web-based technologies. In addition to the command line tool, Embedding Atlas is also available as a Jupyter widget. Finally, components from Embedding Atlas are also available in an npm package. Order-independent transparency ensuring...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 10
    Ax

    Ax

    Build LLM powered Agents and "Agentic workflows"

    ...Seamlessly integrates with multiple LLMs and VectorDBs to build RAG pipelines or collaborative agents that can solve complex problems. Advanced features streaming validation, multi-modal DSPy, etc. We've renamed from "llmclient" to "ax" to highlight our focus on powering agentic workflows. We agree with many experts like "Andrew Ng" that agentic workflows are the key to unlocking the true power of large language models and what can be achieved with in-context learning. Also, we are big fans of the Stanford DSPy paper, and this library is the result of all of this coming together to build a powerful framework for you to build with.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    Spring AI Alibaba Examples

    Spring AI Alibaba Examples

    Spring AI Alibaba examples for building and testing AI apps

    ...It is designed to help developers understand core concepts, explore practical implementations, and follow best practices when building AI-powered systems using the Spring ecosystem. Each module focuses on a specific use case such as chat, image processing, audio handling, graph workflows, and retrieval-augmented generation. The examples highlight how to integrate AI models, manage prompts, handle memory, and build multi-model or multi-agent workflows. Developers can explore individual project folders for detailed instructions and implementation guidance. Spring AI Alibaba Examples also supports experimentation through playground modules and encourages contributions to expand real-world AI use cases and improve development practices.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 12
    n8n-MCP

    n8n-MCP

    A MCP for Claude Desktop / Claude Code / Windsurf / Cursor

    ...The project targets practical agent ops: safer mutations, better error reporting, and predictable behavior when automating or refactoring automations. Community posts highlight the goal of giving agents accurate knowledge of hundreds of n8n nodes and keeping that knowledge fresh as n8n evolves.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    Advanced AI explainability for PyTorch

    Advanced AI explainability for PyTorch

    Advanced AI Explainability for computer vision

    pytorch-grad-cam is an open-source library that provides advanced explainable AI techniques for interpreting the predictions of deep learning models used in computer vision. The project implements Grad-CAM and several related visualization methods that highlight the regions of an image that most strongly influence a neural network’s decision. These visualization techniques allow developers and researchers to better understand how convolutional neural networks and transformer-based vision models make predictions. The library supports a wide variety of tasks including image classification, object detection, semantic segmentation, and similarity analysis. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    MobileCLIP

    MobileCLIP

    Implementation of "MobileCLIP" CVPR 2024

    ...The repo provides training, inference, and evaluation code for MobileCLIP models trained on DataCompDR, and for newer MobileCLIP2 models trained on DFNDR. It includes an iOS demo app and Core ML artifacts to showcase practical, offline photo search and classification on iPhone-class hardware. Project notes highlight latency/accuracy trade-offs, with MobileCLIP2 variants matching or surpassing larger baselines at notably lower parameter counts and runtime on mobile devices. A companion “mobileclip-dr” repository details large-scale, distributed data-generation pipelines used to reinforce datasets across billions of samples on thousands of GPUs. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Serena

    Serena

    Agent toolkit providing semantic retrieval and editing capabilities

    ...It emphasizes symbol-level understanding rather than naive file-wide diffs, enabling more precise refactors and additions. The repository and ecosystem materials highlight rapid setup, agent interoperability, and examples that show agents iterating on a codebase with guardrails. It’s actively maintained by Oraios, with recent updates, community showcases, and third-party write-ups underscoring interest from the agent tooling community.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    VGGT

    VGGT

    [CVPR 2025 Best Paper Award] VGGT

    ...The repo provides inference pipelines to estimate geometry from monocular inputs, stereo pairs, or brief sequences, together with evaluation harnesses for common geometry benchmarks. Training utilities highlight data curation and augmentations that preserve geometric cues while improving generalization across scenes and cameras.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    Gonzo

    Gonzo

    Real-time terminal log analyzer with AI insights and dashboards

    ...Users can explore logs through a k9s-inspired layout, combining visualizations like heatmaps, severity distributions, and timelines. Advanced filtering with regex and attribute search helps isolate issues quickly. Gonzo also integrates AI capabilities to detect patterns, highlight anomalies, and suggest root causes, making it easier to understand complex system behavior. With customizable themes, keyboard and mouse navigation, and support for local or external AI models, it provides a fast, developer-friendly way to turn raw logs into actionable insights without leaving the terminal.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    InvestBrain

    InvestBrain

    LLM-enabled investment tracker that consolidates market performance

    InvestBrain is a financial portfolio management and investment insight platform designed to help individual investors track assets, analyze performance, and explore data-driven insights across markets. It provides tools to import financial data such as stocks, cryptocurrencies, or ETFs, maintain watchlists, and view performance summaries that highlight gains, losses, allocations, and historical trends. The interface blends real-time or near-real-time market data with personalized analytics, so users can assess portfolio health, diversification, and risk exposure with intuitive charts and tables. Beyond tracking, the platform offers educational insights and indicators (like technical or fundamental signals) that can inform investment decisions and help users recognize patterns or opportunities. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Habit Tracker

    Habit Tracker

    Habit Tracker for the AI Coding Workshop

    ...The app provides streak tracking and completion rates for each habit, giving users feedback on consistency and motivation by showing how often habits are completed and where they may be lagging. A calendar view lets users see a monthly grid of their habit history with color-coded days to highlight patterns and encourage daily engagement. Habit-Tracker also supports planned absences so users can skip days without breaking their streaks, reducing frustration and keeping long-term habits on track.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Bytebot

    Bytebot

    Bytebot is an AI desktop agent that automates computer tasks

    ...It typically includes capabilities for code generation, refactoring suggestions, automated testing assistance, and integration with source control systems to make commits or generate pull requests guided by natural language prompts. Bytebot can be embedded into editors, terminals, or CI/CD pipelines to provide contextual recommendations, highlight quality issues, and propose fixes based on understanding of code and project context. The framework is often extensible, allowing teams to define custom commands, workflows, or plug-ins that connect to internal APIs, coding standards, or proprietary repositories.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    UltraRAG

    UltraRAG

    Less Code, Lower Barrier, Faster Deployment

    ...It encourages pipeline composition via configuration, enabling researchers to swap retrievers, rerankers, and generators without heavy refactoring. Community posts highlight its focus on reducing engineering overhead so more effort goes to experimental design. Backed by the OpenBMB org, it is actively maintained with tutorials and updates.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    LISA

    LISA

    LISA: Reasoning Segmentation via Large Language Model

    LISA is an open-source multimodal AI system designed to enable language models to perform pixel-level reasoning and segmentation tasks on images. The project introduces a framework where a large language model can interpret natural language instructions and produce segmentation masks that highlight relevant regions in an image. Instead of relying solely on predefined object categories, the model is capable of reasoning about complex textual queries and translating them into visual segmentation outputs. This approach allows the system to identify objects or regions in images based on semantic descriptions, contextual reasoning, and world knowledge. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    LitterBox

    LitterBox

    A secure sandbox environment for malware developers and red teamers

    ...The README frames typical use cases: testing evasion, validating detections, analyzing behavior, and keeping sensitive tooling in-house. Repo metadata and author pages highlight an active security-tools ecosystem around the maintainer, with CI and pull-request activity suggesting ongoing development. The project positions itself as a safe proving ground to reduce surprises in the field while minimizing operational risk. For teams exploring MCP integrations, notes mention pairing with LLM agents for assisted analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    FastVLM

    FastVLM

    This repository contains the official implementation of FastVLM

    ...Instead of elaborate pruning stages, the design trades off resolution and token count through input scaling, simplifying the pipeline while maintaining strong accuracy. Reported results highlight dramatic speedups in time-to-first-token and competitive quality versus contemporary open VLMs, including comparisons across small and larger variants. The repository documents model variants, showcases head-to-head numbers against known baselines, and explains how the encoder integrates with common LLM backbones. Apple’s research brief frames FastVLM as targeting real-time or latency-sensitive scenarios, where lowering visual token pressure is critical to interactive UX. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Code-Mode

    Code-Mode

    Plug-and-play library to enable agents to call MCP and UTCP tools

    ...The repository contains both TypeScript and Python libraries, plus a code-mode-mcp component for integrating with MCP and UTCP ecosystems. Benchmarks in the README highlight improvements in latency and token cost for scenarios involving multiple tools, showing that code execution often outperforms traditional JSON-based function calling.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next