Search Results for "structured text" - Page 6

Sort By:

Showing 332 open source projects for "structured text"

View related business solutions

Linux Clear Filters & Widen Search

Build Agents and Models on One Platform
Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free
$300 Free Credits to Build on Google Cloud
New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.

Claim $300 Free
1

Qwen-Image-Layered

Qwen-Image-Layered: Layered Decomposition for Inherent Editablity

...This architecture allows richer semantic interpretation, enabling use cases such as scene decomposition, object-level editing, layered captioning, and more fine-grained multimodal reasoning than with flat image encodings alone. By combining text and structured image representations, it aims to facilitate tasks where both descriptive and structural understanding are important, such as detailed image QA, interactive image editing via prompt layers, and image-conditioned generation with structural control. The layered approach supports training signals that help the model learn how visual elements relate to each other and to textual context, rather than simply learning global image embeddings.

Downloads: 0 This Week

Last Update: 2026-01-05
See Project
2

DeerFlow

Deep Research framework, combining language models with tools

DeerFlow is an open-source, community-driven “deep research” framework / multi-agent orchestration platform developed by ByteDance. It aims to combine the reasoning power of large language models (LLMs) with automated tool-use — such as web search, web crawling, Python execution, and data processing — to enable complex, end-to-end research workflows. Instead of a monolithic AI assistant, DeerFlow defines multiple specialized agents (e.g. “planner,” “searcher,” “coder,” “report generator”)...

Downloads: 47 This Week

Last Update: 2 days ago
See Project
3

Ollama Swift Client

A Swift client library for interacting with Ollama

...It is designed to feel natural within the Swift ecosystem, using modern language features like async/await and strong typing to provide a clean and intuitive developer experience. The library wraps the Ollama REST API into structured Swift calls, making it easy to perform tasks such as chat completion, text generation, and embeddings without dealing with raw HTTP requests. It supports streaming responses, allowing applications to display generated content progressively, which is especially useful for chat interfaces. The project emphasizes simplicity and integration with native app development, making it ideal for building AI-powered mobile or desktop applications that leverage local models.

Downloads: 0 This Week

Last Update: 2026-04-20
See Project
4

Ollama-Laravel Package

Ollama-Laravel is a Laravel package providing seamless integration

...It abstracts the Ollama API into a developer-friendly facade, allowing Laravel developers to interact with models using familiar patterns such as service containers, configuration files, and fluent method chaining. The package supports a wide range of AI capabilities including text generation, chat-based interactions, embeddings, and multimodal vision analysis, making it suitable for both simple features and complex AI-driven systems. It also includes support for reasoning models and function calling, enabling developers to build more advanced workflows where models can trigger tools or structured actions. Real-time streaming responses are supported, allowing applications to deliver incremental outputs for better user experience.

Downloads: 0 This Week

Last Update: 2026-04-20
See Project
Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure
Native application identity and user-based security for your Azure cloud

Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.

Get a free trial
5

Aix-DB

Based on the LangChain/LangGraph framework

Aix-DB is an open-source intelligent data analysis platform that combines large language models with database technologies to enable conversational data exploration. The system is designed as a ChatBI solution that allows users to query datasets using natural language and receive structured insights, charts, and visualizations automatically. Built on frameworks such as LangChain and LangGraph, Aix-DB integrates retrieval-augmented generation and Text-to-SQL capabilities to convert user questions into executable database queries. The platform supports multiple types of data sources and provides an end-to-end pipeline that includes intent recognition, SQL generation, database execution, and visual presentation of results. ...

Downloads: 0 This Week

Last Update: 2026-04-11
See Project
6

OceanBase seekdb

The AI-Native Search Database

seekdb is an AI-native search database from OceanBase that unifies vector, full-text, relational, JSON, and GIS data into a single query engine. The system is designed to support hybrid search workloads and in-database AI workflows without requiring multiple specialized databases. It enables developers to perform semantic search, keyword search, and structured SQL queries within the same platform, simplifying modern AI application stacks. seekdb also embeds AI capabilities directly in the database layer, including embedding generation, reranking, and LLM inference for end-to-end RAG pipelines. ...

Downloads: 0 This Week

Last Update: 2026-05-25
See Project
7

MiniRAG

Making RAG Simpler with Small and Open-Sourced Language Models

MiniRAG is a lightweight retrieval-augmented generation tool designed to bring the benefits of RAG workflows to smaller datasets, edge environments, and constrained compute settings by simplifying embedding, indexing, and retrieval. It extracts text from documents, codes, or other structured inputs and converts them into embeddings using efficient models, then stores these vectors for fast nearest-neighbor search without requiring huge databases or separate vector servers. When a query is issued, MiniRAG retrieves the most relevant contexts and feeds them into a generative model to produce an answer that is grounded in the source material rather than hallucinated. ...

Downloads: 0 This Week

Last Update: 2026-02-03
See Project
8

endlessh-go

A golang implementation of endlessh exporting Prometheus metrics

...Besides trapping the attackers, I also want to visualize the Geolocations and other statistics of the sources of attacks. Unfortunately the wonderful original C implementation of endlessh only provides text based log, but I do not like the solution that writes extra scripts to parse the log outputs, then exports the results to a dashboard, because it would introduce extra layers in my current setup and it would depend on the format of the text log file rather than some structured data. Thus I create this golang implementation of endlessh to export Prometheus metrics and a Grafana dashboard to visualize them.

Downloads: 0 This Week

Last Update: 2026-03-29
See Project
9

baoyu-skills

Skills shared by Baoyu for improving daily work efficiency with Claude

...The project organizes its functionality into categories such as content creation, AI generation, and utility tools, enabling users to extend their workflows through reusable components. Each skill is implemented as a structured module with its own configuration, scripts, and execution logic, allowing for flexible customization and extension. The system supports marketplace-style installation, where users can selectively install or update individual skills rather than a monolithic package. It integrates with various external services, including AI APIs and browser automation tools, to expand its capabilities beyond basic text processing.

Downloads: 3 This Week

Last Update: 2026-06-18
See Project
Stop vibe-debugging.
Plug Claude into your app's actual errors.

AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.

Free 30 days.
10

DeepTutor

AI-Powered Personalized Learning Assistant

...It goes beyond simple Q&A by constructing multi-stage educational narratives, breaking down complex topics into sequenced “lesson steps,” and offering prompts, examples, and exercises that build on each other in a logical curriculum. The core architecture combines LLM-based reasoning with structured pedagogy modules so that explanations accommodate different learning styles and address misconceptions in follow-up responses. DeepTutor supports retrieval of external references, definitions, and diagrams so responses are grounded in authoritative content and not just generative text, and it includes internal checks to ensure accuracy and conceptual consistency.

Downloads: 3 This Week

Last Update: 3 days ago
See Project
11

DeepSeek-OCR 2

Visual Causal Flow

DeepSeek-OCR-2 is the second-generation optical character recognition system developed to improve document understanding by introducing a “visual causal flow” mechanism, enabling the encoder to reorder visual tokens in a way that better reflects semantic structure rather than strict raster scan order. It is designed to handle complex layouts and noisy documents by giving the model causal reasoning capabilities that mimic human visual scanning behavior, enhancing OCR performance on documents...

Downloads: 3 This Week

Last Update: 2026-02-03
See Project
12

os-tutorial

How to create an OS from scratch

os-tutorial is an open source educational project by cfenollosa that teaches the basics of building an operating system from scratch. The repository provides step-by-step lessons starting with bootloaders and moving through kernel development, interrupts, memory management, and system calls. Each tutorial is accompanied by clear explanations, code examples, and references to deepen understanding. The project uses x86 assembly and C to illustrate concepts, making it accessible to students and...

Downloads: 3 This Week

Last Update: 5 days ago
See Project
13

InkOS

Autonomous novel writing CLI AI Agent

InkOS is a multi-agent creative writing system designed to automate the production of long-form narrative content such as novels through coordinated AI workflows. The system organizes multiple specialized agents that collaborate in stages, including drafting, reviewing, editing, and refining text, with optional human checkpoints to ensure quality and coherence. Its architecture reflects a pipeline approach where each agent contributes a specific function, allowing iterative improvement...

Downloads: 6 This Week

Last Update: 2026-06-10
See Project
14

PaperSpine

Motivation-driven skill for learning from strong academic papers

...It is best suited for users who need format-aware, evidence-aware academic writing support rather than generic text generation.

Downloads: 1 This Week

Last Update: 2026-06-11
See Project
15

video-use

Edit videos with Claude Code

...The system intelligently analyzes audio transcripts and visual cues to make precise, context-aware editing decisions. It supports a wide range of content types, including interviews, tutorials, montages, and talking-head videos. By combining structured text representations with on-demand visual previews, it minimizes processing overhead while maintaining high-quality results. Overall, Video Use reimagines video editing as an AI-driven, conversational workflow.

Downloads: 12 This Week

Last Update: 2026-05-15
See Project
16

Advanced NLP with spaCy

Advanced NLP with spaCy: A free online course

...The course is designed to teach developers how to build real-world NLP systems by combining rule-based techniques with machine learning models. The repository includes lessons, exercises, and examples that guide learners through tasks such as tokenization, named entity recognition, text classification, and training custom NLP models. It also demonstrates how spaCy pipelines work and how developers can extend them with custom components and training data. The course is structured as a hands-on learning environment where students can run code examples, experiment with NLP techniques, and build practical language processing applications. ...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
17

Gitingest

Create prompt-friendly codebase digests from any Git repository URL

Gitingest is a developer utility that converts an entire Git repository into a structured, prompt-friendly text digest suitable for use with large language models. It analyzes a repository and produces a consolidated textual representation that includes the file structure and code content in an organized format. This makes it easier to provide meaningful code context when working with AI systems that require compact, readable inputs.

Downloads: 0 This Week

Last Update: 2026-03-13
See Project
18

PaperBanana

Extension of Google Research’s PaperBanana

PaperBanana is an open-source agentic framework designed to automatically generate publication-quality academic diagrams and statistical plots directly from text descriptions. The project focuses on helping researchers, educators, and data scientists transform conceptual descriptions of figures into structured visual outputs suitable for research papers, presentations, and technical reports. Instead of manually designing charts or diagrams using traditional visualization tools, users can describe the desired figure in natural language and allow the system to generate the visual representation automatically. ...

Downloads: 0 This Week

Last Update: 2026-06-12
See Project
19

InternLM-XComposer-2.5

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System

InternLM-XComposer is an open-source multimodal AI system designed to generate long-form content that combines text with visual elements such as images and diagrams. The model is built on top of the InternLM language model architecture and extends its capabilities to handle multimodal inputs and outputs. Instead of producing only textual responses, the system can generate visually enriched documents such as illustrated articles, presentations, and educational materials.

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
20

Lingvo

Framework for building neural networks

Lingvo is a TensorFlow based framework focused on building and training sequence models, especially for language and speech tasks. It was originally developed for internal research and later open sourced to support reproducible experiments and shared model implementations. The framework provides a structured way to define models, input pipelines, and training configurations using a common interface for layers, which encourages reuse across different tasks. It has been used to implement state...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
21

Google Antigravity SDK

Python library for building agents that leverages Google Antigravity

Google Antigravity SDK for Python is a Python library for building AI agents powered by Antigravity and Gemini. It provides a secure, scalable, and stateful infrastructure layer so developers can focus on agent behavior instead of manually implementing the full agent loop. The SDK includes a high-level Agent class for quick setup, as well as lower-level conversation and connection abstractions for more controlled workflows. It supports streaming responses, stateful sessions, custom Python...

Downloads: 6 This Week

Last Update: 3 days ago
See Project
22

GPT-Image2-Skill

GPT Image 2 prompt gallery, image prompt library, agentic skill

GPT-Image2-Skill is a prompt gallery, image prompt library, agent skill, and CLI for OpenAI image generation and editing workflows. It collects curated prompt examples with generated outputs so users can reuse strong visual patterns instead of starting from scratch. The project includes categories such as anime, gaming, cyberpunk, animation, character design, typography, illustration, watercolor, ink, pixel art, isometric scenes, product visuals, and food imagery. It can be installed as an...

Downloads: 3 This Week

Last Update: 2026-05-19
See Project
23

YuE

Open source AI model for generating full songs from lyrics prompts

YuE is an open source project that provides a foundation model designed for full-song music generation using artificial intelligence. It focuses on transforming text inputs such as lyrics and genre prompts into complete musical compositions that include both vocal and instrumental tracks. Unlike many shorter audio generators, the model is capable of producing songs that last several minutes while maintaining coherent musical structure and alignment with the provided lyrics. YuE introduces a...

Downloads: 10 This Week

Last Update: 1 day ago
See Project
24

DIO Lab

Repository for "Contributing to an Open Source Project on GitHub" lab

dio-lab-open-source is an educational repository created as part of the “Contributing to an Open Source Project” course offered by Digital Innovation One (DIO). The project serves as a practical learning environment where students can explore the fundamentals of contributing to open source software through GitHub. It provides hands-on experience with the Git workflow, including forking repositories, creating branches, submitting pull requests, and collaborating with other developers. The...

Downloads: 3 This Week

Last Update: 5 days ago
See Project
25

System Design Visualizer

An interactive tool that transforms static system design diagrams

...It provides a drag-and-drop canvas with reusable components representing servers, databases, APIs, message queues, and user clients, enabling stakeholders to create diagrams that describe data flows, dependencies, and interactions clearly and intuitively. Beyond drawing diagrams, the tool supports semantic relationships — meaning elements can have structured metadata, links, and annotations that explain behavior, constraints, and performance assumptions directly within the design. It’s particularly useful for collaborative design sessions, documentation, and architecture reviews, bringing visual clarity to topics that are often buried in text or code comments.

Downloads: 0 This Week

Last Update: 2026-02-05
See Project