Showing 20 open source projects for "structured text"

View related business solutions
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 1
    LangExtract

    LangExtract

    A Python library for extracting structured information

    LangExtract is a Python library developed by Google that leverages large language models (LLMs) to extract structured information from unstructured text—such as clinical notes, research papers, or literary works—based on user-defined instructions. It is designed to transform free-form text into reliable, schema-constrained data while maintaining traceability back to the source material. Each extracted entity is precisely grounded in its original context, allowing visual inspection and validation via automatically generated interactive HTML visualizations. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    xAI Python SDK

    xAI Python SDK

    The official Python SDK for the xAI API

    ...It is a gRPC-based SDK designed for Python 3.10 and above, with both synchronous and asynchronous clients for different application styles. Developers can use it to generate text, images, videos, and structured outputs through xAI’s model services. The package is built for direct integration into Python projects, making it useful for backend apps, automation scripts, AI tools, research prototypes, and production workflows. It uses xAI’s native gRPC interface, which is intended for high-performance communication with the API. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    o1-engineer

    o1-engineer

    o1-engineer is a command-line tool designed to assist developers

    o1-engineer is a command-line development assistant powered by OpenAI’s API. It helps developers interact with projects through commands for code generation, file editing, project planning, and code review. The tool can add, edit, and manage both files and folders directly from the terminal. Its planning command can create structured project plans that can then guide systematic file and directory generation. It also keeps conversation history and allows users to save or reset context as...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    PPT Builder Skill

    PPT Builder Skill

    AI-friendly PPT builder skill: 17 hand-polished Chinese PPTX templates

    PPT Builder Skill is an AI-friendly PowerPoint builder skill designed to create editable native PPTX presentations from structured content. It includes polished Chinese presentation templates and uses python-pptx-based tools to preserve layout while applying controlled text edits. The skill supports workflows where an agent selects a template, writes an edits file, and produces a real PowerPoint file with the original design intact. It can also work with user-provided templates by inspecting slide images and shape structures before applying non-destructive changes. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 5
    RAG Anything

    RAG Anything

    RAG-Anything: All-in-One RAG Framework

    RAG-Anything is an open-source unified framework that extends the Retrieval-Augmented Generation (RAG) paradigm to fully multimodal document and knowledge retrieval, enabling systems to ingest, parse, represent, and query rich content that includes text, images, tables, formulas, and other structured or visual elements. Traditional RAG systems are typically limited to text and cannot effectively work across heterogeneous document layouts, but RAG-Anything addresses this by modeling multimodal content in ways that preserve cross-modal relationships and semantic context, often treating content elements as interconnected knowledge entities rather than separate data silos. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Claude Cookbooks

    Claude Cookbooks

    A collection of notebooks/recipes showcasing ways of using Claude

    ...It serves as both a learning resource and a reference library, helping developers understand how to apply AI capabilities such as classification, summarization, and retrieval-augmented generation in real-world scenarios. The repository includes structured examples for integrating Claude with external tools, databases, and APIs, showcasing how to extend its functionality beyond basic text generation. It also covers advanced techniques like sub-agent orchestration, prompt optimization, and automated evaluation workflows. The content is organized into thematic sections, allowing users to explore specific capabilities or integration patterns systematically. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Atheris

    Atheris

    A Coverage-Guided, Native Python Fuzzer

    ...The tool integrates smoothly with Python’s packaging and unit-test ecosystems, so you can wrap existing tests as fuzz targets and keep results understandable. It supports structured input strategies and custom mutators, which is especially helpful for text and data formats common in Python workloads. In practice, Atheris compresses weeks of edge-case brainstorming into hours of automated exploration with actionable, minimized reproductions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Django Wiki

    Django Wiki

    A wiki system with complex functionality for simple integration

    A wiki system with complex functionality for simple integration and a superb interface. Store your knowledge with style: Use django models. Readability, however, is emphasized above all else. A Markdown-formatted document should be publishable as-is, as plain text, without looking like it's been marked up with tags or formatting instructions. While Markdown's syntax has been influenced by several existing text-to-HTML filters -- including Setext, atx, Textile, reStructuredText, Grutatext,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    PaperSpine

    PaperSpine

    Motivation-driven skill for learning from strong academic papers

    ...It is best suited for users who need format-aware, evidence-aware academic writing support rather than generic text generation.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Stop vibe-debugging. Icon
    Stop vibe-debugging.

    Plug Claude into your app's actual errors.

    AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.
    Free 30 days.
  • 10
    GPT-Image2-Skill

    GPT-Image2-Skill

    GPT Image 2 prompt gallery, image prompt library, agentic skill

    GPT-Image2-Skill is a prompt gallery, image prompt library, agent skill, and CLI for OpenAI image generation and editing workflows. It collects curated prompt examples with generated outputs so users can reuse strong visual patterns instead of starting from scratch. The project includes categories such as anime, gaming, cyberpunk, animation, character design, typography, illustration, watercolor, ink, pixel art, isometric scenes, product visuals, and food imagery. It can be installed as an...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    txtai

    txtai

    Build AI-powered semantic search applications

    ...Innovation is happening at a rapid pace, models can understand concepts in documents, audio, images and more. Machine-learning pipelines to run extractive question-answering, zero-shot labeling, transcription, translation, summarization and text extraction. Cloud-native architecture that scales out with container orchestration systems (e.g. Kubernetes). Applications range from similarity search to complex NLP-driven data extractions to generate structured databases. The following applications are powered by txtai.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Software Copyright Materials Skill

    Software Copyright Materials Skill

    Skills, a Chinese software copyright application material generator

    ...The skill reads the real project, guides the user through key confirmations, and produces organized materials that can be reviewed and edited locally. It can generate application-form reference information, an operation manual, and source-code materials in Word and text formats. The project is designed to avoid invented code by extracting only from the user’s existing source files. It is especially useful for developers who want control over sensitive project details while still producing structured, submission-ready drafts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    sqlite-utils

    sqlite-utils

    Python CLI utility and library for manipulating SQLite databases

    ...It focuses on making common tasks like importing CSV/JSON, exploring tables, and running ad-hoc queries feel ergonomic and scriptable. As a CLI, it lets you build databases from structured data in one line, run queries against local files or in-memory databases, output results as JSON, CSV, or pretty tables, and configure full-text search. As a library, it exposes high-level APIs for inserting records, creating or transforming tables, normalizing schemas, and running migrations that SQLite’s limited ALTER TABLE cannot handle directly. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    PageIndex

    PageIndex

    Document Index for Vectorless, Reasoning-based RAG

    PageIndex is an innovative open-source framework that reimagines retrieval-augmented generation (RAG) by eliminating conventional vector similarity search and instead building hierarchical semantic indexes that mirror a document’s natural structure. Rather than chunking text and embedding it into a vector database, PageIndex constructs a tree-structured index — similar to a detailed, AI-enhanced table of contents — that a large language model can traverse to locate the most relevant sections of long documents. This reasoning-driven retrieval aligns more naturally with how humans explore complex texts, improving relevance and traceability, especially in professional domains like financial reports, legal contracts, and technical manuals. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Gretel Synthetics

    Gretel Synthetics

    Synthetic data generators for structured and unstructured text

    Unlock unlimited possibilities with synthetic data. Share, create, and augment data with cutting-edge generative AI. Generate unlimited data in minutes with synthetic data delivered as-a-service. Synthesize data that are as good or better than your original dataset, and maintain relationships and statistical insights. Customize privacy settings so that data is always safe while remaining useful for downstream workflows. Ensure data accuracy and privacy confidently with expert-grade reports....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    PROJECT MOVED TO https://github.com/paulhtremblay/rtf2xml The script rtf2xml faithfully converts Microsoft's RTF format to structured XML. Developers can make further transformations using standard XML tools, or use the stylsheets provided to convert to sdocbook or TEI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    SVNStartCommitHelper

    Useful form to support SVN Commits as an SVN Start-Commit Hook Script

    ...E.g. by usage of server side commit hooks to check for minimum acceptance levels on code and documentation quality including commit message structure and content. TortoiseSVN offers only a free form text field to edit inside the Commit Dialog. Developers might recall situations when struggling with commit message structure and fighting the server side commit hooks instead of focusing on message content! Thus being annoyed instead of feeling an incentive to deliver high quality descriptions here. The SVNStartCommitHelper is a client side start commit hook script (as a first version written in Python / Tkinter) exactly offering a well-structured form to fill in. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Nested Editor

    Nested Editor

    Specialized editor for structured documents.

    Nested is a specialized editor focused on creating structured documents such as reports, publications, presentations, books, etc. It is designed to help the user concentrate on writing content without been distracted by format or markup. It offers a rich WYSIWYM interface where the user writes plain text with a lightweight markup language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    FramerD is a distributed semi-structured object database originally developed at MIT. It provides an internationalized Scheme-based scripting language, built-in text analysis tools, and special support for web scripting.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    A modular system for extracting and converting Python docstrings into useful structured formats like HTML, XML, and TeX. Project inactive. Development taken over by Docutils, http://docutils.sourceforge.net/.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo