23 projects for "data json" with 2 filters applied:

  • 1
    JSON_REPAIR

    A Python module to repair invalid JSON from LLMs

    json_repair is an open-source Python library designed to automatically fix malformed JSON data and convert it into valid, parseable structures. The tool is particularly useful in scenarios where JSON output is generated by large language models or external services that may produce syntactically invalid responses. Instead of failing when encountering errors such as missing quotes, trailing commas, or incomplete objects, the library analyzes the malformed data and reconstructs it into valid JSON.
    Downloads: 1 This Week
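
The kind of repair described above can be sketched with the standard library alone. This is a toy illustration, not json_repair's algorithm or API: the `naive_repair` helper is hypothetical and only handles trailing commas, unterminated strings, and missing closing brackets.

```python
import json
import re


def naive_repair(text: str):
    """Toy repair: close open strings and brackets, then drop trailing commas.

    Hypothetical helper for illustration only; it is not json_repair's
    algorithm and handles far fewer cases than the library does.
    """
    try:
        return json.loads(text)  # already valid, nothing to repair
    except json.JSONDecodeError:
        pass

    # Track open brackets, ignoring any that appear inside string literals.
    stack = []
    in_string = False
    escaped = False
    for ch in text:
        if in_string:
            if escaped:
                escaped = False
            elif ch == "\\":
                escaped = True
            elif ch == '"':
                in_string = False
        elif ch == '"':
            in_string = True
        elif ch in "{[":
            stack.append("}" if ch == "{" else "]")
        elif ch in "}]" and stack:
            stack.pop()

    if in_string:
        text += '"'  # close an unterminated string
    text += "".join(reversed(stack))  # close brackets, innermost first

    # Remove commas that sit directly before a closing bracket.
    # Limitation: this regex would also touch ",]" inside a string value.
    text = re.sub(r",\s*([}\]])", r"\1", text)
    return json.loads(text)


print(naive_repair('{"a": [1, 2,], "b": "x",'))  # {'a': [1, 2], 'b': 'x'}
print(naive_repair('{"name": "Al'))              # {'name': 'Al'}
```

Input that is still invalid after these fixes raises `json.JSONDecodeError`, which keeps the sketch honest about what it cannot repair.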
  • 2
    Profile Data

    Analyze computation-communication overlap in V3/R1

    profile-data is a repository that publishes profiling traces and metrics from DeepSeek’s training and inference infrastructure (especially during DeepSeek-V3 / R1 experiments). The profiling data targets insights into computation-communication overlap, pipeline scheduling (e.g. DualPipe), and how MoE / EP / parallelism strategies interact in real systems.
    Downloads: 0 This Week
  • 3
    Token-Oriented Object Notation

    Token-Oriented Object Notation (TOON)

    TOON is an open specification and toolkit for a data serialization format designed specifically to optimize how structured data is passed to large language models. The format aims to reduce token overhead compared with traditional formats like JSON while remaining human-readable and structurally expressive. TOON represents the same data model as JSON but removes unnecessary syntax such as braces and quotes, relying instead on indentation and structured tokens to represent objects and arrays. ...
    Downloads: 5 This Week
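
To make the idea of dropping braces and quotes in favor of indentation concrete, here is a minimal sketch. It uses a hypothetical, simplified notation and does not follow the TOON specification; the `to_indented` function is an assumption made for illustration, and character count stands in only roughly for token count.

```python
import json


def to_indented(value, indent=0):
    """Render dicts, lists, and scalars without braces or quotes.

    Hypothetical, simplified notation for illustration; it does not follow
    the TOON specification and cannot round-trip every JSON value.
    """
    pad = "  " * indent
    lines = []
    if isinstance(value, dict):
        for key, item in value.items():
            if isinstance(item, (dict, list)):
                lines.append(f"{pad}{key}:")
                lines.extend(to_indented(item, indent + 1))
            else:
                lines.append(f"{pad}{key}: {item}")
    elif isinstance(value, list):
        for item in value:
            if isinstance(item, (dict, list)):
                lines.append(f"{pad}-")
                lines.extend(to_indented(item, indent + 1))
            else:
                lines.append(f"{pad}- {item}")
    else:
        lines.append(f"{pad}{value}")
    return lines


data = {"user": {"id": 7, "name": "Ada"}, "tags": ["admin", "ops"]}
compact = "\n".join(to_indented(data))
as_json = json.dumps(data)
print(compact)
print(len(compact), "characters vs", len(as_json), "for JSON")
```

The saving comes entirely from removed punctuation, so it grows with the number of keys and string values in the payload.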
  • 4
    AI-Crawler

    Crawl a website starting from a URL, find relevant pages

    ...Users can define their data requirements in plain English, and the system will interpret those instructions to crawl a domain and extract structured data. The tool supports output formats such as JSON and Markdown, and it can generate or accept schemas to ensure that extracted data is structured according to application needs. It is designed as a low-code solution, reducing the complexity of building and maintaining custom scraping pipelines.
    Downloads: 4 This Week
  • 5
    LLM Scraper

    Extract structured data from webpages using LLM-powered scraping

    LLM Scraper is a TypeScript library designed to extract structured data from webpages using large language models. Instead of relying on fragile HTML selectors or manual parsing rules, the tool interprets webpage content with language models and converts it into structured data according to a defined schema. Developers can specify the data structure using tools such as Zod or JSON Schema, enabling the model to extract relevant information directly into typed objects. ...
    Downloads: 5 This Week
  • 6
    ChatGPT Exporter

    Export and Share your ChatGPT conversation history

    ...It integrates directly into the ChatGPT web environment, typically via tools like Tampermonkey, and adds export functionality without requiring backend services or complex setup. The tool supports a wide range of output formats including plain text, HTML, Markdown, JSON, and even image-based exports, making it suitable for documentation, knowledge management, and data analysis workflows. One of its key strengths is its ability to preserve formatting such as code blocks, tables, and structured responses, ensuring that exported content remains usable and readable. It also allows exporting entire conversations or selected portions, giving users flexibility depending on their needs.
    Downloads: 13 This Week
  • 7
    Puck

    Open source visual editor for building React drag-and-drop pages

    ...Puck integrates directly into existing React environments and works well with frameworks such as Next.js, enabling developers to embed editing capabilities directly inside their applications. It operates with a structured data model that stores content as a JSON payload, allowing the same configuration to power both the editing interface and the production rendering of pages. This approach gives developers full ownership of their content data while avoiding vendor lock-in.
    Downloads: 3 This Week
  • 8
    Browser Agent

    AI Browser Agent is an advanced Browser AI tool

    ...It also provides structured output formats such as JSON, HTML, Markdown, or screenshots, making it easy to integrate results into other systems or pipelines. Because it can interact with dynamic, JavaScript-heavy websites, it is suitable for modern web scraping and automation tasks.
    Downloads: 2 This Week
  • 9
    python-small-examples

    Focused on creating classic small Python examples and cases

    python-small-examples is an open-source educational repository that contains hundreds of concise Python programming examples designed to illustrate practical coding techniques. The project focuses on teaching programming concepts through small, focused scripts that demonstrate common tasks in data processing, visualization, and general programming. Each example highlights a specific function or programming pattern so that learners can quickly understand how to apply Python features in real-world scenarios. The repository includes examples covering topics such as file processing, JSON manipulation, data visualization, and library usage. ...
    Downloads: 0 This Week
  • 10
    FormCreate

    The easy-to-use Vue low-code visual AI form designer

    FormCreate is a low-code visual form builder built on Vue that enables developers to create complex, dynamic forms through a drag-and-drop interface rather than manual coding. It is part of the broader form-create ecosystem and leverages JSON-based schema generation to dynamically render forms, handle validation, and manage data collection workflows. The tool is designed to significantly reduce development time by allowing users to visually assemble forms while automatically generating the underlying configuration and logic. It supports multiple UI frameworks such as Element Plus and Vant, making it adaptable for both desktop and mobile applications. ...
    Downloads: 1 This Week
  • 11
    Chandra

    OCR model for complex documents with layout-aware structured outputs

    ...It focuses on preserving full document layout, meaning that extracted text is accompanied by positional metadata like bounding boxes for each element. Chandra supports multiple output formats including Markdown, HTML, and JSON, making it suitable for downstream processing and integration into data pipelines. It is capable of handling over 40 languages and is optimized to read difficult inputs such as messy handwriting and multi-column layouts. Chandra can be run locally using transformer-based inference or deployed with a high-performance server setup for large-scale processing. ...
    Downloads: 2 This Week
  • 12
    A2UI

    A Protocol for Agent-Driven Interfaces

    ...A key design principle of A2UI is security, as it avoids executing arbitrary code generated by models and instead restricts output to structured data that maps to a predefined catalog of trusted UI components. The system also supports incremental updates, allowing agents to progressively modify the interface as a conversation evolves.
    Downloads: 0 This Week
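
The design principle described above, accepting only structured data that maps to a predefined catalog instead of executing model-generated code, can be sketched as a small validator. The message shape, the `TRUSTED_CATALOG` contents, and the `validate_ui` function are all hypothetical and do not reflect A2UI's actual wire format.

```python
import json

# Hypothetical catalog: component type -> allowed property names.
TRUSTED_CATALOG = {
    "text": {"value"},
    "button": {"label", "action"},
}


def validate_ui(payload: str) -> list:
    """Parse agent output and reject anything outside the catalog."""
    components = json.loads(payload)
    if not isinstance(components, list):
        raise ValueError("expected a list of components")
    for comp in components:
        if not isinstance(comp, dict):
            raise ValueError("each component must be an object")
        kind = comp.get("type")
        if not isinstance(kind, str) or kind not in TRUSTED_CATALOG:
            raise ValueError(f"unknown component type: {kind!r}")
        # Any property not listed for this component type is refused.
        extra = set(comp) - {"type"} - TRUSTED_CATALOG[kind]
        if extra:
            raise ValueError(f"unexpected properties on {kind}: {sorted(extra)}")
    return components


ok = validate_ui('[{"type": "button", "label": "Save", "action": "save"}]')
print(ok[0]["label"])  # Save
```

Because the payload is only ever parsed as data and checked against an allowlist, an unrecognized type such as a script element is rejected before anything is rendered.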
  • 13
    TTRL

    Test-Time Reinforcement Learning

    TTRL is an open-source framework for test-time reinforcement learning in large language models, with a particular focus on reasoning tasks where ground-truth labels are not available during inference. The project addresses the problem of how to generate useful reward signals from unlabeled test-time data, and its central insight is that common test-time scaling practices such as majority voting can be repurposed into reward estimates for online reinforcement learning. This makes the...
    Downloads: 0 This Week
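
A minimal sketch of how majority voting over sampled answers can yield a reward signal without ground-truth labels. The binary 1.0/0.0 reward and the tie-breaking rule (first-seen answer wins) are assumptions made for illustration, and `majority_vote_rewards` is a hypothetical name, not a function from the TTRL codebase.

```python
from collections import Counter


def majority_vote_rewards(answers):
    """Return (pseudo_label, rewards) for a batch of sampled answers.

    Each answer that matches the most common answer gets reward 1.0,
    every other answer gets 0.0. Ties go to the answer seen first.
    """
    if not answers:
        raise ValueError("need at least one sampled answer")
    pseudo_label, _ = Counter(answers).most_common(1)[0]
    rewards = [1.0 if a == pseudo_label else 0.0 for a in answers]
    return pseudo_label, rewards


label, rewards = majority_vote_rewards(["42", "41", "42", "7", "42"])
print(label)    # 42
print(rewards)  # [1.0, 0.0, 1.0, 0.0, 1.0]
```

The rewards could then feed any policy-gradient style update; that step is omitted here because the listing does not describe it.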
  • 14
    DocStrange

    Extract and convert data from any document: images, PDFs, Word docs

    DocStrange is an open-source document understanding and extraction library designed to convert complex files into structured, LLM-ready outputs such as Markdown, JSON, CSV, and HTML. Developed by Nanonets, the project combines OCR, layout detection, table understanding, and structured extraction into one end-to-end pipeline, which reduces the need to stitch together multiple separate services. It is built for developers who need high-quality parsing from scans, photos, PDFs, office files,...
    Downloads: 0 This Week
  • 15
    TONL

    TONL (Token-Optimized Notation Language)

    TONL is a cutting-edge data platform built around a production-ready serialization format designed to be both compact and powerful, combining human readability with performance features that make it suitable for large-scale applications and AI workflows. It provides a serialization format that significantly reduces token usage compared with traditional JSON, which can result in lower costs and more efficient prompt size utilization in LLM-driven systems.
    Downloads: 0 This Week
  • 16
    ENScan Go

    ENScan_GO is an enterprise information reconnaissance tool

    ENScan_GO is an enterprise information reconnaissance tool focused on Chinese corporate data sources. It aggregates official and third-party APIs to pull records like ICP filings, affiliated/holding companies, apps, mini-programs, and WeChat official accounts, then exports merged results for analysis. The tool targets analysts who need one-click collection and normalized output to reduce manual lookups across registries and platforms. Recent releases added a reworked task model with...
    Downloads: 4 This Week
  • 17
    ChatGPT Retrieval Plugin

    The ChatGPT Retrieval Plugin lets you easily find personal documents

    The chatgpt-retrieval-plugin repository implements a semantic retrieval backend that lets ChatGPT (or GPT-powered tools) access private or organizational documents in natural language by combining vector search, embedding models, and plugin infrastructure. It can serve as a custom GPT plugin or function-calling backend so that a chat session can “look up” relevant documents based on user queries, inject those results into context, and respond more knowledgeably about a private knowledge...
    Downloads: 0 This Week
  • 18
    LangChain Extract

    Did you say you like data?

    LangChain Extract is an open-source reference application designed to demonstrate how large language models can be used to extract structured data from unstructured text and document files. The project implements a lightweight web service that allows developers to define extraction schemas and apply them to various sources such as plain text, HTML, or PDF documents. Built using FastAPI and the LangChain framework, the application exposes a REST API that can process documents and return structured outputs that match user-defined JSON schemas. ...
    Downloads: 1 This Week
  • 19
    Qwen2.5

    Open source large language model by Alibaba

    ...Trained on a comprehensive dataset of up to 18 trillion tokens, Qwen2.5 models exhibit significant improvements in instruction following, long-text generation (exceeding 8,000 tokens), and structured data comprehension, such as tables and JSON formats. They support context lengths up to 128,000 tokens and offer multilingual capabilities in over 29 languages, including Chinese, English, French, Spanish, and more. The models are open-source under the Apache 2.0 license, with resources and documentation available on platforms like Hugging Face and ModelScope. ...
    Downloads: 33 This Week
  • 20
    ReinventCommunity

    Jupyter Notebook tutorials for REINVENT 3.2

    This repository is a collection of useful Jupyter notebooks, code snippets, and example JSON files illustrating the use of REINVENT 3.2.
    Downloads: 0 This Week
  • 21
    HyperGAN

    Composable GAN framework with API and user interface

    A composable GAN built for developers, researchers, and artists. HyperGAN builds generative adversarial networks in PyTorch and makes them easy to train and share. HyperGAN is currently in pre-release and open beta. Everyone will have different goals when using HyperGAN, and we are still searching for a default cross-data-set configuration. Each of the examples supports search, and automated search can help find good configurations. If you are unsure, you can start with...
    Downloads: 0 This Week
  • 22
    Qwen2.5-VL-3B-Instruct

    Qwen2.5-VL-3B-Instruct: Multimodal model for chat, vision & video

    Qwen2.5-VL-3B-Instruct is a 3.75 billion parameter multimodal model by Qwen, designed to handle complex vision-language tasks in both image and video formats. As part of the Qwen2.5 series, it supports image-text-to-text generation with capabilities like chart reading, object localization, and structured data extraction. The model can serve as an intelligent visual agent capable of interacting with digital interfaces and understanding long-form videos by dynamically sampling resolution and frame rate. It uses a SwiGLU and RMSNorm-enhanced ViT architecture and introduces mRoPE updates for robust temporal and spatial understanding. The model supports flexible image input (file path, URL, base64) and outputs structured responses like bounding boxes or JSON, making it highly versatile in commercial and research settings. ...
    Downloads: 0 This Week
  • 23
    Qwen2.5-14B-Instruct

    Powerful 14B LLM with strong instruction and long-text handling

    ...Qwen2.5-14B-Instruct is built on a transformer backbone with RoPE, SwiGLU, RMSNorm, and attention QKV bias. It’s resilient to varied prompt styles and is especially effective for JSON and tabular data generation. The model is instruction-tuned and supports chat templating, making it ideal for chatbot and assistant use cases.
    Downloads: 0 This Week