2022 projects for "data file" with 1 filter applied:

  • 1
    Preswald

    Python tool for browser-based interactive data apps in one file

    Preswald is an open source Python-based framework and static-site generator designed for building interactive data applications that run entirely in the browser. It packages application logic, data processing, and user interface components into a single self-contained output, enabling easy sharing and deployment without requiring local dependencies. Preswald leverages a WebAssembly runtime along with technologies like Pyodide and DuckDB to execute Python code directly in the browser...
    Downloads: 0 This Week
  • 2
    Convert to it!

    Truly universal online file converter

    Convert to it! is a lightweight utility project designed to provide simple and fast file or data conversion workflows through a developer-friendly interface. The repository focuses on practical transformations between formats, making it useful in scripting pipelines, automation tasks, and small backend utilities where reliability matters more than heavy frameworks. Its structure emphasizes clarity and minimal dependencies so developers can quickly understand and extend the codebase. ...
    Downloads: 0 This Week
  • 3
    Tox

    A New Kind of Instant Messaging

    ...In today's world where digital surveillance is rampant, Tox offers users a communication software alternative that's free from prying eyes and ears, and is, quite literally, free and without advertising. Tox comes with all the great features you'd expect from an instant messenger application, including voice calls, video calls, file sharing and screen sharing. Everything done on Tox is encrypted using open-source libraries so you can be assured of completely private conversations. It also has no central servers that can be raided, shut down, or forced to turn over data. There are no corporate interests and no hidden agendas, just simple, secure and easy-to-use messaging.
    Downloads: 7 This Week
  • 4
    RustDesk Server PRO

    Some scripts for RustDesk Server Pro are hosted here

    ...It encompasses the components required to broker, relay, and supervise connections while adding administrative layers suitable for regulated or large environments. Organizations can standardize how clients authenticate, how sessions are permitted and audited, and how data paths are routed under diverse network constraints. The server emphasizes reliability for interactive use cases like remote control, file transfer, and support workflows, even when clients sit behind strict NAT or firewall rules. It aims to balance performance with privacy by letting companies retain custody over metadata and configuration within their own perimeter. ...
    Downloads: 13 This Week
  • 5
    DiscordChatExporter

    Saves Discord chat logs to a file

    DiscordChatExporter is a cross-platform utility that allows users to export Discord chat histories into structured files for archival, analysis, or backup purposes. The tool connects to Discord accounts and retrieves message history from servers, channels, or direct messages, then converts the data into formats such as HTML or JSON. It is designed to preserve message structure, timestamps, attachments, and formatting so exported conversations remain easy to navigate and review. The project...
    Downloads: 42 This Week
  • 6
    Uncloud

    A lightweight tool for deploying and managing containerised apps

    Uncloud is a self-hosted personal cloud and file synchronization platform that gives individuals full control over their data without relying on centralized third-party cloud providers. Designed to replace services like Dropbox or Google Drive for people who want privacy and ownership, uncloud lets you sync files across multiple devices—such as laptops, phones, or home servers—while keeping all contents under your own infrastructure.
    Downloads: 0 This Week
  • 7
    OpenForm

    Open Source TypeForm Clone

    ...It’s built with modern web technologies including Next.js and Tailwind CSS, offering developers a flexible, mobile-friendly foundation for surveys, feedback forms, and data collection, with support for authentication via Google OAuth and Magic Link. The platform includes a built-in dashboard to review, search, filter, and export responses, making it suitable for both simple and advanced data collection scenarios. OpenForm features multiple form themes and a rich variety of question types, such as rating scales, file uploads, and dropdowns, so form creators can design interactive experiences without heavy custom code. ...
    Downloads: 1 This Week
  • 8
    Python 100 Days

    Python - From Novice to Master in 100 Days

    Python-100-Days is a comprehensive, practice-first learning roadmap by Luo Hao that spans 100 days from absolute Python basics to professional, production-grade skills. It starts with foundational syntax, control flow, data structures, and functions, then advances through object-oriented programming, file I/O, exceptions, and modules. The middle sections focus on real-world Python applications, including working with CSV, Excel, Word, PowerPoint, PDFs, images, email/SMS, and regular expressions. The curriculum expands into databases and SQL, Linux essentials, web fundamentals, and a substantial Practical Django track that covers ORM, sessions, RESTful APIs, caching with Redis, asynchronous tasks with Celery, authentication, testing, and deployment. ...
    Downloads: 11 This Week
  • 9
    DynamicalSystems.jl

    Award winning software library for nonlinear dynamics timeseries

    DynamicalSystems.jl is an award-winning Julia software library for nonlinear dynamics and nonlinear time series analysis. To install DynamicalSystems.jl, run import Pkg; Pkg.add("DynamicalSystems"). To learn how to use it and see its contents visit the documentation, which you can either find online or build locally by running the docs/make.jl file. DynamicalSystems.jl is part of JuliaDynamics, an organization dedicated to creating high-quality scientific software. All implemented algorithms...
    Downloads: 0 This Week
  • 10
    Python

    The Python programming language

    ...CPython, the reference implementation, is developed and maintained by the Python Software Foundation and the global open-source community. The language includes a vast standard library that accelerates development by providing built-in modules for file handling, networking, data manipulation, and more. Python runs seamlessly across platforms such as Linux, macOS, and Windows, making it ideal for both development and production environments. With constant updates, optimizations, and an active community, Python continues to be one of the most widely adopted languages worldwide.
    Downloads: 30 This Week
  • 11
    Apache Drill

    Apache Drill is a distributed MPP query layer for self describing data

    ...Drill supports a variety of NoSQL databases and file systems, including HBase, MongoDB, MapR-DB, HDFS, MapR-FS, Amazon S3, Azure Blob Storage, Google Cloud Storage, Swift, NAS and local files. A single query can join data from multiple datastores. For example, you can join a user profile collection in MongoDB with a directory of event logs in Hadoop.
    Downloads: 0 This Week
  • 12
    TanStack Router

    Fully typesafe Router for React (and friends) w/ built-in caching

    TanStack Router is a modern, fully type-safe routing library designed for React and Solid applications, offering robust routing capabilities with first-class support for data loading, search parameters, nested layouts, and seamless integration with client-side caching libraries like TanStack Query. End-to-end type safety (routes, params, loaders). Schema-driven search params with validation. Built-in caching, prefetching & invalidation. Nested layouts, transitions & error boundaries.
    Downloads: 2 This Week
  • 13
    log4rs

    A highly configurable logging framework for Rust

    log4rs is a highly configurable logging framework modeled after Java's Logback and log4j libraries. If you are using the file rotation in your configuration there is a known substantial performance issue so listen up! By default the gzip feature is enabled and when rolling files it will zip log archives automatically. This is a problem when the log archives are large as the zip happens in the main thread and will halt the process while the zip is completed. Be advised that the gzip feature...
    Downloads: 2 This Week
  • 14
    gTTS

    Python library and CLI tool to interface with Google Translate

    gTTS (Google Text-to-Speech) is a Python library and command-line tool that wraps the speech functionality of Google Translate. It lets you send text to the Google Translate TTS endpoint and receive spoken audio back as MP3 data, either written to a file, a file-like object, or standard output. The library is designed to handle long texts, using a speech-specific sentence tokenizer that keeps intonation and punctuation natural while splitting requests into acceptable chunks. It supports customizable text pre-processors, which can correct pronunciations, tweak formatting, or handle domain-specific vocabulary before sending it to the API. gTTS is primarily aimed at developers who want a quick way to add cloud-backed speech to scripts, apps, or pipelines without managing any model weights locally. ...
    Downloads: 6 This Week
  • 15
    E2M

    E2M converts various file types (doc, docx, epub, html, htm, url

    E2M is a SourceForge mirror of the e2m open-source project, which focuses on providing tools or services designed to convert or process content between different formats or systems. Projects with similar naming conventions typically emphasize automation workflows where input data from one environment is transformed into another representation or output structure. The mirrored repository allows users to access the project’s codebase independently from its original hosting platform while...
    Downloads: 0 This Week
  • 16
    Smallpond

    A lightweight data processing framework built on DuckDB and 3FS

    smallpond is a lightweight distributed data processing framework built by DeepSeek, designed to scale DuckDB workloads over clusters using their 3FS (Fire-Flyer File System) backend. The idea is to preserve DuckDB’s fast analytics engine but lift it from single-node to multi-node settings, giving you the ability to operate on large datasets (e.g. petabyte scale) without moving to a heavyweight system like Spark.
    Downloads: 0 This Week
  • 17
    Bionic GPT

    Bionic is an on-premise replacement for ChatGPT

    Bionic is an on-premise generative AI platform positioned as a private replacement for ChatGPT, with a strong emphasis on data confidentiality, team collaboration, and enterprise deployment. It can run locally on a laptop for small pilots, but it is also designed to scale into data center and Kubernetes environments for much larger usage. The interface is intentionally familiar, offering a ChatGPT-like experience with customizable branding, fast Rust-based performance, and conversation...
    Downloads: 1 This Week
  • 18
    Harpoon

    Command line OSINT and threat intelligence automation tool

    ...This design allows users to automate many reconnaissance and intelligence gathering tasks directly from the terminal. Harpoon integrates with numerous security and data services such as Shodan, VirusTotal, AlienVault OTX, and many other intelligence providers to retrieve information about domains, IP addresses, emails, and other indicators. Many commands rely on API keys that can be configured through a central configuration file, allowing users to connect their own intelligence accounts and data sources.
    Downloads: 4 This Week
  • 19
    Christmas Tree

    Just a simple Christmas tree, based on reddit story

    atree is a lightweight JavaScript library for managing and visualizing hierarchical tree data structures in a flexible and efficient way. It offers core abstractions for creating nodes, building parent–child relationships, and traversing trees with utilities for depth, breadth, and custom iteration strategies. Because tree structures are ubiquitous in computing—from file systems and organizational charts to DOM structures and evolutionary taxonomies—atree provides a reusable foundation for representing and interacting with nested data.
    Downloads: 0 This Week
  • 20
    Osquery

    SQL operating system instrumentation and monitoring framework

    ...The tools make low-level operating system analytics and monitoring both performant and intuitive. Osquery exposes an operating system as a high-performance relational database. This allows you to write SQL queries to explore operating system data. With osquery, SQL tables represent abstract concepts such as running processes, loaded kernel modules, open network connections, browser plugins, hardware events or file hashes. Osquery queries your devices like a database. Osquery uses basic SQL commands to leverage a relational data-model to describe a device. Frequently, attackers will leave a malicious process running but delete the original binary on disk. ...
    Downloads: 11 This Week
  • 21
    Output

    TypeScript framework for building AI workflows and agents

    Output is an open-source TypeScript framework designed to build, orchestrate, and manage AI workflows and agents within a single unified system. It consolidates multiple aspects of AI development, including prompt management, evaluation, tracing, cost tracking, and orchestration, into a file-based architecture that lives entirely within the codebase. The framework is built specifically to work with AI coding agents, enabling them to read, modify, and execute workflows directly from structured project folders. It eliminates reliance on fragmented SaaS tools by providing all necessary components locally, ensuring better transparency and control over data and processes. ...
    Downloads: 5 This Week
  • 22
    TwelveMonkeys

    Additional plug-ins and extensions for Java's ImageIO

    TwelveMonkeys ImageIO provides extended image file format support for the Java platform, through plugins for the javax.imageio.* package. The main goal of this project is to provide support for formats not covered by the JRE itself. Support for these formats is important, to be able to read data found "in the wild", as well as to maintain access to data in legacy formats. As there is lots of legacy data out there, we see the need for open implementations of readers for popular formats.
    Downloads: 0 This Week
  • 23
    Plaso

    Super timeline all the things

    Plaso (Plaso Langar Að Safna Öllu), or "super timeline all the things," is a Python-based engine designed for automatic creation of timelines in digital forensic investigations. It processes various log files and artifacts to generate a chronological sequence of events, aiding analysts in understanding system activities.
    Downloads: 5 This Week
  • 24
    Groq TypeScript / Node.s

    The official Node.js / Typescript library for the Groq API

    Groq TypeScript / Node.s (also often referred to as “groq-sdk” on npm) is the official Node.js / TypeScript client library for Groq’s REST API, enabling JavaScript/TypeScript developers to integrate LLM and AI-powered services into web backends, serverless functions, or frontend apps. It exports strongly-typed interfaces for models, chat completions, file uploads (e.g. for audio transcription), and other endpoints, allowing for better type safety and developer experience when using Groq from...
    Downloads: 2 This Week
  • 25
    Corcel

    Use WordPress backend with Laravel or any PHP application

    Corcel is a collection of PHP model classes built on top of the Eloquent ORM (from the Laravel framework) that provides a fluent interface to connect to a WordPress database and get data from it directly. You can use WordPress as the backend (administration panel) or CMS for inserting posts, custom types, etc., while any other PHP app on the other side queries that data (as a Model layer). It's easier to use Corcel...
    Downloads: 0 This Week