Showing 642 open source projects for "linux file parser"

  • 1
    Mind Guard Droid

    Mind Guard OS Made For Minds

    Have you ever wanted a mobile anti-mind-control device? Now you can have one, with Mind Guard Droid for Android. Complete Linux Installer is an application for Android that enables users to boot Mind Guard X OS from an Android device via chroot. The user interacts with Mind Guard X OS via VNC. Mind Guard is the Glasses.
    Downloads: 2 This Week
  • 2
    VnCoreNLP

    A Vietnamese natural language processing toolkit

    VnCoreNLP is a Java-based natural language processing toolkit tailored for Vietnamese. It offers a fast and accurate pipeline for essential NLP tasks, facilitating research and application development in Vietnamese language processing.
    Downloads: 0 This Week
  • 3
    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult-to-access resources. It is a command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, such as text within a document within a zip archive, and so on. Here you...
    Downloads: 3 This Week
  • 4

    WhisperBatchRun

    This batch file will run OpenAI's Whisper to transcribe (or translate)

    Currently, Version #1 of the batch file does the following: (1) checks whether whisper is installed and, if so, starts to run; (2) asks if you want to process sub-folders and, if an answer is not provided in 10 seconds, defaults to "N"; (3) applies the following command to each mp3 or wav file in the folder/sub-folders: "whisper "FILENAME" --model large-v2 --output_format vtt"; (4) creates a log file in the active directory, but only if there were any errors; (5) ends. To use, simply place...
    Downloads: 0 This Week
  • 5
    File-sharing-Bot

    Telegram Bot to store Posts and Documents

    Telegram Bot to store posts and documents and it can be accessed by special links.
    Downloads: 0 This Week
  • 6
    ChatGPT-bot

    Run your own GPTChat Telegram bot, with a single command

    A Go CLI that powers a Telegram bot that lets you interact with ChatGPT, a large language model trained by OpenAI.
    Downloads: 0 This Week
  • 7
    pdf-extractor

    Node.js module for rendering pdf pages to images, svgs and HTML files

    Pdf-extractor is a wrapper around pdf.js to generate images, svgs, html files, text files and json files from a pdf on node.js. A DOM Canvas is used to render and export the graphical layer of the pdf. Canvas exports *.png by default but can be extended to export to other file types like .jpg. Pdf objects are converted to svg using the SVGGraphics parser of pdf.js. Pdf text is converted to HTML. This can be used as a (transparent) layer over the image to enable text selection. Pdf text is also extracted to a text file for other uses (e.g. indexing the text). This library is, in its most basic form, a node.js wrapper for pdf.js. ...
    Downloads: 0 This Week
  • 8
    cerche

    Experimental search engine for conversational AI such as parl.ai

    This is an experimental search engine for conversational AI such as parl.ai, large language models such as OpenAI GPT3, and humans (maybe).
    Downloads: 0 This Week
  • 9
    CleanRL

    High-quality single file implementation of Deep Reinforcement Learning

    CleanRL is a Deep Reinforcement Learning library that provides high-quality single-file implementation with research-friendly features. The implementation is clean and simple, yet we can scale it to run thousands of experiments using AWS Batch. CleanRL is not a modular library and therefore it is not meant to be imported. At the cost of duplicate code, we make all implementation details of a DRL algorithm variant easy to understand, so CleanRL comes with its own pros and cons. You should...
    Downloads: 0 This Week
  • 10
    PerlPP

    Perl preprocessor - embed Perl source in any file

    Translates Text+Perl to Text. It can be used for any kind of text templating, e.g. code generation. No external modules are required, just a single file. Requires Perl 5.10.1+. PerlPP runs in two passes: it generates a Perl script from your input, and then it runs the generated script. If you see error at (eval ##) (for some number ##), it means there was an error in the generated script. The -D switch defines elements of %D. If you do not specify a value, the value true (a constant in the...
    Downloads: 0 This Week
  • 11
    Minimal text diffusion

    A minimal implementation of diffusion models for text generation

    A minimal implementation of diffusion models of text: it learns a diffusion model of a given text corpus, so that text samples can be generated from the learned model. The main idea was to retain just enough code to allow training a simple diffusion model and generating samples, remove image-related terms, and make it easier to use. To train a model, run scripts/train.sh. By default, this will train a model on the simple corpus. However, you can change this to any text file using the --train_data...
    Downloads: 0 This Week
  • 12
    AllenNLP

    An open-source NLP research library, built on PyTorch

    AllenNLP makes it easy to design and evaluate new deep learning models for nearly any NLP problem, along with the infrastructure to easily run them in the cloud or on your laptop. AllenNLP includes reference implementations of high quality models for both core NLP problems (e.g. semantic role labeling) and NLP applications (e.g. textual entailment). AllenNLP supports loading "plugins" dynamically. A plugin is just a Python package that provides custom registered classes or additional...
    Downloads: 0 This Week
  • 13
    Super-PDF-Editor-Lite

    World's most comprehensive, powerful, process-based PDF editor

    World's most comprehensive, powerful, process-based and lightning-fast PDF reader, editor and batch processor. Includes features like Create PDF from Images, HTML, Text files. Create a processing log file. Extract Page, Split Page, Rotate Page, Merge Page, Duplicate Page, Move Page, Printing, and Compress Page. Enhance images before the OCR operation for better OCR performance. PDF imposition, etc. Super PDF Editor is best for bulk PDF processing, especially for the printing industry....
    Downloads: 4 This Week
  • 14
    Pigo

    Fast face detection, pupil/eyes localization

    Pigo is a pure Go library for fast face detection, pupil/eyes localization and facial landmark points detection, based on the Pixel Intensity Comparison-based Object detection paper. Pigo was developed because almost all of the currently existing solutions for face detection in the Go ecosystem are purely bindings to some C/C++ libraries like OpenCV or dlib, but calling a C...
    Downloads: 0 This Week
  • 15
    Termin-Bot

    Telegram bot to get notifications about free appointments

    A Termin (English: appointment) is essential in Germany, especially in Berlin. It is sometimes tough to get a termin at public institutions, so this bot automates termin checks and sends Telegram notifications if a free termin is found. The code could also be considered a template for subsequent development. The bot works by periodically refreshing a webpage with the needed appointments [TERMIN_URL] and recognizing changes on it. E.g., the page always contains the...
    Downloads: 1 This Week
  • 16
    MANTI

    MANTI - Mastering Advanced N-Termini Interpretation

    MANTI is a one-stop-shop N-termini annotation & evaluation solution. MANTI was previously (un)known as muda.pl ahead of v3.7; the project was renamed to MANTI.pl with v3.7 on 2019-06-24. It congregates information from different MaxQuant or DiaNN/MSFragger output files into a master file suitable explicitly for protein neo-termini analyses. The central anchor for the data congregation is the modificationSpecificPeptides.txt or diann-output.pr_matrix.tsv file - additional data is inferred...
    Downloads: 1 This Week
  • 17
    AI Chatbots based on GPT Architecture

    Training & Implementation of chatbots leveraging GPT-like architecture

    Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations. It sure seems like there are a lot of text-generation chatbots out there, but it's hard to find a python package or model that is easy to tune around a simple text file of message data. This repo is a simple attempt to help solve that problem. ai-msgbot covers the practical use case of building a chatbot that sounds like you (or some dataset/persona you choose)...
    Downloads: 0 This Week
  • 18
    StudioGAN

    StudioGAN is a PyTorch library providing implementations of networks

    StudioGAN is a PyTorch library providing implementations of representative Generative Adversarial Networks (GANs) for conditional/unconditional image generation. StudioGAN aims to offer an identical playground for modern GANs so that machine learning researchers can readily compare and analyze a new idea. Moreover, StudioGAN provides an unprecedented-scale benchmark for generative models. The benchmark includes results from GANs (BigGAN-Deep, StyleGAN-XL), auto-regressive models (MaskGIT,...
    Downloads: 0 This Week
  • 19
    Fairseq

    Facebook AI Research Sequence-to-Sequence Toolkit written in Python

    Fairseq(-py) is a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling and other text generation tasks. We provide reference implementations of various sequence modeling papers. Recent work by Microsoft and Google has shown that data parallel training can be made significantly more efficient by sharding the model parameters and optimizer state across data parallel workers. These ideas are encapsulated in the...
    Downloads: 0 This Week
  • 20
    ModelFox

    ModelFox makes it easy to train, deploy, and monitor ML models

    ModelFox makes it easy to train, deploy, and monitor machine learning models. Train a model from a CSV file on the command line. Make predictions from Elixir, Go, JavaScript, PHP, Python, Ruby, or Rust. Learn about your models and monitor them in production from your browser. You can install the modelfox CLI by either downloading the binary from the latest GitHub release or by building from source. Train a machine...
    Downloads: 0 This Week
  • 21
    LayoutParser

    A Unified Toolkit for Deep Learning Based Document Image Analysis

    With the help of state-of-the-art deep learning models, Layout Parser enables extracting complicated document structures using only several lines of code. This method is also more robust and generalizable as no sophisticated rules are involved in this process. A complete instruction for installing the main Layout Parser library and auxiliary components. Learn how to load DL Layout models and use them for layout detection. The full list of layout models currently available in Layout Parser....
    Downloads: 0 This Week
  • 22
    edge-TTS-record

    Tool that can record speech synthesis

    edge-TTS-record is a Windows-based tool that records speech synthesized by the Microsoft Edge browser’s online TTS voices and saves the result as .wav audio files. The idea is simple but effective: since Edge’s online TTS voices (such as “Xiaoxiao” or “Yunyang” for Chinese) are often high-quality, this tool provides a way to “capture” them offline for later use. Users can type or paste text, preview the speech, and then trigger the recorder; the system automatically captures the audio output...
    Downloads: 2 This Week
  • 23
    VideoSrt

    Windows-GUI

    This is an open source Windows-GUI software tool that can recognize video speech and automatically generate subtitle SRT files. VideoSrt is written in the Go language and developed on the lxn/walk Windows-GUI toolkit. It is suitable for business scenarios that need to quickly batch-generate Chinese/English subtitles and text files for media (video/audio). Recognize video/audio speech to...
    Downloads: 30 This Week
  • 24
    The fastai book

    The fastai book, published as Jupyter Notebooks

    These notebooks cover an introduction to deep learning, fastai, and PyTorch. fastai is a layered API for deep learning; for more information, see the fastai paper. These notebooks are used for a MOOC and form the basis of this book, which is currently available for purchase. It does not have the same GPL restrictions that are on this repository. The code in the notebooks and python .py files is covered by the GPL v3 license; see the LICENSE file for details. The remainder (including all...
    Downloads: 0 This Week
  • 25
    The goal of this project is to investigate optimal ways to do genre classification for the ten indigenous South African languages. Funded by Dept of Arts and Culture of the SA Government. http://www.trifonius.co.za/projects/genre-classification
    Downloads: 0 This Week