Showing 17 open source projects for "text database"

  • 1
    Trafilatura

    Python & command-line tool to gather text on the Web

    Trafilatura is a Python package and command-line tool designed to gather text on the Web. It includes discovery, extraction, and text-processing components. Its main applications are web crawling, downloads, scraping, and extraction of main texts, metadata, and comments. It aims to stay handy and modular: no database is required, and the output can be converted to various commonly used formats. Going from raw HTML to essential parts can alleviate many problems related to text quality, first by avoiding the noise caused by recurring elements (headers, footers, links/blogroll, etc.) and second by including information such as author and date in order to make sense of the data. ...
    Downloads: 0 This Week
  • 2
    news-please

    Python tool for crawling and extracting structured data from news sites

    ...Developers can use the software either as a standalone command line application or integrate it into their own Python applications through its library interface. Extracted article data can be stored in different formats and systems, including JSON files or database-backed storage solutions.
    Downloads: 0 This Week
  • 3
    Buku

    Powerful command-line bookmark manager. Your mini web!

    buku is a powerful bookmark manager written in Python3 and SQLite3. buku fetches the title of a bookmarked web page and stores it along with any additional comments and tags. You can use your favourite editor to compose and update bookmarks. With multiple search options, including regex and a deep scan mode (particularly for URLs), it can find any bookmark instantly. Multiple search results can be opened in the browser at once. Though a terminal utility, it's possible to add bookmarks...
    Downloads: 25 This Week
  • 4
    SFM2Web reads text and database files encoded with SFMs (Standard Format Markers) and then generates a web site according to flags specified in control files. This is useful for web publication of MDF lexicons, USFM Bible books, texts, phrasebooks, etc.
    Downloads: 0 This Week
  • 5
    Microblog

    Microblogging application developed in my Flask Mega-Tutorial series

    Microblog is a Flask-based social blogging application that serves as the companion example project to the author’s Flask tutorial book, demonstrating how to build a production-ready web application with features typical of modern social systems. The project includes user registration and login, password hashing, profile editing, and avatar support, but it goes deeper by illustrating how to implement following/follower relationships, posting short messages, and displaying personalized feeds....
    Downloads: 0 This Week
  • 6
    huhamhire-hosts

    hosts for Internet Freedom

    Since the governments of some countries are blocking internet access to several websites and web service providers, including world-famous sites like Google, YouTube, Twitter, Facebook, and Wikipedia, we designed this tiny utility to help people get through the Internet blockade. Hosts Setup Utility provides basic tools to manage the hosts file on current operating systems. It also provides both support for Graphical Desktop environment with...
    Downloads: 1 This Week
  • 7
    the maker - Content Management System
    "the maker" is a Content Management System for websites. It runs client-side and interacts seamlessly with a remote FTP server. It features a powerful text editor, a built-in FTP client, and a tool for basic image manipulation.
    Downloads: 0 This Week
  • 8

    pyblosxom

    file-based blog system

    Pyblosxom is a weblog engine that uses standard text files located on your filesystem as a database of weblog entries. The concept is simple: managing content is as easy as editing a text file with your favorite text editor. Pyblosxom is now hosted at http://pyblosxom.github.io/
    Downloads: 0 This Week
  • 9
    Nucular Archiving System for creating full-text indices for fielded data. Python API, web, and command-line interfaces. Fast. Very lightweight. Concurrent reads/writes with no possible locking issues. No server process. Proximity. Facets. Funny name.
    Downloads: 0 This Week
  • 10
    A collection of Python scripts to create and handle an XML corpus (a large collection of text for linguistic purposes) from an original Wikipedia database backup dump. It includes a regular-expression-based parser for the MediaWiki markup language.
    Downloads: 0 This Week
  • 11
    Wets is a very flexible ETL tool that can handle data coming from text files, XML, MySQL, PostgreSQL, and Zope servers. Because of its associated language, there is no limitation on the transformation process. See the drop project for a GUI.
    Downloads: 0 This Week
  • 12
    An attempt at a Xanalogical Web, comprising a CGI system with both native and HTML interfaces, a set of standards, a backend supporting transclusion and versioning, and a custom browser with support for transpointing and editing.
    Downloads: 0 This Week
  • 13
    Eligante is software for archiving, managing, and browsing (with full-text search functions) all your communications, whether via email, chat (IRC, ICQ, MSN,...), or even messaging websites (hi5, orkut,...).
    Downloads: 0 This Week
  • 14
    NovaX is a set of programs that is being developed for small companies and beginners in the fields of HTML and programming. Coded in Python and C++, this is also a good replacement for MS Office. NOTE: This requires Python to be installed on your PC (Python.org).
    Downloads: 0 This Week
  • 15
    HTSQL is an HTTP access library to SQL databases, at http://htsql.org. It is implemented in Python and initially uses the PostgreSQL database. It has a do-as-you-please BSD-style license for use with open source databases.
    Downloads: 0 This Week
  • 16
    SchemaDoc is an XML-based markup language for documenting XML schemas. The work products include both the vocabulary and a set of tools for combining it with the schema source (e.g. a DTD) to produce documentation in HTML, XML DocBook, LaTeX, etc.
    Downloads: 0 This Week
  • 17
    Access to the Ontos API, a RESTful Web service that provides natural language processing of text and ontology support. See www.ontos.com, http://blog.ontos.com/?cat=11, and http://test.ontos.com/ontosApiDoc/
    Downloads: 0 This Week