Showing 440 open source projects for "gnu/linux"

View related business solutions
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    This project is about building a small search engine. This is done using the TREC-6 document collection as a basis to provide solid and reliable evaluation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Java program to extract postings and comments from http://www.livejournal.com (blog) into DB and view/classify/process it. LJ loader. Components to reuse: perl-like, but efficient Web pages scraper, trees analyzer, concurrent scheduler.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    The goal of the project is to guide developers in designing Web applications which uses various Opensource frameworks such as spring and hibernate etc to build a scaleable, efficient and reliable Web application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    J-Obey is a Java Library/package, which allows people writing their own crawlers to have a stable Robots.txt parser, if you are writing a web crawler of some sort you can use J-Obey to take out the hassle of writing a Robots.txt parser/intrepreter.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 5
    A web app for creating a repository of pictures (our focus is birds). Users submit pictures, with a wizard that generates RDF descriptiors. Sumissions are forwarded to Admins for aproval. Instances will export the RDF so that repositories may cooperate.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    SENTENSA Knowledge Miner is a platform independent tool for searching any text. SENTENSA uses robust methods of indexing and searching text, leveraging on experience from more than 20 years of information retrieval.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    TagHybrida is a French hybrid syntactic parser. TagHybrida is a four stage parser combining hand-writen and corpus based information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    (Almost) all a scholar in the Humanities needs (polytonic Greek fonts, stylistic and metrical analysis tools, search engines on TLG and PHI) concentrated in only one Linux Live CD, ready to use everywhere at home or at University, without installation
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Prototype for a framework and user interface for combining various structured search and document clustering techniques.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 10
    A configurable knowledge management framework. It works out of the box, but it's meant mainly as a framework to build complex information retrieval and analysis systems. The 3 major components: Crawler, Analyzer and Indexer can also be used separately.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    JaWiki is Java Wiki with a file based database to manage the Content. The content is stored in XML files in the file system. A html frontend allows to edit the content by the users via an Browser. A standalone server also included.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    A drop-in framework for adding tagging (folksonomy) capabilities to existing applications
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    The Batino Browser is the next generation rich web browser platform. It is based on Eclipse technology.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    This application generates an index of a website using information stored in the pages' meta tags.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Spidertron is a multithreaded web crawling API for web sites of moderate size (hundreds of thousands of pages) that allows you to focus not on the crawling but on processing of the information retreived.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Frutch is a French-speaking working group on the Nutch Open Source search engine. It provides some extensions to Nutch.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    JLinkCheck is an Ant Task written in Java for checking links in websites. It is not just checking one single page, but crawling a whole site like a spider, generating a report in XML and (X)HTML. JReptator will be its succesor with many more features
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    The project Navigator aims at supporting automated gathering of dynamic information from third party web sites, using their web interface to post queries and to gather replies. Navigator is written in OS-independent java language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Catalogo is a system for cataloguing resources on a web site. It allows semantic search of information on an intranet using metadata, RDF and ontology concepts. It provides a Catalog server (Java web applications) and a Catalog client (Firefox plug-in).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Toke is a webmining toolkit for web exploring, indexing and searching for Java. Toke allows to you crawl public or private web sites, in order to create web estatistics, web Pajek graphs, Lucene indexs and word frequency files for data clustering.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    This project aims to create a free and open catalog over music that is popular to day including links to audio files and websites, created with our search engine, as well as statistics over genres and artist popularity. Relased under GNU/GPL.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Searchy is a distributed metainformation search engine whose main goal is to federate search systems and integrate information. It uses RDF as abstract information model and may be used with Dublin Core, FOAF, vCard, etc
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    NWA Toolset, a software package for accessing archived web documents
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Roosster.org is a personal "on-demand" search engine. This means, it indexes only items/entries/files/URLs you explicitly tell it to index and provides a full-text-search over indexed items. Goto http://roosster.org/dev for all details.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    SmartCrawler is a java-based fully configurable, multi-threaded and extensible crawler, which is able to fetch and analyze the contents of a web site by using dinamically pluggable filters
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB