Showing 436 open source projects for "java open source"

  • 1
    A configurable knowledge management framework. It works out of the box, but it is meant mainly as a framework for building complex information retrieval and analysis systems. The three major components (Crawler, Analyzer, and Indexer) can also be used separately.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    (Almost) all a scholar in the Humanities needs (polytonic Greek fonts, stylistic and metrical analysis tools, search engines on TLG and PHI) concentrated in only one Linux Live CD, ready to use everywhere at home or at University, without installation
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Prototype for a framework and user interface for combining various structured search and document clustering techniques.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    JaWiki is a Java wiki with a file-based database for managing content. The content is stored in XML files in the file system. An HTML front end lets users edit the content in a browser. A standalone server is also included.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 5
    A drop-in framework for adding tagging (folksonomy) capabilities to existing applications
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    The Batino Browser is the next generation rich web browser platform. It is based on Eclipse technology.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Spidertron is a multithreaded web crawling API for web sites of moderate size (hundreds of thousands of pages) that lets you focus not on the crawling but on processing the information retrieved.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Frutch is a French-speaking working group on the Nutch Open Source search engine. It provides some extensions to Nutch.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    This application generates an index of a website using information stored in the pages' meta tags.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 10
    This project aims to create a free and open catalog of music that is popular today, including links to audio files and websites, created with our search engine, as well as statistics on genres and artist popularity. Released under the GNU GPL.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    JLinkCheck is an Ant task written in Java for checking links in websites. It does not just check a single page but crawls a whole site like a spider, generating a report in XML and (X)HTML. JReptator will be its successor, with many more features
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    The Navigator project aims to support automated gathering of dynamic information from third-party web sites, using their web interfaces to post queries and gather replies. Navigator is written in Java and is OS-independent.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Toke is a web mining toolkit for Java for web exploring, indexing, and searching. Toke allows you to crawl public or private web sites in order to create web statistics, web Pajek graphs, Lucene indexes, and word frequency files for data clustering.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Catalogo is a system for cataloguing resources on a web site. It allows semantic search of information on an intranet using metadata, RDF and ontology concepts. It provides a Catalog server (Java web applications) and a Catalog client (Firefox plug-in).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Roosster.org is a personal "on-demand" search engine. This means it indexes only the items/entries/files/URLs you explicitly tell it to index and provides full-text search over the indexed items. Go to http://roosster.org/dev for all details.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Thenali is a content management system project that aims to support the publication and maintenance of educational and career counselling information websites.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Searchy is a distributed metainformation search engine whose main goal is to federate search systems and integrate information. It uses RDF as its abstract information model and may be used with Dublin Core, FOAF, vCard, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    SmartCrawler is a Java-based, fully configurable, multi-threaded, and extensible crawler that can fetch and analyze the contents of a web site using dynamically pluggable filters
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    iCalGrabber is a Java-based application for grabbing event information from web sites. The events are stored on the filesystem in Apple's iCal format. These .ics files can be read by iCal-compatible applications such as the Mozilla calendar.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Sperowider Website Archiving Suite is a set of Java applications, the primary purpose of which is to spider dynamic websites, and to create static distributable archives with a full text search index usable by an associated Java applet.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    JMdRdf is a tool that creates RDF/RSS. 1. You can generate RDF/RSS about your homepage from your HTML file(s) without programming. JMdRdf automatically extracts information such as the title, description, etc. from the HTML. 2. You can paste RDF/RSS into your HTML
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    myDbSearcher is a search engine for MySQL databases. It is written in Java. It scans several tables on different databases. An XML-RPC server gives you access to the index. Currently it runs on http://www.idowa.de/ueberblick/suche/index_html
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Robust, feature-rich, multi-threaded CLI web spider written in Java using Apache Commons HttpClient v3.0. ASpider downloads any files matching your given MIME types from a website. By default it tries to match email addresses with a regular expression, logging all results using log4j.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Lude is an XML-RPC Lucene Daemon written in Java. Clients in any environment can create indexes, add/update/delete documents, and query the index through a simple XML-RPC API.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    HouseSpider is a Java applet that adds search capability to your web site. It can search by two methods: spidering through your site or searching a cached index file. It has 100% i18n (internationalization) support.
    Downloads: 0 This Week
    Last Update:
    See Project