216,000+ customers in over 135 countries grow their businesses with HubSpot
HubSpot is an AI-powered customer platform with all the software, integrations, and resources you need to connect your marketing, sales, and customer service. HubSpot's connected platform enables you to grow your business faster by focusing on what matters most: your customers.
Software and support solutions to protect your clients’ critical business assets
ConnectWise SIEM (formerly Perch) offers threat detection and response backed by an in-house Security Operations Center (SOC). Defend against business email compromise, account takeovers, and see beyond your network traffic. Our team of threat analysts does all the tedium for you, eliminating the noise and sending only identified and verified treats to action on. Built with multi-tenancy, ConnectWise SIEM helps you keep clients safe with the best threat intel on the market.
wordseg project is a word segment module implemented by C#
wordseg project is a word segment module implemented by C#. It is used to segment text into tokens and to label token's attribute according its context and semantic by front-maximum matching and CRF algorithms.
The following are some sentences need to be segmented:
张晓晨和付仲恺一起坐在家(西坝河东里社区)里的沙发上看非诚勿扰。
百度公司的名字源于“众里寻他千百度”这诗句。
After above sentences be segmented by wordseg, the result as follows for each sentence:
张晓晨[PER] 和 付仲恺[PER] 一起 坐 在 家 ( 西坝河东里社区[LOC] ) 里 的 沙发[PDT] 上 看 非 诚 勿扰 。
百度公司...
Integra desktop search engine - cross-platform, portable, embeddable, extensible search engine and search tools (both console and GUI) for developers and end-users. Full unicode support, easy UI localisation, plugins for document parsers and unpackers.
"Swish-e is a fast, flexible, and free open source system for indexing collections of Web pages or other files" (http://swish-e.org/ ) This module provides a Python API for this software.
MRS is a tool to quickly and easily store and index large flat file databanks and in a space efficient manner. It is currently used to index huge bioinformatics databanks but it is not limited to this area.
This project aims to build a suite of Natural Language Processing tools. Modules will include corpus indexing and access tools, a part-of-speech tagger, tokenisers, text classification software, etc.
HXPath is a command line tool useful to extract data from HTML documents. HXPath can select sub trees, like the standard xpath tool, but is also able to read contents and attributes and output them in a bash friendly format. HTML Tidy and HTTP/HTTPS get are built in too.
This program aims search an index of the Computer science forum http://tud.hicknhack.org/forum/ , which is very useful and at the same time distracting for many students at the Dresden University of Technology.
High performance distributed in-memory key/value store
Infinispan is an open source, Java based data grid platform. ***IMPORTANT*** Starting with Infinispan 5.0.0.FINAL, Infinispan releases are no longer hosted in Sourceforge. They can now be located in www.jboss.org/infinispan/downloads
RIG is a web-based JPEG image album viewer, especially useful for digital camera albums; provides automatic image resizing, preview & thumbnail caching, user authentication; composed of a PHP web interface and a C++ thumbnail engine.
Lurker is a mailing list archiver designed for capacity, speed, simplicity, and configurability in that order.
Noteworthy features include: google-style searching on all fields, chronology preserving threads, multilingual, and attachment support.
A function-testing, performance-measuring, site-mirroring, web spider that is widely portable and capable of using scenarios to process a wide range of web transactions, including ssl and forms.
The censorship tools are a collection of bash scripts for a) comparing DNS server answers to get the blacklist from the censored server, b) downloading censored URLs and c) other stuff like open all censored pages of a blacklist with a browser.
ht://Check is more than a link checker. It's particularly suitable for checking broken links, anchors and web accessibility barriers, but retrieved data can also be used for Web structure mining. Uses a MySQL backend. Derived from ht://Dig.
Voxound Extension is a daemon application used to provide additional local content access and management functionality to the voxound.com web application.
OpenFTS (Open Source Full Text Search engine) is an advanced PostgreSQL-based search engine
that provides online indexing of data and relevance ranking for
database searching. Close integration with database allows use of
metadata to restrict search re
Sgrep (sorted grep) is a much faster alternative to traditional Unix grep when searching large files, because sgrep searches sorted input files using a fast binary search to find matching lines.
A threaded C application that searches torrent trackers/indexers for .torrent files and sorts the results according to user defined criteria. Uses glib2.0 and libcurl4
A utility to extract meta-information (properties/comments) out of various file-types; e.g. HTML, PDF, RTF & various Office documents; OGG/MP3 files and JPEG/PNG/GIF images, which can be presented in various output formats (HTML, XML, LaTeX & plain t