jWords is a port of WORDS (by William Whitaker, a free latin-to-english dictionary program written in Ada), to Java. Besides the dictionary will be translated to the German language.
The Digital Communication Systems Lib: This library contains digital communucations systems implementations, such as BSPK and MIMO - WOFDM. Various OpenSource Projects: 1. Password/Username/Domain Analyzer
A program for cropping GPX track files. The GUI plots the GPX track (latitude vs longitude) and lets the user remove GPS points from the beginning and end of the track respectively. There is also a built-in Command Line Interface.
Note: No development anymore since I found out that GpsPrune does all the things I was looking for.
PileLine (Pileup pipeLine) is a toolkit for efficient handling of genomic position (GP) files, produced by next-generation sequencing experiments. It is designed to be memory efficient by performing on-disk operations over sorted GP files.
Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
The program creates OWL ontology files that describe relationships between entities. Basis are definitions found by searching Wikipedia articles for specific lexico-syntactic patterns.
jLlama is a desktop application to monitor servers over SSH. Any figure retrieved from the command line can be polled and graphed in real time. Out of the box, jLlama can graph CPU and Memory usage for Linux and Solaris servers.
ELIA(Eyegaze Language Integration Analysis) supports the analysis of eye-tracking data for studies in language processing. ELIA eases early analysis of data to enable iterative development of experiments in response to spoken language.
Soaplab is a generator of Web Services providing a programmatic access to command-line (and other) applications on remote computers (an example of its usage is available at http://www.ebi.ac.uk/soaplab/)
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
A set of software utilities for processing and analyzing 16S rRNA genes including generating NAST alignments, chimera checking, and assembling paired 16S rRNA reads according to reference sequence homology.
Cca-forum unifies the Common Component Architecture tools and tutorial. It includes the CCA specifications, the Ccaffeine framework for HPC, and related tools. These support multilanguage scientific and parallel computing.
CORPSE (CORPus SEarch) is a powerful search engine written in Java. The aim is to provide an efficient implementation of a word level inverted index search with various cool functions that can be used on very large corpora.
Java program to create a (potentially multilingual) glossary of the unique words in any given Lojban text.
Note that the Sourceforge page for this was superceded by the Bitbucket repository: https://bitbucket.org/pretoriusjf/vlastezba/overview
Any further updates will be made there.
This is a multi-threaded Java program that tries to maximise CPU utilization by separating and distributing the computational part of the work from the file reading and writing part by using separate threads.
MediaWikiRevisionsExtractor extracts the history of a particular wiki page, computes the modifications made between each revisions and finally, stores the whole set of modifications in a file.
Parsers for biological data based on scanner generators like Flex (C), Re2c(C), Jflex (Java) and Ifickle (Tcl). This scanner generators are providing easier maintainance, development and higher speed than hand written scanners. Scanner output is SQL.
PhyloSort sorts phylogenetic trees by searching for user-specified subtrees that contain a monophyletic group of interest defined by operational taxonomic units.
HATS (Haplotype Amplification in Tumor Sequences) is a tool that calls the amplified alleles, and thus amplified haplotype, in copy number aberration regions in next generation sequencing tumor data. The amplified haplotype may reveal gene variants.
Enrich and query corpora in the TEI-XML vocabulary. CorpusReader manage very large corpora and corpora containing milestone annotation. It provides tools for enriching corpora with output of linguistic parsers, and for extracting quantitative information
A program to calculate and create lists of possible variations (permutations) and selections (combinations) of a given set of elements. Creating the cartesian product of two sets is also possible.
Cougar Squared is a new Java library for machine learning and data mining research, supporting research needs of the community. It is written by researchers for researchers. It extends the WEKA and YALE machine learning frameworks.