NGSReadsTreatment, a computational tool for the removal of duplicated reads in paired-end or single-end datasets. NGSReadsTreatment can handle reads from any platform with the same or different sequence lengths. Using the probabilistic structure Cuckoo Filter, the redundant reads are identified and removed by comparing the reads with themselves. Thus, no prerequisite is required beyond the set of reads. NGSReadsTreatment was compared with other redundancy removal tools in analyzing different sets of reads. The results demonstrated that NGSReadsTreatment was better than the other tools in both the amount of redundancies removed and the use of computational memory for all analyses performed.
To execute the multithreaded version adding at the end of command line the number of threads that you will use for processing, it is worth mentioning that this number must be a power of 2.
Note: To run the new version use Java version 13.
Features
- java -jar NgsReadsTreatment_v1.3.jar SRR3615375_1.fastq SRR3615375_2.fastq 4 Note: The last number (4) indicates the number of threads.