Hi, may I know what is the default read coverage cut-off parameter for SV calling? And if I am able to adjust it? Thanks Tham
Hi Sitan, When calculating breakpoints supporting reads, we allow a [-1, +1] shift around the junction to alleviate alignment ambiguity. This may overestimate the split reads around breakpoints. But estimation of SV frequency based on split reads may underestimate it as well. To be comparable, I suggest you use (column 19 + column 29)/(column 16 + column 26). Col 19 and Col 29 are high-quality split read numbers. These fields (column 11 to column 39) are: cluster_id, contig_id, contig_size, reads_used_for_assembly,...
Can be found at https://github.com/czc/nb_distribution
Hello Zechen. I was trying to run novoBreak. but there were some error. I using AWS because i dont have high-perfomance computer. NovoBreak is in /home/ec2-user/novoBreak_distribution_v1.1.3. And i export the PATH too, but it didnt work well. export $PATH -bash: export: `/usr/local/bin:/bin:/usr/bin:/usr/local/sbin:/usr/sbin:/sbin:/opt/aws/bin:/home/ec2-user/.local/bin:/home/ec2-user/bin:/home/ec2-user/novoBreak_distribution_v1.1.3': not a valid identifier And i run by command 'sh run_novoBreak.sh...
you should export the path to the novoBreak_distribution directory. The command should be "export PATH=$PATH:/data/tmp/huebnerj/novoBreak_distribution_v1.1.3rc".
If there is no normal control, a simulated normal bam file (no coverage need) can be used to meet the interfaces of novoBreak. But the results would be both somatic and germline events. You can use wgsim to simulate raw reads from reference and align them using bwa to generate a bam file as "normal".
The source code of novoBreak is on the SourceForge already. Simply use "git clone https://git.code.sf.net/p/novobreak/git novobreak-git && cd novobreak-git && make" and copy the binary of novoBreak to replace the "novoBreak" in novoBreak_distribution_v1.1.3rc should be fine. If the software in the novoBreak_distribution_v1.1.3rc directory cannot use due to Linux environment (such as samtools), simply download the source and compile, install and copy to the novoBreak_distribution_v1.1.3rc directory...
so I am also unable to run this on our main server Linux any help will be very welcome! bash novoBreak_distribution_v1.1.3rc/run_novoBreak.sh novoBreak_distribution_v1.1.3rc/novoBreak hg19.fasta bamfiles/P38_14.MB2.sorted.dup_removed.q40.bam bamfiles/P38_14CER.sorted.dup_removed.q40.bam 2 -PWD readlink: invalid option -- 'P' Try 'readlink --help' for more information. mkdir: missing operand Try 'mkdir --help' for more information. novoBreak_distribution_v1.1.3rc/run_novoBreak.sh: line 26: /home/rmgtcpr/novoBreak_distribution_v1.1.3rc/novoBreak/novoBreak:...
So i see that readlink is a problem, and i did brew install coreutils so i can use greadlink, and I edited the file changing readlink to greadlink everywhere i also did brew install perl in case that was the problems now I get someting to do with symlinks which someone commented on, probably beyond me to solve... greadlink: invalid option -- 'P' Try 'greadlink --help' for more information. usage: mkdir [-pv] [-m mode] directory ... /Volumes/SD/novoBreak_distribution_v1.1.3rc/run_novoBreak.sh: line...
Hello I would really like to use this, but 1) I am trying on a single core i7 mac with 16 G of RAM. Is there any chance it will work, even it takes days, or should I give up? 2) it doesnt wan to start- am i doing this the right way? bash /Volumes/SD/novoBreak_distribution_v1.1.3rc/run_novoBreak.sh /SD/novoBreak_distribution_v1.1.3rc /Volumes/SD/bwa0712/hg19.fasta /Volumes/SD/MPS_work/P38_14.MB2.sorted.dup_removed.q40.bam /Volumes/Toshiba4Tb/WGSdata/mayo_MPS/P38_14.MB2.sorted.dup_removed.q40.chr4.sorted.bam...
Hi Zechen, I'm attempting to use your tool novoBreak by running 'run_novoBreak.sh',...
Hi Zechen, I'm attempting to use your tool novoBreak by running 'run_novoBreak.sh',...
Hi Wenkai, You may try the inherent second filter filter_sv2.pl. Or a tip you may...
In the Nature Methods paper novoBreak is used to call indels on the DREAM datasets....
Hi, I'd like to report and share some information which is not described in README...
new file: asm_R2.c
usage update
On branch testbitvec
On branch feature-bam
On branch feature-bam
change samtools to v.0.1.19
print out read pair ids
fixed a bug when encap bitvec
fixed a read length < kmer size bug
v1.05
On branch local_assembly
On branch loadid
fixed bitvec init size type bug
On branch feature-bam
On branch feature-bam
On branch testbitvec
modified: filter_kmer.c
report somatic and germline kmer reads now
modified: Makefile
On branch filterkmer
hash reference first then process reads
On branch feature-bam
On branch feature-bam
fixed a bam header bug
Initial commit
modified: Makefile
On branch filterkmer
using bitvec to represent kmers in reads
fixed warnings of selfassignment and one error ...
modified: filter_kmer.c
fixed a bug of freeing chash values
modified: file_reader.c
sequences are loaded, then only id loaded
added new files
nothing to do