-
Notifications
You must be signed in to change notification settings - Fork 24
Description
Strobealign sometimes produces alignments where only a couple of bases are aligned and the rest is soft clipped. For example, running the tests/compare-baseline.sh script with option -s (single ends) currently produces the file baseline/bam/acc4cffe5ac2c4db266c58d00b7b6462c6b4189c.se.bam, where read SRR6055476.83000 is mapped with CIGAR 2M149S and mapping quality 60, which doesn’t make sense. This particular case seems to be caused by a false positive hit.
It would be better to mark such extremely short hits as unmapped. The question is which minimum number of aligned bases we require. It should definitely be at least
In this dataset, there are 40 alignments shorter than