Category
Sequence Analysis
Usage
plot_len.pl input.clstr 1,2-4,5-9,10-19,20-49,50-99,100-299,500-99999 10-59,60-149,150-499,500-1999,2000-999999
Manual
where
2nd line are sizes of cluster
3rd line are lengths of sequences
It will print distribution of clusters and sequences :
Size # seq #clstr 10-59 60-149 150-499 500-1999 2000-up
1 266312 266312 36066 103737 103285 22727 497
2-4 208667 81131 1229 14680 44607 20006 609
5-9 156558 24198 118 2148 12026 9388 518
10-19 155387 11681 30 596 5024 5462 569
20-49 176815 6007 6 139 2212 3135 515
50-99 106955 1568 0 24 410 955 179
100-499 154209 896 0 3 124 597 172
500-up 43193 40 0 0 1 14 25
Total 1268096 391833 37449 121327 167689 62284 3084
Share your experience or ask a question