Here you can download the executable file of the software SearchRepeats. SearchRepeats searches for exact non overlapping repeats in nucleotidic (DNA) sequences and outputs these repeats in a text report. For more description you can refer to our publication.
Fast Discerning Repeats in DNA Sequences with a
Compression Algorithm
Rivals, M. Dauchet, J-P. Delahaye, O. Delgrange
Extended abstract in the 8th Workshop on Genome and
Informatics (GIW97)
Tokyo, 12-13 Dec 1997
usage : SearchRepeats <filename> [Min Factor Length]
Parameters:
First, one finds some general informations about the number of
factors, zones, etc. Then comes a list that describes each zone on
one line. An factor occurrence can be referenced several time: this
defines the TYPE of the zone. The first time, it is a TYPE_2
zone, while for further references to an already referenced factor
zones are of TYPE_N. The table below gives the meaning of the
other columns.
Example:
Nb_large_factors_in_seq 31231 Nb_encoded_zones 210 Nb_encoded_factors 209 Gain_evaluation_bits 55776 Encoded_char 33365 Code_length_evaluation 123537 1 138920 TYPE_2 82 90953 2 139002 TYPE_2 101 91138 3 139123 TYPE_2 132 91259 4 151734 TYPE_2 81 151634 ...
TOP WHAT PUBLICATION HOW OUTPUT