Recursive version of the Re-Pair algorithm for large repetitive collections.
git clone https://github.com/apachecom/rrepair.git
./config.sh
Usage: <filename> <ws> <mod> <max-iter> <th-len-seq>
where:
- filename: file to compress
- ws: window size used by the parser
- mod: modulus used by the parser
- max-iter: maximum number of iterations of the recursive partition
- th-len-seq: the threshold for the length of the initial sequence of the parser
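
To give an intuition for the `ws` and `mod` parameters, here is a minimal Python sketch of a window-based, content-defined parser. It assumes that the parser slides a window of `ws` bytes over the input and ends a phrase whenever a rolling hash of the window is 0 modulo `mod`; this README does not confirm that rrepair works this way. The function name `parse`, the constants `BASE` and `PRIME`, and the non-overlapping phrases are all hypothetical simplifications and are not taken from the rrepair code.

```python
from typing import List

BASE = 256             # hypothetical hash base (one symbol per byte)
PRIME = 1_000_000_007  # hypothetical prime for the rolling hash


def parse(data: bytes, ws: int, mod: int) -> List[bytes]:
    """Split data into phrases using a Karp-Rabin style rolling hash.

    A phrase ends right after any window of ws bytes whose hash is
    0 modulo mod. This is an illustrative assumption, not rrepair's
    actual parsing rule.
    """
    if len(data) < ws:
        # Input shorter than one window: keep it as a single phrase.
        return [data] if data else []

    top = pow(BASE, ws - 1, PRIME)  # weight of the byte leaving the window
    h = 0
    for b in data[:ws]:             # hash of the first window
        h = (h * BASE + b) % PRIME

    phrases = []
    start = 0
    i = ws                          # the window covers data[i - ws:i]
    while True:
        if h % mod == 0:            # trigger: cut a phrase at position i
            phrases.append(data[start:i])
            start = i
        if i == len(data):
            break
        # Slide the window one byte to the right.
        h = ((h - data[i - ws] * top) * BASE + data[i]) % PRIME
        i += 1

    if start < len(data):           # trailing bytes after the last trigger
        phrases.append(data[start:])
    return phrases


if __name__ == "__main__":
    text = b"abracadabra" * 20
    phrases = parse(text, ws=4, mod=7)
    print(len(phrases), "phrases,", len(set(phrases)), "distinct")
```

Under this assumption, a larger `mod` makes triggers rarer and phrases longer on average, while `ws` controls how much context decides each cut. Because cuts depend only on local content, repeated regions of the input tend to be split into the same phrases.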