Re-Pair [5] is an effective grammar-based compression scheme achieving strong compression rates in practice. Let n, σ, and d be the text length, alphabet size, and dictionary size of the final grammar, respectively. In their original paper, the authors show how to compute the Re-Pair grammar in expected linear time and 5n + 4σ2 + 4d + √n words of working space on top of the text. In this work, we propose two algorithms improving on the space of their original solution. Our model assumes a memory word of [log2n] bits and a re-writable input text composed by n such words. Our first algorithm runs in expected O(n/ϵ) time and uses (1 + ϵ)n + √n words of space on top of the text for any parameter 0

Bille, P.; Gortz, I. L.; Prezza, Nicola. (2017). Space-Efficient Re-Pair Compression. In Data Compression Conference Proceedings (pp. 171- 180). Isbn: 978-1-5090-6721-3. Doi: 10.1109/DCC.2017.24.

Space-Efficient Re-Pair Compression

Prezza N.
2017

Abstract

Re-Pair [5] is an effective grammar-based compression scheme achieving strong compression rates in practice. Let n, σ, and d be the text length, alphabet size, and dictionary size of the final grammar, respectively. In their original paper, the authors show how to compute the Re-Pair grammar in expected linear time and 5n + 4σ2 + 4d + √n words of working space on top of the text. In this work, we propose two algorithms improving on the space of their original solution. Our model assumes a memory word of [log2n] bits and a re-writable input text composed by n such words. Our first algorithm runs in expected O(n/ϵ) time and uses (1 + ϵ)n + √n words of space on top of the text for any parameter 0
2017
978-1-5090-6721-3
compression; grammar; Re-Pair
Bille, P.; Gortz, I. L.; Prezza, Nicola. (2017). Space-Efficient Re-Pair Compression. In Data Compression Conference Proceedings (pp. 171- 180). Isbn: 978-1-5090-6721-3. Doi: 10.1109/DCC.2017.24.
File in questo prodotto:
File Dimensione Formato  
repair.pdf

Open Access

Tipologia: Documento in Pre-print
Licenza: DRM (Digital rights management) non definiti
Dimensione 346.71 kB
Formato Adobe PDF
346.71 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11385/194115
Citazioni
  • Scopus 9
  • ???jsp.display-item.citation.isi??? 14
  • OpenAlex ND
social impact