Improving PPM with Dynamic Parameter Updates

Steinruecken, C; Ghahramani, Z; MacKay, D

Repository URI

https://www.repository.cam.ac.uk/handle/1810/254106

Files

Steinruecken 2015 Data Compression Conference 2015.pdf (198.71 KB)

Type

Article

Authors

Steinruecken, C

Ghahramani, Zoubin

https://orcid.org/0000-0002-7464-6475

MacKay, D

Abstract

This article makes several improvements to the classic PPM algorithm, resulting in a new algorithm with superior compression effectiveness on human text. The key differences between our algorithm and classic PPM are that (A) rather than the original escape mechanism, we use a generalised blending method with explicit hyper-parameters that control the way symbol counts are combined to form predictions; (B) different hyper-parameters are used for classes of different contexts; and (C) these hyper-parameters are updated dynamically using gradient information. The resulting algorithm (PPM-DP) compresses human text better than all currently published variants of PPM, CTW, DMC, LZ, CSE and BWT, with runtime only slightly slower than classic PPM.
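
To make points (A) and (C) concrete, here is a minimal Python sketch. It is not the paper's implementation. It assumes a Pitman-Yor-style blending rule with a strength parameter `alpha` and a discount parameter `beta`, and a plain gradient-ascent step on the log-probability of the observed symbol. The abstract does not state the exact formula or update rule, so the function names, the rule itself, and the learning rate `rate` are all illustrative assumptions.

```python
from collections import Counter


def blend(counts, parent, alpha, beta):
    """Blend one context's symbol counts with its parent distribution.

    Assumed rule (not taken from the abstract):
        P(x) = max(n_x - beta, 0) / (N + alpha)
               + (U * beta + alpha) / (N + alpha) * parent(x)
    where N is the total count and U the number of distinct symbols seen.
    Requires alpha > 0 and 0 <= beta < 1; `parent` must sum to 1.
    """
    total = sum(counts.values())
    unique = sum(1 for c in counts.values() if c > 0)
    norm = total + alpha
    escape = (unique * beta + alpha) / norm  # mass passed to the parent
    return {
        x: max(counts.get(x, 0) - beta, 0.0) / norm + escape * q
        for x, q in parent.items()
    }


def gradient_step(counts, parent, alpha, beta, symbol, rate=0.1):
    """One gradient-ascent step on log P(symbol) w.r.t. alpha and beta.

    Hypothetical update rule for illustration; `rate` is an assumed
    learning rate. Parameters are clamped to keep the blend valid.
    """
    total = sum(counts.values())
    unique = sum(1 for c in counts.values() if c > 0)
    n = counts.get(symbol, 0)
    seen = 1.0 if n > 0 else 0.0
    q = parent[symbol]
    norm = total + alpha
    p = (n - beta * seen) / norm + (unique * beta + alpha) / norm * q
    # Partial derivatives of P(symbol) under the assumed blending rule.
    dp_dalpha = (q * (total - unique * beta) - (n - beta * seen)) / norm ** 2
    dp_dbeta = (unique * q - seen) / norm
    # d log P = dP / P, so divide each derivative by p.
    alpha = max(alpha + rate * dp_dalpha / p, 1e-6)
    beta = min(max(beta + rate * dp_dbeta / p, 0.0), 0.999)
    return alpha, beta


counts = Counter("abracadabra")            # counts seen in one context
parent = {s: 1 / 6 for s in "abcdrz"}      # uniform parent; 'z' is unseen
before = blend(counts, parent, 0.5, 0.5)
alpha, beta = gradient_step(counts, parent, 0.5, 0.5, "z")
after = blend(counts, parent, alpha, beta)
```

In this toy run the unseen symbol `z` is observed, so the step raises both parameters and `after["z"]` exceeds `before["z"]`. For point (B), one could keep a separate `(alpha, beta)` pair per class of context, for example keyed by context depth; that keying is likewise an assumption made for illustration.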

Description

This is the author accepted manuscript. The final version is available from IEEE via http://dx.doi.org/10.1109/DCC.2015.77

Keywords

46 Information and Computing Sciences, 4602 Artificial Intelligence

Journal Title

Data Compression Conference Proceedings

Conference Name

2015 Data Compression Conference (DCC)

Journal ISSN

1068-0314

Publisher

IEEE

Publisher DOI

https://doi.org/10.1109/DCC.2015.77

Collections

Scholarly Works - Engineering