Adaptive, Lossless Compression of 2-D Astronomical Images

We describe a lossless compression program developed for incorporation into the data acquisition system of a 16 CCD drift scan survey. The program is based on switched linear prediction with Rice coding of the high order bits of the mapped prediction residuals. Several notable features of the program are its speed ( $\approx 5$ MB/sec on a 300 MHz Pentium II), adaptivity to a variety of image types without any user-tuned parameters, and suitability for online work (it is a one-pass algorithm with minimal buffering requirements and a steady output stream).

The compression ratios and run times of the program are compared to other well-known compression programs using a variety of test images. A stand-alone version of the encode program is available in C from ftp://www.astro.yale.edu/pub/sabbey/encode.tar.gz.

1. Introduction

Although numerous programs are available for the lossy compression of astronomical images (e.g., White & Percival 1994; Starck et al. 1996), much less work has been published on lossless compression, in particular for online applications (although see Veran & Wright 1994). As a result, in producing the online software for the QUEST (QUasar Equatorial Survey Team) 16 CCD drift scan survey (Sabbey, Coppi, & Oemler 1998), we designed a custom, lossless compression program. The real-time compression provided has proved crucial for reliably acquiring and storing the nightly 30 GB of drift scan image data with the existing hardware system.

The program was required to be fast (i.e., compress at a rate comparable to the disk I/O), work in one-pass with minimal buffering, be easy to implement and incorporate into a complex system, allow streaming of the data to disk or tape, adapt to a variety of image types, and provide compression ratios comparable to more sophisticated offline programs without requiring any user-tuned parameters.

To accomplish this, we started with the well-known technique (Rice 1991) of Rice coding (Rice & Plaunt 1971) the difference between each pixel value and its linear prediction (a linear combination of the already-transmitted, neighboring pixel values). More specifically, the prediction residuals are partitioned into the N low order ``noise'' bits, which cannot be compressed, and the remaining (16-N) high order bits, which form a sharply peaked (Laplacian) distribution, and can be efficiently transmitted as Rice codes. Rice coding is a form of entropy coding (in which more frequently occuring values are assigned shorter codewords) that is less optimal but faster than Huffman coding.

To extend this technique of Rice coding the most significant bits of the differential signal, we investigated the selection of the linear predictor, the calculation of the partition size N, and how to make both adaptive at each pixel of the image in a fast and automated way without transmitting overhead information. The adaptivity is crucial for obtaining consistently high compression ratios and eliminating parameters that require user tuning. Some examples of image variation that the program should adapt to are changes in the background level, the noise level, the object surface density and shape, the image type (e.g., photometry or slitless spectroscopy), and the existence of bad CCD columns and other defects.

To adaptively select the predictor we modified Graham's rule (see Netravali & Lamb 1980, and Fig. 1) for use with astronomical images. In relatively flat image regions, the mean of several neighboring pixels will be selected as the predictor; otherwise the direction and order of a predictor will be selected based on the local correlation. For example, pixel values in a bad CCD column will be predicted using the adjacent previous row value, while a higher order predictor in the dispersion direction will be used to predict bright spectrum pixels in a slitless spectroscopy image.

**Figure 1:** A prediction, $\hat{{\rm x}}$ , is formed for the current pixel, ${\rm x}$ , using one of several possible linear combinations of the neighboring pixels (a, b, c, and d). A modification of Graham's rule is used to select one of the linear combinations based on a comparison of the local gradient approximations ( $\delta _{x} = \vert c-b\vert$ and $\delta _{y} = \vert a-b\vert$ ) to a noise threshold $\epsilon$ .
$\begin{figure} \epsscale{0.7} \plotone{sabbeycn1_1.eps} \end{figure}$

An alternative approach to making the prediction adaptive is to subdivide the image into blocks (e.g., 8 x 8). Then for each block all the predictors can be tested and the most effective one used. However, for our image set the optimal block size and shape varied among the images and within a given image. Also, the blocks were not aligned with the image features, and there was a conflict between using small blocks to allow rapid adaptivity but needing large blocks to limit the overhead of indicating the predictor used. Due to these and other obstacles (e.g., the increased computation required), we resorted to the modified Graham's rule described above.

The partition size (number of ``noise'' bits N = N_s + N_r) is adaptively selected at each pixel from the sum of a slowly varying term, N_s, which accounts for the background noise level, and a rapidly varying term, N_r, which accounts for structure remaining in the prediction residual image. N_s for a given image block is obtained using the number of 1-bit Rice codes transmitted in the previous image block (N_s is adjusted to keep the probability of Rice codeword 0 near optimal). N_r is obtained at each pixel from a table lookup using the prediction residual from the previous pixel as an index.

**Figure 2:** The performance of **encode** is evaluated by comparison to the lossless output of three well-known compression programs. The **encode** program often provides significantly improved compression ratios (the average percent difference is 22% improvement).
$\begin{figure} \epsscale{0.7} \plotone{sabbeycn1_2.eps} \end{figure}$

**Figure 3:** The speed of **encode** is evaluated by comparison to three well-known compression programs (using a 300 MHz Pentium II). Only the large ( $\approx 8$ MB) images were used to avoid measuring times $\ll 1$ s and to use a consistent image size. The **encode** program is found to be an order of magnitude faster in many cases.
$\begin{figure} \epsscale{0.7} \plotone{sabbeycn1_3.eps} \end{figure}$

$\begin{deluxetable}{lllll} \scriptsize\tablecolumns{5} \tablecaption{25 test ima... ...s$497 &Fornax dwarf in B, crowded star field & 4.8\\ \enddata \end{deluxetable}$
Of the 25 test images used to evaluate the encode program, 13 comprise the ``standard test images'' suite used at the Two-Dimensional Photometry Systems and Test Images Workshop (ESO/ECF 1989). The remaining 12 images include simulated and observed photometry and spectroscopy image data with varying noise levels, crowding, etc.

The compression ratios and run times of the encode program were evaluated by comparison to three well-known compression programs: hcompress, fitspress, and compfits+ compact. The test images used are listed in Table 1, and the results are presented in Figs. 2 and 3. Although the three comparison programs have important uses beyond lossless compression (e.g., hcompress and fitspress are often used for lossy compression), they nonetheless represent the state of the art and provide a relevant standard. Further details of the encode algorithm can be found in Sabbey et al. (1998).

References

Rice, R. F. 1991, Some Practical Universal Noiseless Coding Techniques, Part III, Module PSI14,K+ (JPL Publ. 91-3), (Pasadena: JPL)

Veran, J. P. & Wright J. R. 1994, in ASP Conf. Ser., Vol. 61, Astronomical Data Analysis Software and Systems III, ed. D. R. Crabtree, R. J. Hanisch, & J. Barnes (San Francisco: ASP), 519

White, R. L. & Percival, J. W. 1994, in SPIE Proc., Vol. 2199, Advanced Technology Optical Telescopes V, ed. L. M. Stepp (Bellingham: SPIE), 703

Adaptive, Lossless Compression of 2-D Astronomical Images

Abstract:

1. Introduction

References