In what follows, we elaborate on the underlying compression-decompression mechanism, which is based on sampling and on fuzzy encoding realized with coordinate-based membership degrees. This helps us stress the systematic and coherent development process proposed in this study, as well as highlight the novelty of the proposed mechanism. The algorithm is run for specific values of its settings, namely the sampling ratio, the number of prototypes used to reconstruct one pixel, and the fuzzification coefficient. The effect of these values on the quality of the reconstruction is studied later.
A. Segmentation of image
Before an image can be processed, it needs to be digitized, encoded, and stored. A grayscale image can be defined as a two-dimensional function f(x, y), where x and y are the spatial coordinates in the horizontal and vertical directions, respectively. The gray level (brightness) of the image at any point (x, y) is given by its amplitude f. A sample of the image at a certain point is usually referred to as a pixel. An image consists of a finite number of pixels, each of which has a particular location and a gray-level value. There are several ways of coding the gray-level value at each point, depending on the number of bits used to represent one pixel. The more bits used per pixel, the more gray levels can be represented, and the more memory and bandwidth are demanded during processing. The 8-bit image is the most common one, allowing up to 256 gray levels. The continuous representation of an image is transformed into digital format, and the gray-level values can be arranged in a two-dimensional matrix.
Suppose that the original image is represented by a matrix of pixels of size M×N, and the gray level of each pixel is stored as an 8-bit integer, which allows for 256 shades of brightness ranging from 0 to 255. We assign the value 0 to black, 255 to white, and distribute the gray scale linearly in between. The first step is to partition the image into a number of adjacent, non-overlapping blocks. Each block is a rectangle of fixed size m×n, where m<<M and n<<N, as shown in Fig. 2. Note that the blocks on the right or bottom edges of the image may be smaller than m×n; for uniformity, these blocks are treated in the same way as the others. The choice of suitable values of m and n requires some attention. If a block is too large, we may face an explosion of the search space. On the other hand, an undersized block may hinder the visual quality of the reconstructed image, especially in the areas near the block boundaries.
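As an illustration, the partitioning step can be sketched in Python as follows (a minimal sketch under our own naming; the image is assumed to be a list of rows of gray levels, and edge blocks smaller than m×n are kept as-is, in line with the description above):

```python
def partition_blocks(image, m, n):
    """Split an M x N image (list of rows of gray levels) into
    non-overlapping m x n blocks, left-to-right and top-to-bottom.

    Blocks on the right and bottom edges may be smaller than m x n;
    they are returned unchanged and treated like any other block.
    Returns a list of (top, left, block) tuples.
    """
    M, N = len(image), len(image[0])
    blocks = []
    for top in range(0, M, m):
        for left in range(0, N, n):
            block = [row[left:left + n] for row in image[top:top + m]]
            blocks.append((top, left, block))
    return blocks
```

Since each pixel falls in exactly one block, the blocks jointly cover all M×N pixels without overlap.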
B. Selection of prototypes
In each block, we randomly pick a sample of pixels, say g pixels; we refer to them as prototypes and denote them v1, v2, …, vg. The coordinates of these points are denoted (x1, y1), (x2, y2), …, (xg, yg), while their gray levels are denoted f(x1, y1), f(x2, y2), …, f(xg, yg), respectively. These sampling points are treated as “anchor” points from which the block is reconstructed. The sampling ratio is defined as the ratio of the number of sampling points to the total number of pixels in the block, that is, p = g/(m×n). The sampling ratio p influences both the quality of the reconstruction and the compression ratio. As the selection involves an inherent factor of randomness, the whole process is repeated for a number of iterations, and the prototypes producing the lowest reconstruction error are retained as the representatives of the current block. As to the sampling ratio, one option is to adopt the same value in each block; another option worth considering is to choose different values for different blocks, depending on the level of detail present in a given block, with the ratio determined on the basis of the reconstruction criterion.
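The random draw of g = p·m·n prototypes from one block can be sketched as follows (an illustrative Python sketch, not the authors' implementation; the function name and the triple-based representation are ours):

```python
import random

def sample_prototypes(block, p, rng=random):
    """Randomly draw a fraction p of the pixels of a block as prototypes.

    Returns a list of (x, y, gray_level) triples -- the "anchor" points
    stored for the block; coordinates are local to the block.
    """
    m, n = len(block), len(block[0])
    g = max(1, round(p * m * n))          # number of prototypes
    coords = [(x, y) for x in range(m) for y in range(n)]
    chosen = rng.sample(coords, g)        # sampling without replacement
    return [(x, y, block[x][y]) for (x, y) in chosen]
```

In the iterative scheme described above, this draw would be repeated several times and the set of prototypes yielding the lowest reconstruction error retained.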
C. Determination of membership degrees
Let us assume that for each block of pixels we are given a collection of prototypes v1, v2, …, vg, as shown in Fig. 3. For any pixel s located at coordinates (a, b) in this block that has not been designated as a prototype, we calculate its membership degrees with respect to the c nearest prototypes among v1, v2, …, vg, denoted v’1, v’2, …, v’c. The corresponding coordinates of these c prototypes are denoted (x’1, y’1), (x’2, y’2), …, (x’c, y’c). The choice of the parameter c influences the visual quality of the reconstructed image. The values of the membership degrees are determined in the same manner as in the Fuzzy C-Means (FCM) clustering method [23][24][28]:
$$u_i(s)=\frac{1}{\sum\limits_{j=1}^{c} \left( \frac{||s - v{'_i}||}{||s - v{'_j}||} \right)^{2/(m-1)}}\tag{1}$$
where ui(s) stands for the membership degree of pixel s to prototype v’i. Here ||.|| denotes a certain distance function; we use the Manhattan distance in this study, namely \(||s - v{'_i}||=|a - x{'_i}|+|b - y{'_i}|\), although any other form of distance could be considered. Each pixel of an image is identified by its x and y coordinates. Given a number of prototypes v1, v2, …, vg and a specific pixel s(a, b), the vector of membership grades [u1(s), u2(s), …, uc(s)] of this pixel to its c closest prototypes does not need to be stored, since its values can be computed on the fly. The membership degrees in this vector depend on a number of parameters, such as m and c. The value of m (the fuzzification coefficient of the FCM method) influences the membership values. The most commonly used value of m is equal to 2; values higher than 2 make the resulting membership functions become “spiky”, while lower values of the fuzzification coefficient produce Boolean-like (binary) membership regions [10].
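Computing Eq. (1) for one pixel can be sketched as follows (an illustrative Python sketch with names of our own choosing; `fuzz` stands for the fuzzification coefficient m, and the special case of a zero distance is handled defensively, although it does not arise for non-prototype pixels):

```python
def memberships(pixel, prototypes, c, fuzz=2.0):
    """Membership degrees of a non-prototype pixel to its c nearest
    prototypes (Eq. 1), using the Manhattan distance.

    pixel      -- (a, b) coordinates of the pixel
    prototypes -- list of (x, y, gray_level) triples of the block
    fuzz       -- fuzzification coefficient m of the FCM method
    Returns the c nearest prototypes and their membership degrees.
    """
    a, b = pixel
    dist = lambda p: abs(a - p[0]) + abs(b - p[1])
    nearest = sorted(prototypes, key=dist)[:c]   # c closest prototypes
    exp = 2.0 / (fuzz - 1.0)
    u = []
    for vi in nearest:
        di = dist(vi)
        if di == 0:               # pixel coincides with a prototype
            u = [1.0 if v is vi else 0.0 for v in nearest]
            break
        u.append(1.0 / sum((di / dist(vj)) ** exp for vj in nearest))
    return nearest, u
```

By construction the c membership degrees of a pixel sum to one, and a pixel equidistant from its nearest prototypes receives equal degrees.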
D. Reconstruction of image through fuzzy decoding
Having a collection of pixels that serve as prototypes, we can formulate the reconstruction problem: determine the gray level of each pixel in the block given the prototypes v1, v2, …, vg. We identify two categories of pixels: (a) pixels that have been identified as prototypes; since their gray levels have been stored in advance, their reconstruction does not require any specialized handling; (b) pixels not selected as prototypes, denoted s, with coordinates (a, b), whose reconstructed gray level is denoted F(a, b). The reconstruction problem is solved in the same way as in the FCM; thus the reconstruction result (the gray-level value) is determined as:
$$F(a,b)=\frac{\sum\limits_{i=1}^{c} u_i(s)^m f(x{'_i},y{'_i})}{\sum\limits_{i=1}^{c} u_i(s)^m}\tag{2}$$
where (x’i, y’i), i=1, 2, …, c, are the coordinates of the prototypes used for reconstructing s(a, b). The quality of the compression-decompression depends on a number of parameters, and these can be optimized. In general, we can regard the compression-decompression problem as an optimization of the selection of prototypes, guided by the minimization of a certain performance index that quantifies the departure of the reconstructed gray levels from the original ones, namely \(F({x_s},{y_s}) - f({x_s},{y_s})\), where s denotes a pixel of the current block. The mean squared error (MSE) performance index considered here comes in the following form:
$${\text{MSE}}_{{\text{block}}}=\frac{1}{m \times n}\sum\limits_{s \in {\text{current block}}} {(F({x_s},{y_s}) - f({x_s},{y_s}))^2}\tag{3}$$
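Eqs. (1)-(3) together give the decoding of a single pixel and the error criterion for a block. A self-contained Python sketch (illustrative only; names are ours, the Manhattan distance and the membership computation of Eq. (1) are inlined) is:

```python
def reconstruct_pixel(pixel, prototypes, c, fuzz=2.0):
    """Fuzzy decoding of one non-prototype pixel (Eqs. 1-2): the gray
    levels of the c nearest prototypes are aggregated, each weighted by
    the m-th power of its membership degree (Manhattan distance)."""
    a, b = pixel
    dist = lambda p: abs(a - p[0]) + abs(b - p[1])
    nearest = sorted(prototypes, key=dist)[:c]
    exp = 2.0 / (fuzz - 1.0)
    num = den = 0.0
    for (x, y, f) in nearest:
        d = abs(a - x) + abs(b - y)
        if d == 0:                       # coincides with a prototype
            return float(f)
        u = 1.0 / sum((d / dist(vj)) ** exp for vj in nearest)
        num += (u ** fuzz) * f
        den += u ** fuzz
    return num / den

def block_mse(original, reconstructed):
    """Mean squared error between original and reconstructed block (Eq. 3)."""
    m, n = len(original), len(original[0])
    return sum((reconstructed[x][y] - original[x][y]) ** 2
               for x in range(m) for y in range(n)) / (m * n)
```

With a single prototype (c = 1), every reconstructed pixel simply takes that prototype's gray level; with two equidistant prototypes and m = 2, the reconstruction is their plain average, as expected from Eq. (2).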
E. Reconstruction of the overall image
Once the prototypes for each block have been determined, they form the basis for the reconstruction of the overall image, which proceeds block by block. For the pixels of each block that have not been identified as prototypes, the decompression (reconstruction) process consists of three steps: (a) determination of the c nearest prototypes v’1, v’2, …, v’c among the prototypes representing the current block, according to the distances from this pixel to the prototypes; (b) calculation of the membership degrees (matching levels) of this pixel to the selected c nearest prototypes; and (c) aggregation of the gray levels of the selected prototypes weighted by the membership degrees.
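The three decoding steps applied to one whole block can be sketched as follows (a self-contained illustrative Python sketch; the function name and data layout are ours, prototype pixels are simply copied, and the fuzzification coefficient is again written `fuzz`):

```python
def decompress_block(shape, prototypes, c, fuzz=2.0):
    """Reconstruct one block from its stored prototypes, following the
    three decoding steps: (a) find the c nearest prototypes of each
    non-prototype pixel, (b) compute its membership degrees (Eq. 1),
    and (c) aggregate the prototypes' gray levels weighted by those
    degrees (Eq. 2)."""
    m, n = shape
    stored = {(x, y): f for (x, y, f) in prototypes}
    out = [[0.0] * n for _ in range(m)]
    for a in range(m):
        for b in range(n):
            if (a, b) in stored:          # prototype: gray level is stored
                out[a][b] = float(stored[(a, b)])
                continue
            dist = lambda p: abs(a - p[0]) + abs(b - p[1])
            nearest = sorted(prototypes, key=dist)[:c]            # step (a)
            u = [1.0 / sum((dist(vi) / dist(vj)) ** (2.0 / (fuzz - 1.0))
                           for vj in nearest) for vi in nearest]  # step (b)
            den = sum(ui ** fuzz for ui in u)
            out[a][b] = sum((ui ** fuzz) * vi[2]
                            for ui, vi in zip(u, nearest)) / den  # step (c)
    return out
```

Running this over all blocks, with Eq. (3) accumulated per block, yields the reconstructed image and its overall error.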