Development of Wavelet Based Image Compression Approach Using Modified SPIHT Algorithm

doi:10.21203/rs.3.rs-894142/v1

Download PDF

Research Article

Development of Wavelet Based Image Compression Approach Using Modified SPIHT Algorithm

https://doi.org/10.21203/rs.3.rs-894142/v1

This work is licensed under a CC BY 4.0 License

You are reading this latest preprint version

In case of images & videos, the encoding philosophy is substantially different as compared to that of speech. The very basic difference between image & speech is that, the speech signals are one dimensional, whereas the images are two dimensional in nature. Modern day images can have the resolution in the range of ‘1024 x 1024’, this means there will 1 Mega Pixels & for color images, 1 pixel is represented by 24 bits (8 bits for each pixel component of red, green & blue), that means there are 24 Megabits in a single still image. In videos such still images are changing at 30 frames per second, so the basic bit rate going to be 30 times of 24 Megabits, this leads to the explosion in information. This demands compression of images at significant extent. In this work, image compression is achieved by using transformation based on modified lifting wavelet structure followed by quantization based on modified set partition in hierarchical tree (SPIHT) algorithm. This modification in SPIHT algorithm helps to reduce the bit stream length of compressed signal to significant extent as compared to conventional SPIHT algorithm. The proposed architecture is developed with Verilog language and simulated with modelsim simulator and synthesized with XST(VHDL/Verilog) tool of Xilinx software under Virtex6-XC6VLX240TD environment. The comparative analysis of proposed system with existing system shows that, there is significant improvement in compression ratio & PSNR.

Image Compression

SPIHT algorithm

lifting wavelet

PSNR

Compression ratio

Image Filtering [1], Image Compression and Image Security [2] are the three indispensible aspects of image processing system [3]. For a still images of size ‘1024 x 1024’, 24 Megabits needs to processed and this is a very high number; but very luckily, the redundancy that exist in almost all types of natural images is very high. The redundancy in the sense that, any pixel in the image is highly correlated in its intensity value with its entire surrounding neighborhood; this type of redundancy is called as s statistical redundancy. There is yet another redundancy exploited in image compression i.e. Psycho-visual redundancy. The ultimate detector of any image or video are human eyes and based on the perception of images or videos that are getting from eyes, the accuracy of images is restricted i.e. what eyes cannot perceive is avoided. Psycho-visual redundancy essentially refers, to discard the image content that human eyes cannot detect. The general architecture of image compression system is shown in figure 1.

For compression, the image arrays are first put through a process of transformation. This transformation process makes the image signal amenable to good compression. The transformation process actually exploits the statistical redundancy. Quantization process is followed by transformation process. The quantization process exploits the psycho-visual redundancy. The quantization process essentially leads to the loss in the signal due to truncation of some values; hence exact reconstruction of signal is not possible at receiver end. The quantized signal also seems to have some statistical redundancy. Every symbol of quantized signal will be associated with some probability of occurrence, based on this probability of occurrence, the entropy is computed. This entropy is exploited using Entropy Encoder block to have compressed image. At receiver end exactly opposite process is used so as to restore the image from its compressed form.

The Transformation & Entropy Encoding processes are totally lossless processes, whereas quantization is a lossy process. The image compression achieved without quantization results in lossless compression. There are several lossless compression techniques like Huffman coding [4], Arithmetic coding [5], Lemple-Ziv Coding [6]; but the lossless compression techniques always have some limitations, as Images or videos contains very large information, hence the lossless compression achieved by predictive coding followed by entropy coding is not sufficient, and hence in most of practical applications lossy compression is widely used. The lossy compression includes quantizer, as an additional block as compared to lossless compression. Even if this quantizer quantizes the transformed output by truncating certain information, human eyes unable to detect any major perceptual degradation in the quality of decompressed image. Quantization block in-spite being a lossy element, the transformation block is key block in lossy compression, in order to decide how effectively the compression could be achieved. The transformation block is essentially lossless block but this block follows a major role in the sense that, it makes the signal amenable to the compression by exploiting the correlation that exists in the signal. Essentially, the transformation process offer energy compaction property [7]. The image in spatial domain undergoes transformation process, so as to have image in transformed domain, with number of coefficients in transformed domain is exactly equal to number of pixels in spatial domain. Out of total number of coefficients in transformed domain, most of the coefficient does not carry energy with it and most of the energy is carried by few coefficients only. The significant compression is then achieved by retaining only few coefficients, that carries most of the energy & discarding large number of other coefficients. Various image compression techniques have been evolved over the period of time. Transformation based [8–10] techniques like discrete cosine transform [11] and discrete wavelet transform [12], often results high compression efficiency but with high degree computational complexity. On the other hand, techniques without transformation deals with very low computational complexity but with cost of degradation in terms of compression efficiency.The very first image compression standard named JPEG (Joint Picture expert group) was developed with DCT. Although JPEG standard was developed in 1992, but after the 25 years of its development, the JPEG standard is still going strong [13], but the image compression using Discrete Cosine Transform suffers from some serious problems of Blocking Artifacts & Graininess in reconstructed images [14]. Because of these deficiencies associated with DCT, the DCT is not suitable beyond certain bit rate. This issue of DCT is solved by using Discrete Wavelet Transform (DWT). JPEG-2000 was developed with DWT, so as to overcome the deficiencies associated with JPEG.The DWT is one dimensional in nature whereas the images are two dimensional in nature, so the concept of one dimensional discrete wavelet filter is extended in two dimensional way & applied on images. The image signal is represented as\(S({n_1},{n_2})\), where \({n_1}\)= Index in horizontal direction &\({n_2}\)= Index in vertical direction. So for image encoding four different filter pairs [15] are used as given below,

\(\Phi ({n_1},{n_2})=\Phi ({n_1})\Phi ({n_2})\) , Low pass filter in both vertical & horizontal direction.

\({\psi ^h}({n_1},{n_2})=\psi ({n_1})\Phi ({n_2})\) , High pass filter in horizontal direction & low pass filter in vertical direction.

\({\psi ^V}({n_1},{n_2})=\Phi ({n_1})\psi ({n_2})\) , Low pass filter in horizontal direction & high pass filter in vertical direction.

\({\psi ^d}({n_1},{n_2})=\psi ({n_1})\psi ({n_2})\) , High pass filter in both vertical & horizontal direction.

Where, \(\Phi (n)\) is a scaling filter &\(\psi (n)\) is a wavelet filter. Each of above filter pair consists of two filters with one acting in horizontal direction & other acting in vertical direction. These results are then decimated by the factor of two in horizontal direction & also decimated by factor of two in vertical direction, so each pair will cause the overall decimation by the factor of four. So when these four filter pair applied on image, then the individual filter pair responses is represented in image space, with the complete image is divided in to four sub-bands as depicted in below figure 2.

This wholeprocess is summarized in block diagram form as given in figure 3.

Where \(j+1\)= Starting Scale

=- Row index

= Column index

The \({W_\Phi }(j+1,m,n)\) represents a starting signal, this signal could be the entire image. This signal is applied to two different filters that is\({h_\psi }(n)\)&\({h_\varphi }(n)\). With \({h_\psi }(n)\) is wavelet filter (HPF) applied along column and \({h_\varphi }(n)\) is a scaling filter (LPF) applied along column. The signals obtained from these two filters are then down sampled (decimated) by the factor of two. The two signal obtained from these two decimator is processed through pair of \({h_\psi }(m)\)&\({h_\varphi }(m)\) filters. With \({h_\psi }(m)\) is wavelet filter (HPF) applied along row and \({h_\varphi }(m)\)is a scaling filter (LPF) applied along row, these filters are followed by decimator, so that there will be four signals as below

\(W_{\psi }^{D}(j,m,n)\) = This signal is derived by high pass filtering along the rows as well as along the columns; hence this sub band is referred as HH sub band. The high pass filtering extract edges & as this signal is derived by high passes filtering along both rows & columns, it extracts edges along both row & column, in short extracting diagonal edges.
\(W_{\psi }^{V}(j,m,n)\) = This signal is derived by high pass filtering along the columns & low pass filtering along the rows, hence this sub band is referred as HL sub band. This signal is derived by high pass filtering along the columns & low pass filtering along the rows, in short extracting the vertical edges.
\(W_{\psi }^{H}(j,m,n)\) =This signal is derived by low pass filtering along the columns & high pass filtering along the rows, hence this sub band is referred as LH sub band. This signal is derived by low pass filtering along the columns & high pass filtering along the rows, in short extracting the horizontal edges.
\(W_{\Phi }^{{}}(j,m,n)\) = This signal is derived by low pass filtering along the rows as well as along the columns; hence this sub band is referred as LL sub band.

Thus signal \({W_\Phi }(j+1,m,n)\)is scaled down to\(W_{\Phi }^{{}}(j,m,n)\), represented as LL sub band. Effectively the image resolution is reduced by the factor of two along the horizontal & vertical direction, so LL sub band contains one fourth samples of original image. All the four sub bands (HH, HL, LH, and LL) can be further analyzed or partitioned individually. The images are generally very rich in low frequency content, hence out of all these four sub bands, maximum information is contained within the LL sub band only. In short original image is squeezed to one fourth of its size & put it in to the LL sub band. Thus logically LL sub band is partitioned further instead of other sub bands as shown in figure 4. Mathematically \(W_{\Phi }^{{}}(j,m,n)\)is taken as starting point to obtain next four sub bands as LL₁, HL₁, LH₁, HH₁.

The LL₁ sub band can be partitioned further to obtain LL₂, HL₂,L H₂& HH₂ sub bands; likewise this portioning or decomposition can be achieved, so as to obtain coarser & coarser domain from finer domain. This type of partitioning helps space frequency localization. This way of partitioning the LL sub band & obtaining further sub bands is called as dyadic partitioning.

In this work, image compression is achieved by using transformation based on modified lifting wavelet structure followed by quantization based on modified set partition in hierarchical tree (SPIHT) algorithm.

2.1 Modified Lifting Wavelet Structure

The DWT technique based on convolution, involves filtering the input signal by two independent filters asHPF (High pass filter) &LPF (low pass filter). The output of both these filters is then decimated by two, so as to get low pass & high pass sub bands; also called as approximation &detailed sub bands respectively. The speed of convolution based DWT is the major constraint. Tian et al. [16] proposes efficient & high speed 2-dimensional lifting discrete wavelet transform. The lifting wavelet transform divides the given sequence in to its two poly-phase components, these poly-phase components are operated in parallel. So lifting wavelet deals with an operation on poly-phase components as an operation on ‘2 X 2’ sequences instead of original sequence. This is illustrated in figure 5.

From above figure 5, it is cleared that, the decomposition is achieved by arithmetic addition & subtraction between two adjacent samples of the given sequence, followed by multiplication with ½; so as to get the \({Y_{{V_0}}}\) and \({Y_{{W_0}}}\), with number of samples are exactly half in \({Y_{{V_0}}}\) and \({Y_{{W_0}}}\) as compare to Original sequence. This is illustrated by two filters followed by decimator as shown in figure 6. This structure is called as Analysis filter bank, as it decomposes or analyses the original sequence in to its components.

These filters are then described in terms of their system functions rather in time domain, for this purpose the Z-transform is used, as the Z-Transform is a linear operator i.e. the linear combination of sequences results in same linear combination in Z-domain, with region of convergence are at least intersection of region of convergence of individual sequences, which are linearly combined. The Z-transform of first filter (LPF) is given below,

\(b(n)=\frac{1}{2}\left[ {a(n)+a(n - 1)} \right]\)

Taking Z-transform on both side

\(\begin{gathered} B(Z)=\frac{1}{2}\left[ {A(Z)+{Z^{ - 1}}A(Z)} \right] \hfill \\ \hfill \\ \end{gathered}\)

System Function=\({H_{Low}}(Z)=\frac{{B(Z)}}{{A(Z)}}=\frac{1}{2}\left[ {1+{Z^{ - 1}}} \right]\)

Similarly, the system function for second filter (HPF) \({H_{High}}(Z)\)is given below,

\(\begin{gathered} {H_{High}}(Z)={Z^{ - 1}}{H_{Low}}( - {Z^{ - 1}}) \hfill \\ {H_{High}}(Z)=\frac{1}{2}{Z^{ - 1}}(1 - Z) \hfill \\ {H_{High}}(Z)=\frac{1}{2}( - 1+{Z^{ - 1}}) \hfill \\ \hfill \\ \end{gathered}\)

In general the factor ½ is not considered while analyzing, as it is compensated by selecting the multiplying factor of 2 at synthesis side. The Analysis Filter Bank in terms of its system function is as shown in figure 7.

This is represented in terms signal flow graph as given below figure 8.

Thus the complete image can be decomposed by using above homogeneous repetitive structure called a lattice. This lattice structure is analyzed using poly phase matrix so as to get the lifting structure as given in figure 9.

Thus the crisscross operation in figure 8 is broken in to the two cascade operations shown in figure 9. This lifting structure improves the speed of DWT computations, but it involves multiplication operations. Shahadi et al. [17] proposes multiplier free efficient & fast lifting wavelet transform as shown in figure 10.

As shown in figure 10 (a), the analysis filter bank involves breaking the sequence in to the odd & even sequences followed by subtraction & addition operations. This modified structure replaces the repetitive analysis filter bank in figure 7, so as obtain the different sub bands, with much more faster rate as compare to conventional convolution based DWT.

Analyzing the whole image in to four sub bands doesn’t means compressing the image. For still image compression, it must be seen that how this individual sub bands as; LL, LH, HL & HH are individually compressed. Now as discussed earlier images are rich in low frequency region i.e. LL sub band contain more information and other sub bands will be having a sparsity of information, hence the high frequency coefficients can be quantized more severely or more coarsely as compared to low frequency coefficients, just like DCT but in DCT truncation of high frequency coefficients is done by exploiting psycho-visual redundancy. For effective compression, it is very essential to exploit the inter-relationship between the different sub bands. Consider the first level partitioning of ‘8 X 8’ pixel array, so as to have each sub band of size, ‘4 X 4’ pixel. Now consider the pixel at coordinate position (1,1) in LL sub band, this pixel will have some relation with the pixels presents in high frequency sub bands at same coordinate position (1,1); as shown in below figure 11.

In two level of partitioning, each pixel in HL₂, LH₂& HH₂ sub bands corresponds to four pixels in HL₁, LH₁& HH₁ respectively, with each pixel in LL₂ sub band corresponds to a pixel in HL₂, LH₂& HH₂., as shown in figure 12.

Now in this form, what emerges is a kind of data structure representation, with one pixel in LL₂ sub band as a starting point with corresponding pixel position in LH₂, HL₂ & HH₂; as its descendants. Individually each of these three pixels in LH₂, HL₂ & HH₂ sub bands will have four descendants in LH₁, HL₁ &HH₁ sub bands respectively. If the LL₂ sub band is partitioned further then the generalized data structure evolved in the form of a tree. Brahimi et al. [18] proposes improved embedded zero tree wavelet (EZW) technique for mage coding. This EZW technique achieves compression by exploiting redundancy between the coefficients, which are there at same pixel location but in the successive sub bands. In order to understand this EZW technique, consider the three level of decomposition, in this decomposition first of all the coefficient from top most LL₃ sub band is picked up, then the data structure tree is formed with HL₃, LH₃& HH₃ coefficients in same resolution; as a descendants. This sequence of picking the coefficients is repeated in finer and finer sub bands., as shown in figure 13.

As shown in figure 13, the coefficients at the same spatial position are picked-up at different resolutions. The biggest advantage of traversing in this manner is that; as most of the coefficients are highly significant in low frequency sub bands, that means in LL sub band almost all the coefficients are highly significant and going towards finer & finer resolution that too in the high frequency sub bands, the coefficients will become lesser & lesser significant. The significant coefficients are represented as a binary ‘1’ & insignificant coefficients are represented as binary ‘0’. In EZW technique, while traversing wavelet coefficients as depicted in figure 13, then if at particular position any coefficient found to be zero along with all of its descendants also to be zero, then there is no point in encoding all the coming descendants as beyond this everything is going to be zero, that is called as zero tree. The zero tree is a part of a tree structure, where all the elements starting with some intermediate level of parent node will have its own value as zero and all subsequent descendants to it, as zero. The Embedded coding deals with generating the bit stream in prioritized manner that means more significant coefficients are transmitted first followed by lesser significant coefficients. This embedded encoding performs very well when the image is transmitted over dynamic bandwidth of internet. The beauty of this scheme is that, if the available bandwidth is more, then all the higher & lesser priority coefficients are transmitted for the best quality. In case of limited bandwidth, after passing priority coefficients and if the bit budget is exhausted & don’t have bandwidth then bit stream is truncated; this results a reasonably good quality image, it would not be best quality image.

The significance of any coefficient (x) is decided as given below,

\(\begin{gathered} \left| X \right|<T=Insignficant \hfill \\ \left| X \right| \geqslant T=Significant \hfill \\ \end{gathered}\)

Where, ‘T’ is a threshold value

The EZW is a multi-pass algorithm, initially the threshold value is kept high, so the significance & insignificance is decided in more harsh way, but with every successive pass the value of threshold is halved. This ensures that, at the end of first pass only few very significant coefficients are encoded; later followed by next pass, if the bit budget permits; so as to encode lesser significant coefficients. If still the bit budget permits, then more & more passes are undertaken. The technique of successive approximation is used for encoding the significant coefficients. Although the EZW is a very standard technique for wavelet encoding, but there are certain limitations with it as; this technique deals with splitting the LL sub bands only & do not consider other sub bands splitting, another limitation is that, it is exploiting redundancy which is present at a particular spatial position but across different scale, but it is not exploiting the redundancy that exists among neighborhood coefficients of the same sub band. This issue associated with EZW is solved using Set partitioning in Hierarchical tree (SPIHT) algorithm [19].

2.2 Modified SPIHT Algorithm

SPIHT algorithm organizes the partitioned image in the form of data structure trees called as spatial orientation trees. This tree(for three level partitioned image) is organized in such a way that, top most LL₃ sub band is picked up as a route node, then the data structure tree is formed with HL₃, LH₃& HH₃ coefficients in same resolution; as a descendants. This sequence of picking the coefficients is repeated in finer and finer sub bands in such a way that one coefficient in HL₃, LH₃& HH₃ sub-bands corresponds to four coefficients in HL₂, LH₂& HH₂ sub bands and ultimately one coefficient in HL₃, LH₃& HH₃ corresponds to 16 coefficients inHL₁, LH₁& HH₁ sub bands, as shown in figure 14.

The SPIHT algorithm includes three sets as given,

\(P(j,k)\) = Set of all descendants of coefficient\((j,k)\), this set is also known as Type A set.
\(Q(j,k)\) = Set of all direct child of coefficient\((j,k)\), this set is also known as Type B set.
\(R(j,k)\) = Set of all descendants except direct child of coefficient\((j,k)\).

In SPIHT algorithm, each wavelet coefficient is categorized in to the significant & insignificant coefficient. The significance of any coefficient (x) is decided as given below,

\(\begin{gathered} \left| X \right|<T=Insignficant \hfill \\ \left| X \right| \geqslant T=Significant \hfill \\ \end{gathered}\)

With, \(T={2^n}\)

\(n=\left[ {lo{g_2}{C_{max}}} \right]\) ; \({C_{max}}\)= Value of coefficient with maximum value.

The SPIHT is a multi-pass algorithm and with every sub sequent pass, the value of threshold is reduced by decrementing the value of by 1. The significant coefficients are represented by logic ‘1’, whereas the insignificant coefficients are represented by logic ‘0’. In general the significant coefficients are kept in a set called as list of significant pixel (LSP) and insignificant pixels are placed in set called as list of insignificant pixels (LIP). Apart from these two sets, one more set named list of insignificant set (LIS) is used to place the set (Sub bands) which do not contain any significant coefficient. Initially the set LIP is loaded with coefficients in highest level, set LIS is loaded with set having descendants & set LSP is kept empty. SPIHT encoding begins with MSB plane, for every bit plane, it checks all the three lists as LIP, LIS & LSP. If any coefficient in LIP becomes significant, then it is transferred to LSP. Ultimately all the coefficients in LSP (excluding the coefficients becomes significant in last round) is refined in each sub sequent round by successive approximation.

As discussed in conventional algorithm for SPIHT, if a type A set (Set of all descendants) is significant, but with corresponding set of offspring is insignificant coefficients; in such a case SPIHT algorithm puts four continuous zero in bit stream; this results in larger bit stream. This problem is solved as,

When Type A set is significant set, with set of offspring coefficient is insignificant,

Then logic ‘1’ is used to represent set significant & logic ‘0’ is used to represent offspring insignificant. So this saves 3 bits.

When Type A set is significant set, with set of offspring coefficient is significant,

Then logic ‘1’ is used to represent set significant& logic ‘1’ is used to represent offspring significant. So it takes one additional bit.

The proposed architecture is developed with Verilog language and simulated with modelsim simulator and synthesized with XST(VHDL/Verilog) tool of Xilinx software under Virtex6-XC6VLX240TD environment. The performance of proposed system is tested with several images. All these test images are applied to proposed image compression module, so as to get the compressed images. Qualitative outcome of proposed system with few standard test images, with the corresponding bits per pixel (bpp) values is depicted in below given Table 1.

The result obtained for all the test images in terms of PNR (Peak Signal to Noise Ratio), bits per pixel and compression ratio is enclosed in table 2 below;

Table 2: Compression/Decompression Result

Sr. No.	Image	image Size	PSNR (dB)	bpp (compressed Image)	Compression Ratio
1	Image (a) (Leena)	256 X 256 X 8	30.75	0.3296	24.27
2	Image (b) (Boat)	256 X 256 X 8	28.09	0.3282	24.37
3	Image (c) (Pepper)	256 X 256 X 8	28.99	0.3346	23.90
4	Image (d) (Barbara)	256 X 256 X 8	29.80	0.3234	24.73
Average			29.40	0.3390	24.31

The compression/decompression results obtained with proposed module is compared with earlier techniques in terms of PSNR & compression ratio is enclosed in table 3 as below;

Table 3: Comparative Analysis with earlier compression techniques

Sr. No.	Reference	Image size	PSNR(dB)	Compression Ratio
1	Vidya et al. [20]	128 X 128	28.08 (Leena)	4
2	Erra [21]	256 X 256	-	5
3	Jridi et al. [22]	256 X 256	25.38 (leena)	bpp = 0.64 CR = 8/0.64 = 12.5
4	Huang et al. [23]	256 X 256	31.19	16
5	Panigrahy et al. [24]	256 X 256	32..7 (Leena)	5.12
6	Haque et al. [25]	1024 X 1024	--	4.1
7	Jackson et al [26]	512 X 512	33.1 (Leena)	12.8
8	Saad et al [27]	256 X 256	30.7 (Leena)	5.82
9	Chan et al. [19]	256 X 256	27.42 (Barbara) 31.22 (Barbara) Avg. = 29.32	bpp = 0.25 CR = 8/0.25 = 32 bpp = 0.5 CR = 8/0.25 = 16 Avg. CR= 24
10	Proposed	256 X 256	29.40(Avg.) 30.75 (Leena) 29.80 (Barbara)	24.31 24.27 (24.73)

Pictorial representation for comparative analysis of compression ratio & PSNR is given in figure 15and figure 16 respectively

The comparative analysis of proposed image compression technique with earlier techniques shows that the proposed technique gives best optimal performance for PSNR and Compression ratio trade-off.

This work focused on development of image compression scheme using image transformation based on modified lifting wavelet structure followed by quantization based on modified set partition in hierarchical tree (SPIHT) algorithm. The modified lifting wavelet structure includes simple addition & subtraction operations. The wavelet coefficients obtained from modified lifting wavelet transform are encoded by improved SPIHT algorithm. This Proposed SPIHT algorithm checks the significance and insignificance of set of the offspring for any given node before checking significance and insignificance of individual offspring of the said node. The comparative analysis of proposed image compression technique with existing techniques shows that, the proposed system gives improved performance in terms of PSNR & compression ratio.

5.1 Funding

Not Applicable.

5.2 Conflict of Interest/Competing Interest

There is no conflict in manuscript.

5.3 Availability of data and materials

The Datasets generated during and/or analysed during the current study are available from corresponding author on reasonable request.

5.4 Code of availability

Software Application.

5.5. Authors Contribution

Not Applicable.

Shelke, S. K., & Patel, G. S. (2020). Low Power High Frequency Implementation of Image Filtering using Improved Median Filtering. International Journal of Advanced Science and Technology, 29(4s), 1833-1843.
Shelke, S. K., Sinha, S. K., & Patel, G. S. (2021). DEVELOPMENT OF IMPROVED SPEED AES BASED ON KEY DEPENDENT INTRA-ROUND OPERATIONS. Journal of Theoretical and Applied Information Technology, 99(13).
Shelke, S. K., Sinha, S. K., & Patel, G. S. (2021). Study of End to End Image Processing System Including Image De-noising, Image Compression & Image Security. Wireless Personal Communications, 1-12.
Xue, S., &Oelmann, B. (2006). Unary prefixed Huffman coding for a group of quantized generalized Gaussian sources. IEEE transactions on communications, 54(7), 1164-1169.
Marpe, D., Schwarz, H., &Wiegand, T. (2003). Context-based adaptive binary arithmetic coding in the H. 264/AVC video compression standard. IEEE Transactions on circuits and systems for video technology, 13(7), 620-636.
Aboy, M., Hornero, R., Abásolo, D., &Álvarez, D. (2006). Interpretation of the Lempel-Ziv complexity measure in the context of biomedical signal analysis. IEEE transactions on biomedical engineering, 53(11), 2282-2288.
Subudhiray, S., &Srivastav, A. K. (2012). Implementation of hybrid DWT-DCT algorithm for image compression: a review. International Journal of Research in Engineering and Applied Sciences, 2(2), 1200-1210.
Kim, S., Lee, D., Kim, J. S., & Lee, H. J. (2016). A high-throughput hardware design of a one-dimensional SPIHT algorithm. IEEE transactions on multimedia, 18(3), 392-404.
Yng, T. L. B., Lee, B. G., &Yoo, H. (2008). A low complexity and lossless frame memory compression for display devices. IEEE transactions on consumer electronics, 54(3), 1453-1458.
Jin, Y., & Lee, H. J. (2012). A block-based pass-parallel SPIHT algorithm. IEEE transactions on circuits and systems for video technology, 22(7), 1064-1075.
Wallace, G. K. (1992). The JPEG still picture compression standard. IEEE transactions on consumer electronics, 38(1), xviii-xxxiv.
Christopoulos, C., Skodras, A., &Ebrahimi, T. (2000). The JPEG2000 still image coding system: an overview. IEEE transactions on consumer electronics, 46(4), 1103-1127.
Hudson, G., Léger, A., Niss, B., &Sebestyén, I. (2017). JPEG at 25: Still going strong. IEEE MultiMedia, 24(2), 96-103.
Chang, H., Ng, M. K., &Zeng, T. (2013). Reducing artifacts in JPEG decompression via a learned dictionary. IEEE transactions on signal processing, 62(3), 718-728.
Si-xian, W., Fei-peng, L., Fa-long, Z., Meng-yang, L., &Hui, L. (1999). Medical image compression by applying 3D DWT in PACS. Wuhan University Journal of Natural Sciences, 4(3), 313-318.
Tian, X., Wu, L., Tan, Y. H., &Tian, J. W. (2011). Efficient multi-input/multi-output VLSI architecture for two-dimensional lifting-based discrete wavelet transform. IEEE transactions on computers, 60(8), 1207-1211.
Shahadi, H. I., Jidin, R., Way, W. H., & Abbas, Y. A. (2014). Efficient FPGA architecture for dual mode integer haar lifting wavelet transform core. Journal of Applied Sciences, 14(5), 436.
Brahimi, T., Laouir, F., Boubchir, L., & Alii-Chérif, A. (2017). An improved wavelet-based image coder for embedded greyscale and colour image compression. AEU-International Journal of Electronics and Communications, 73, 183-192.
Li, Q., Chen, D., Jiang, W., Liu, B., & Gong, J. (2016). Generalization of SPIHT: Set partition coding system. IEEE transactions on image processing, 25(2), 713-725.
Vidya, D., Parthasarathy, R., Bina, T. C., & Swaroopa, N. G. (2000). Architecture for fractal image compression. Journal of systems architecture, 46(14), 1275-1291.
Erra, U. (2005, December). Toward real time fractal image compression using graphics hardware. In International Symposium on Visual Computing (pp. 723-728). Springer, Berlin, Heidelberg.
Jridi, M., Alfalou, A., & Kumar Meher, P. (2012). Optimized Architecture Using a Novel Subexpression Elimination on Loeffler Algorithm for DCT-Based Image Compression. VLSI Design.
Huang, Z., Zhang, X., Chen, L., Zhu, Y., An, F., Wang, H., & Feng, S. (2017). A vector-quantization compression circuit with on-chip learning ability for high-speed image sensor. IEEE Access, 5, 22132-22143.
Panigrahy, M., Chakrabarti, I., & Dhar, A. S. (2016). Low-delay parallel architecture for fractal image compression. Circuits, Systems, and Signal Processing, 35(3), 897-917.
Haque, M., Kaisan, A. A., Saniat, M. R., & Rahman, A. (2014). GPU accelerated fractal image compression for medical imaging in parallel computing platform. arXiv preprint arXiv:1404.0774.
Jackson, D. J., Ren, H., Wu, X., & Ricks, K. G. (2007). A hardware architecture for real-time image compression using a searchless fractal image coding method. Journal of Real-Time Image Processing, 1(3), 225-237.
Saad, A. H., & Abdullah, M. Z. (2016). High-speed implementation of fractal image compression in low cost FPGA. Microprocessors and Microsystems, 47, 429-440

Table 1 is available in the Supplementary Files section

Table1.docx

Download PDF

Reviewers invited by journal
15 Jun, 2022
First submitted to journal
09 Sep, 2021

You are reading this latest preprint version

Development of Wavelet Based Image Compression Approach Using Modified SPIHT Algorithm

Status:

Version 1

Abstract

Figures

1. Introduction

2. Proposed System

2.1 Modified Lifting Wavelet Structure

2.2 Modified SPIHT Algorithm

3 Result And Discussion

4 Conclusion

5 Declarations

References

Table

Supplementary Files

Status:

Version 1