Multi-resolution pre-processing for pattern recognition in images and audio signals
MetadataShow full item record
With the rapid growth of technology and the proliferation of data in this “digital age”, most current data (imaging and audio) applications require higher data resolution, higher data transmission rates, and better compression techniques to meet the ever increasing demands placed on them. In the case of data compression, an ideal system must produce good quality of data with high compression ratios while minimising the computational overhead. In other words, bandwidth cost money, therefore, the transmission and storage of information become costly. However, if we can use less data, both transmission and storage become cheaper. The Haar Wavelet Transform (HWT) has been chosen, here, to fulfil the data compression requirement of this research. The HWT is a mathematical tool that is ideally suited for hierarchically decomposing image information. For example, it is the preferred method by the JPEG format to produce encoded images at higher compression ratios. In addition, it can also be used to remove irrelevant/redundant image data to facilitate the recording, transmission, and searching of data effectively. HWT based image compression is computationally efficient and is symmetric in nature (both the forward and the inverse transforms have the same complexity) allowing for the building of fast compression and decompression routines. The aim of this research is to determine optimum compression rates for data using Machine Learning methods; typically, Artificial Neural Network (ANN). The MNIST dataset images and the TI46 database audio are subject to a HWT transformation and a signature of the information is constructed by removing irrelevant/redundant data by means of a cut-off function. Multiple signatures for images and audio are built for differing cut-offs are then input into a neural network. Their performances are then compared to find the optimum HWT based compression. Which, it has made the input vectors of the network much more compact and thus easier to establish relationships between them, leading to a rapid and effective classification of the signal.
The following license files are associated with this item: