Show simple item record

dc.contributor.authorTsapatsoulis, Nicolasen
dc.contributor.authorRapantzikos, K.en
dc.contributor.authorPattichis, Constantinos S.en
dc.creatorTsapatsoulis, Nicolasen
dc.creatorRapantzikos, K.en
dc.creatorPattichis, Constantinos S.en
dc.description.abstractIn this paper we propose a novel saliency-based computational model for visual attention. This model processes both top-down (goal directed) and bottom-up information. Processing in the top-down channel creates the so called skin conspicuity map and emulates the visual search for human faces performed by humans. This is clearly a goal directed task but is generic enough to be context independent. Processing in the bottom-up information channel follows the principles set by Itti et al. but it deviates from them by computing the orientation, intensity and color conspicuity maps within a unified multi-resolution framework based on wavelet subband analysis. In particular, we apply a wavelet based approach for efficient computation of the topographic feature maps. Given that wavelets and multiresolution theory are naturally connected the usage of wavelet decomposition for mimicking the center surround process in humans is an obvious choice. However, our implementation goes further. We utilize the wavelet decomposition for inline computation of the features (such as orientation angles) that are used to create the topographic feature maps. The bottom-up topographic feature maps and the top-down skin conspicuity map are then combined through a sigmoid function to produce the final saliency map. A prototype of the proposed model was realized through the TMDSDMK642-0E DSP platform as an embedded system allowing real-time operation. For evaluation purposes, in terms of perceived visual quality and video compression improvement, a ROI-based video compression setup was followed. Extended experiments concerning both MPEG-I as well as low bit-rate MPEG-4 video encoding were conducted showing significant improvement in video compression efficiency without perceived deterioration in visual quality. © World Scientific Publishing Company.en
dc.sourceInternational journal of neural systemsen
dc.subjectpsychological modelen
dc.subjectautomated pattern recognitionen
dc.subjectPattern Recognition, Automateden
dc.subjectImage codingen
dc.subjectComputation theoryen
dc.subjectartificial neural networken
dc.subjectNeural Networks (Computer)en
dc.subjectVideo signal processingen
dc.subjectWavelet analysisen
dc.subjectreaction timeen
dc.subjectVideo Recordingen
dc.subjectWavelet decompositionen
dc.subjectImage qualityen
dc.subjectImage compressionen
dc.subjectpattern recognitionen
dc.subjectEmbedded implementationen
dc.subjectModels, Psychologicalen
dc.subjectPattern Recognition, Visualen
dc.subjectPhotic Stimulationen
dc.subjectROI-based video encodingen
dc.subjectVisual attention modelen
dc.titleAn embedded saliency map estimator scheme: Application to video encodingen
dc.description.endingpage304 Σχολή Θετικών και Εφαρμοσμένων Επιστημών / Faculty of Pure and Applied SciencesΤμήμα Πληροφορικής / Department of Computer Science
dc.description.notes<p>Cited By :18</p>en
dc.source.abbreviationInt.J.Neural Syst.en
dc.contributor.orcidPattichis, Constantinos S. [0000-0003-1271-8151]
dc.contributor.orcidTsapatsoulis, Nicolas [0000-0002-6739-8602]

Files in this item


There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record