ICASSP99 Image and Video Technologies

Image and Video Technologies
Home Full List of Titles 1: Speech Processing CELP Coding Large Vocabulary Recognition Speech Analysis and Enhancement Acoustic Modeling I ASR Systems and Applications Topics in Speech Coding Speech Analysis Low Bit Rate Speech Coding I Robust Speech Recognition in Noisy Environments Speaker Recognition Acoustic Modeling II Speech Production and Synthesis Feature Extraction Robust Speech Recognition and Adaptation Low Bit Rate Speech Coding II Speech Understanding Language Modeling I 2: Speech Processing, Audio and Electroacoustics, and Neural Networks Acoustic Modeling III Lexical Issues/Search Speech Understanding and Systems Speech Analysis and Quantization Utterance Verification/Acoustic Modeling Language Modeling II Adaptation /Normalization Speech Enhancement Topics in Speaker and Language Recognition Echo Cancellation and Noise Control Coding Auditory Modeling, Hearing Aids and Applications of Signal Processing to Audio and Acoustics Spatial Audio Music Applications Application - Pattern Recognition & Speech Processing Theory & Neural Architecture Signal Separation Application - Image & Nonlinear Signal Processing 3: Signal Processing Theory & Methods I Filter Design and Structures Detection Wavelets Adaptive Filtering: Applications and Implementation Nonlinear Signals and Systems Time/Frequency and Time/Scale Analysis Signal Modeling and Representation Filterbank and Wavelet Applications Source and Signal Separation Filterbanks Emerging Applications and Fast Algorithms Frequency and Phase Estimation Spectral Analysis and Higher Order Statistics Signal Reconstruction Adaptive Filter Analysis Transforms and Statistical Estimation Markov and Bayesian Estimation and Classification 4: Signal Processing Theory & Methods II, Design and Implementation of Signal Processing Systems, Special Sessions, and Industry Technology Tracks System Identification, Equalization, and Noise Suppression Parameter Estimation Adaptive Filters: Algorithms and Performance DSP Development Tools VLSI Building Blocks DSP Architectures DSP System Design Education Recent Advances in Sampling Theory and Applications Steganography: Information Embedding, Digital Watermarking, and Data Hiding Speech Under Stress Physics-Based Signal Processing DSP Chips, Architectures and Implementations DSP Tools and Rapid Prototyping Communication Technologies Image and Video Technologies Automotive Applications / Industrial Signal Processing Speech and Audio Technologies Defense and Security Applications Biomedical Applications Voice and Media Processing Adaptive Interference Cancellation 5: Communications, Sensor Array and Multichannel Source Coding and Compression Compression and Modulation Channel Estimation and Equalization Blind Multiuser Communications Signal Processing for Communications I CDMA and Space-Time Processing Time-Varying Channels and Self-Recovering Receivers Signal Processing for Communications II Blind CDMA and Multi-Channel Equalization Multicarrier Communications Detection, Classification, Localization, and Tracking Radar and Sonar Signal Processing Array Processing: Direction Finding Array Processing Applications I Blind Identification, Separation, and Equalization Antenna Arrays for Communications Array Processing Applications II 6: Multimedia Signal Processing, Image and Multidimensional Signal Processing, Digital Signal Processing Education Multimedia Analysis and Retrieval Audio and Video Processing for Multimedia Applications Advanced Techniques in Multimedia Video Compression and Processing Image Coding Transform Techniques Restoration and Estimation Image Analysis Object Identification and Tracking Motion Estimation Medical Imaging Image and Multidimensional Signal Processing Applications I Segmentation Image and Multidimensional Signal Processing Applications II Facial Recognition and Analysis Digital Signal Processing Education Author Index A B C D E F G H I J K L M N O P Q R S T U V W X Y Z	Architectures For Generalized 2D FIR Filtering Using Separable Filter Structures Authors: Michael S Andrews, Page (NA) Paper number 2482 Abstract: The problem of generalized 2D FIR filtering for large filter kernel sizes can be computationally prohibitive when required in real-time, such as with video applications. In this paper, we describe architectures and design methods for generalized 2D FIR filtering employing LOGIC Devices' LF33xx (HV Filter) family of devices. The LF33xx family of devices is designed to perform dimensionally separate filtering (row/column) along orthogonal axes (horizontal direction and vertical direction for video applications). Additionally, we will briefly review 2D filter design and kernel separability theory. IC992482.PDF (From Author) IC992482.PDF (Rasterized) TOP Segmentation of Non-Rigid Bodies in Affine Motion : A New Framework Authors: Sangeeta Narang, IBM - Solutions Research Center, New Delhi, INDIA (India) Naresh Kumar Narang, All India Radio, New Delhi (India) Kanad K Biswas, Indian Institute of Technology, New Delhi, INDIA (India) Page (NA) Paper number 1079 Abstract: In this paper a new technique for partitioning a non-rigid body (human body), when it is in affine motion is presented. The technique is based on finding the contour in the "modified difference frame". These contours can then be used in identifying the regions of interest. Results show that the technique eliminates a lot of stationary regions and thus can reduce the amount of computational time required for the processing of the image. It reduces significantly the total bit rate required for transmission. It also provides an algorithm for contour tracing using extended boundary concept and contour compression by contour merging. IC991079.PDF (From Author) IC991079.PDF (Rasterized) TOP Wireless MPEG-4 Video on Texas Instruments DSP Chips Authors: Madhukar Budagavi, Wendi Rabiner, Jennifer Webb, Raj Talluri, Page (NA) Paper number 1704 Abstract: Technology has advanced in recent years to the point where multimedia communicators are beginning to emerge. These communicators are low-power, portable devices that can transmit and receive multimedia data through the wireless network. Due to the high computational complexity involved and the low-power constraint in wireless applications, these devices require the use of processors that are powerful and are at the same time very power-efficient. In order to facilitate interoperability, it is important that these devices use standardized compression and communication algorithms. As a first step in implementing multimedia terminals, Texas Instruments (TI) has demonstrated real-time MPEG-4 video decoding (simple profile) on a TMS320C54x, TI's low power, high performance DSP chip. In addition, TI has outlined a system-level solution to transmitting video across wireless networks, including channel coding and communication protocols. IC991704.PDF (From Author) IC991704.PDF (Rasterized) TOP Joint MPEG-2 Coding For Multi-Program Broadcasting Of Per-Recorded Video Authors: Irene Koo, Panos Nasiopoulos, Rabab K Ward, Page (NA) Paper number 2365 Abstract: We developed a cost-effective operational system suitable for digital TV, video on demand, and high definition TV broadcast over satellite networks with limited bandwidth. This MPEG-2 based system is easy to implement and allows the joint video coding of multiple video programs. Compared to present broadcast operation and for the same level of picture quality, our system greatly increases the number of video streams transmitted in each channel. As a result, either a large number of transponders can be freed up to carry real-time broadcasting or the level of the transmitted picture quality can be significantly increased. By switching from tape storage to video server technology, the need for numerous (expensive) playback VTR systems at the headend is eliminated. In addition, the majority of the complete MPEG-2 encoders are replaced by much less complex MPEG-2 transcoders. All this means significant savings for the broadcast stations. In addition to the gain in bandwidth and the reduction in cost, our system speeds up the encoding process by six folds. IC992365.PDF (From Author) IC992365.PDF (Rasterized) TOP FPGA Implementation of Adaptive Temporal Kalman Filter for Real Time Video Filtering Authors: Robert D Turney, Ali M Reza, Department of Electrical Engineering and Computer Science, University of Wisconsin-Milwaukee (U.K.) Justin G.R. Delva, Department of Electrical Engineering and Computer Science, University of Wisconsin-Milwaukee (U.K.) Page (NA) Paper number 2103 Abstract: Filtering noise in real-time image sequences is required in some applications like medical imaging. The optimum approach in this case is in the form of adaptive 3-D spatial- temporal filter, which is generally very complex and prohibitive for real-time implementation. Independent processing of the image sequences, in spatial and temporal domains can resolve some of these implementation difficulties. Some of the existing spatial filters can easily be modified for real-time implementation. Adaptive temporal filters, however, are more involved. In this paper, an adaptive temporal filter is proposed that lend itself to hardware implementation for real-time temporal processing of image sequences. The proposed algorithm is based on adaptive Kalman filtering which is relatively simple and effective in its performance. Adaptation in this case is with respect to motion in the image sequence as well as variation of noise statistics. An efficient hardware implementation of this algorithm, based on FPGA technology, is proposed. IC992103.PDF (From Author) IC992103.PDF (Rasterized) TOP Programmable DSP Platform for Digital Still Cameras Authors: Klaus Illgner, Hans-Georg Gruber, Pedro R Gelabert, Jie Liang, Youngjun Yoo, Wissam Rabadi, Raj Talluri, Page (NA) Paper number 2112 Abstract: This paper presents a programmable DSP platform for Digital Still Cameras based on the Texas Instruments TMS320C54x family. One major advantage of this platform is that, after capturing an image from a CCD sensor, processing the raw image, and compressing the image for storage is performed on the Digital Signal Processor (DSP). This provides a short shot-to-shot delay and a high degree of flexibility. The system realized also allows instant viewing and selective storing of captured images. This paper outlines the various processing stages necessary to take the raw CCD data and produce a JPEG compressed bit stream and highlighting the advantages of DSPs for this application. The programmable nature of this platform allows for the exploration of different image processing and compression techniques. The low power nature of the digital signal processor provides long battery life. IC992112.PDF (From Author) IC992112.PDF (Rasterized) TOP A Robust Sequential Approach for the Detection of Defective Pixels in an Image Sensor Authors: Yap-Peng Tan, Tinku Acharya, Page (NA) Paper number 3013 Abstract: Large image sensors usually contain some defects. Defects are pixels with abnormal photo-responsibility. As a result they often generate outputs different from their adjacent pixel outputs and seriously degrade the visual quality of the captured images. However, it is not economically feasible to produce sensors with no defects for rendering images. A limited number of defects are usually allowed in an image sensor as long as the defective outputs can be corrected with post signal processing techniques. In this paper we present a robust sequential approach for detecting sensor defects from a sequence of images captured by the sensor. With this approach no extra non-volatile memory is required in the sensor device to store the locations of sensor defects. In addition, the detection and correction of image defective outputs can be performed efficiently in a computer host. Experimental results of this approach are reported in the paper. IC993013.PDF (From Author) IC993013.PDF (Rasterized) TOP Adaptive Coding of Hyperspectral Imagery Authors: Glen P Abousleman, Page (NA) Paper number 3021 Abstract: Two systems are presented for compression of hyperspectral imagery. These systems utilize adaptive classification, trellis-coded quantization, and optimal rate allocation. In the first system, DPCM is used for spectral decorrelation, while an adaptive wavelet-based coding scheme is used for spatial decorrelation. The second system uses DPCM in conjunction with an adaptive DCT-based coding scheme. In each system, entropy-constrained trellis-coded quantization (ECTCQ) is used to quantize the transform coefficients. Entropy-constrained codebooks are designed for generalized Gaussian distributions by using a modified version of the generalized Lloyd algorithm. The wavelet-based system compresses an AVIRIS hyperspectral test sequence at 0.118 bits/pixel/band, while retaining an average peak signal-to-noise ratio (PSNR) of 41.24 dB. The DCT-based system achieves the same bit rate with an average PSNR of 40.72 dB. IC993021.PDF (From Author) IC993021.PDF (Rasterized) TOP Temporally Scalable Motion Compensated Adaptive Temporal Subband Coding of Video Authors: Lowell L Winger, Anastasios N Venetsanopoulos, Page (NA) Paper number 1014 Abstract: It has been asserted that temporal subband coding (TSB) is inferior to predictive coding for regionally motion compensated (e.g. block-based MC) temporally scalable compressed video (citation deleted). There are two major disadvantages of TSB coding: temporal filtering distortions, and `open-loop' predictive coding of covered and uncovered regions. The `open-loop' structure of TSB coding, however, affords two major advantages not enjoyed by MCP coding: simple optimal bit-allocation, non-existence of quantization error feedback. A new adaptive temporal subband (TSB) motion compensated predictive (MCP) coder is proposed. Hierarchical variable-sized block-matched regions with low predictive error are TSB coded, while poorly predicted regions are `open-loop' MCP coded. Simulation results demonstrate that the adaptive coder substantially improves the temporal scalability of TSB coding, retains an advantageous `open-loop' structure and provides comparable or superior PSNR to both MCP and TSB coding at MPEG-1 quality bitrates. IC991014.PDF (From Author) IC991014.PDF (Rasterized) TOP