Image and Video Technologies

Home
Full List of Titles
1: Speech Processing
CELP Coding
Large Vocabulary Recognition
Speech Analysis and Enhancement
Acoustic Modeling I
ASR Systems and Applications
Topics in Speech Coding
Speech Analysis
Low Bit Rate Speech Coding I
Robust Speech Recognition in Noisy Environments
Speaker Recognition
Acoustic Modeling II
Speech Production and Synthesis
Feature Extraction
Robust Speech Recognition and Adaptation
Low Bit Rate Speech Coding II
Speech Understanding
Language Modeling I
2: Speech Processing, Audio and Electroacoustics, and Neural Networks
Acoustic Modeling III
Lexical Issues/Search
Speech Understanding and Systems
Speech Analysis and Quantization
Utterance Verification/Acoustic Modeling
Language Modeling II
Adaptation /Normalization
Speech Enhancement
Topics in Speaker and Language Recognition
Echo Cancellation and Noise Control
Coding
Auditory Modeling, Hearing Aids and Applications of Signal Processing to Audio and Acoustics
Spatial Audio
Music Applications
Application - Pattern Recognition & Speech Processing
Theory & Neural Architecture
Signal Separation
Application - Image & Nonlinear Signal Processing
3: Signal Processing Theory & Methods I
Filter Design and Structures
Detection
Wavelets
Adaptive Filtering: Applications and Implementation
Nonlinear Signals and Systems
Time/Frequency and Time/Scale Analysis
Signal Modeling and Representation
Filterbank and Wavelet Applications
Source and Signal Separation
Filterbanks
Emerging Applications and Fast Algorithms
Frequency and Phase Estimation
Spectral Analysis and Higher Order Statistics
Signal Reconstruction
Adaptive Filter Analysis
Transforms and Statistical Estimation
Markov and Bayesian Estimation and Classification
4: Signal Processing Theory & Methods II, Design and Implementation of Signal Processing Systems, Special Sessions, and Industry Technology Tracks
System Identification, Equalization, and Noise Suppression
Parameter Estimation
Adaptive Filters: Algorithms and Performance
DSP Development Tools
VLSI Building Blocks
DSP Architectures
DSP System Design
Education
Recent Advances in Sampling Theory and Applications
Steganography: Information Embedding, Digital Watermarking, and Data Hiding
Speech Under Stress
Physics-Based Signal Processing
DSP Chips, Architectures and Implementations
DSP Tools and Rapid Prototyping
Communication Technologies
Image and Video Technologies
Automotive Applications / Industrial Signal Processing
Speech and Audio Technologies
Defense and Security Applications
Biomedical Applications
Voice and Media Processing
Adaptive Interference Cancellation
5: Communications, Sensor Array and Multichannel
Source Coding and Compression
Compression and Modulation
Channel Estimation and Equalization
Blind Multiuser Communications
Signal Processing for Communications I
CDMA and Space-Time Processing
Time-Varying Channels and Self-Recovering Receivers
Signal Processing for Communications II
Blind CDMA and Multi-Channel Equalization
Multicarrier Communications
Detection, Classification, Localization, and Tracking
Radar and Sonar Signal Processing
Array Processing: Direction Finding
Array Processing Applications I
Blind Identification, Separation, and Equalization
Antenna Arrays for Communications
Array Processing Applications II
6: Multimedia Signal Processing, Image and Multidimensional Signal Processing, Digital Signal Processing Education
Multimedia Analysis and Retrieval
Audio and Video Processing for Multimedia Applications
Advanced Techniques in Multimedia
Video Compression and Processing
Image Coding
Transform Techniques
Restoration and Estimation
Image Analysis
Object Identification and Tracking
Motion Estimation
Medical Imaging
Image and Multidimensional Signal Processing Applications I
Segmentation
Image and Multidimensional Signal Processing Applications II
Facial Recognition and Analysis
Digital Signal Processing Education

Author Index
A B C D E F G H I
J K L M N O P Q R
S T U V W X Y Z

Architectures For Generalized 2D FIR Filtering Using Separable Filter Structures

Authors:

Michael S Andrews,

Page (NA) Paper number 2482

Abstract:

The problem of generalized 2D FIR filtering for large filter kernel sizes can be computationally prohibitive when required in real-time, such as with video applications. In this paper, we describe architectures and design methods for generalized 2D FIR filtering employing LOGIC Devices' LF33xx (HV Filter) family of devices. The LF33xx family of devices is designed to perform dimensionally separate filtering (row/column) along orthogonal axes (horizontal direction and vertical direction for video applications). Additionally, we will briefly review 2D filter design and kernel separability theory.

IC992482.PDF (From Author) IC992482.PDF (Rasterized)

TOP


Segmentation of Non-Rigid Bodies in Affine Motion : A New Framework

Authors:

Sangeeta Narang, IBM - Solutions Research Center, New Delhi, INDIA (India)
Naresh Kumar Narang, All India Radio, New Delhi (India)
Kanad K Biswas, Indian Institute of Technology, New Delhi, INDIA (India)

Page (NA) Paper number 1079

Abstract:

In this paper a new technique for partitioning a non-rigid body (human body), when it is in affine motion is presented. The technique is based on finding the contour in the "modified difference frame". These contours can then be used in identifying the regions of interest. Results show that the technique eliminates a lot of stationary regions and thus can reduce the amount of computational time required for the processing of the image. It reduces significantly the total bit rate required for transmission. It also provides an algorithm for contour tracing using extended boundary concept and contour compression by contour merging.

IC991079.PDF (From Author) IC991079.PDF (Rasterized)

TOP


Wireless MPEG-4 Video on Texas Instruments DSP Chips

Authors:

Madhukar Budagavi,
Wendi Rabiner,
Jennifer Webb,
Raj Talluri,

Page (NA) Paper number 1704

Abstract:

Technology has advanced in recent years to the point where multimedia communicators are beginning to emerge. These communicators are low-power, portable devices that can transmit and receive multimedia data through the wireless network. Due to the high computational complexity involved and the low-power constraint in wireless applications, these devices require the use of processors that are powerful and are at the same time very power-efficient. In order to facilitate interoperability, it is important that these devices use standardized compression and communication algorithms. As a first step in implementing multimedia terminals, Texas Instruments (TI) has demonstrated real-time MPEG-4 video decoding (simple profile) on a TMS320C54x, TI's low power, high performance DSP chip. In addition, TI has outlined a system-level solution to transmitting video across wireless networks, including channel coding and communication protocols.

IC991704.PDF (From Author) IC991704.PDF (Rasterized)

TOP


Joint MPEG-2 Coding For Multi-Program Broadcasting Of Per-Recorded Video

Authors:

Irene Koo,
Panos Nasiopoulos,
Rabab K Ward,

Page (NA) Paper number 2365

Abstract:

We developed a cost-effective operational system suitable for digital TV, video on demand, and high definition TV broadcast over satellite networks with limited bandwidth. This MPEG-2 based system is easy to implement and allows the joint video coding of multiple video programs. Compared to present broadcast operation and for the same level of picture quality, our system greatly increases the number of video streams transmitted in each channel. As a result, either a large number of transponders can be freed up to carry real-time broadcasting or the level of the transmitted picture quality can be significantly increased. By switching from tape storage to video server technology, the need for numerous (expensive) playback VTR systems at the headend is eliminated. In addition, the majority of the complete MPEG-2 encoders are replaced by much less complex MPEG-2 transcoders. All this means significant savings for the broadcast stations. In addition to the gain in bandwidth and the reduction in cost, our system speeds up the encoding process by six folds.

IC992365.PDF (From Author) IC992365.PDF (Rasterized)

TOP


FPGA Implementation of Adaptive Temporal Kalman Filter for Real Time Video Filtering

Authors:

Robert D Turney,
Ali M Reza, Department of Electrical Engineering and Computer Science, University of Wisconsin-Milwaukee (U.K.)
Justin G.R. Delva, Department of Electrical Engineering and Computer Science, University of Wisconsin-Milwaukee (U.K.)

Page (NA) Paper number 2103

Abstract:

Filtering noise in real-time image sequences is required in some applications like medical imaging. The optimum approach in this case is in the form of adaptive 3-D spatial- temporal filter, which is generally very complex and prohibitive for real-time implementation. Independent processing of the image sequences, in spatial and temporal domains can resolve some of these implementation difficulties. Some of the existing spatial filters can easily be modified for real-time implementation. Adaptive temporal filters, however, are more involved. In this paper, an adaptive temporal filter is proposed that lend itself to hardware implementation for real-time temporal processing of image sequences. The proposed algorithm is based on adaptive Kalman filtering which is relatively simple and effective in its performance. Adaptation in this case is with respect to motion in the image sequence as well as variation of noise statistics. An efficient hardware implementation of this algorithm, based on FPGA technology, is proposed.

IC992103.PDF (From Author) IC992103.PDF (Rasterized)

TOP


Programmable DSP Platform for Digital Still Cameras

Authors:

Klaus Illgner,
Hans-Georg Gruber,
Pedro R Gelabert,
Jie Liang,
Youngjun Yoo,
Wissam Rabadi,
Raj Talluri,

Page (NA) Paper number 2112

Abstract:

This paper presents a programmable DSP platform for Digital Still Cameras based on the Texas Instruments TMS320C54x family. One major advantage of this platform is that, after capturing an image from a CCD sensor, processing the raw image, and compressing the image for storage is performed on the Digital Signal Processor (DSP). This provides a short shot-to-shot delay and a high degree of flexibility. The system realized also allows instant viewing and selective storing of captured images. This paper outlines the various processing stages necessary to take the raw CCD data and produce a JPEG compressed bit stream and highlighting the advantages of DSPs for this application. The programmable nature of this platform allows for the exploration of different image processing and compression techniques. The low power nature of the digital signal processor provides long battery life.

IC992112.PDF (From Author) IC992112.PDF (Rasterized)

TOP


A Robust Sequential Approach for the Detection of Defective Pixels in an Image Sensor

Authors:

Yap-Peng Tan,
Tinku Acharya,

Page (NA) Paper number 3013

Abstract:

Large image sensors usually contain some defects. Defects are pixels with abnormal photo-responsibility. As a result they often generate outputs different from their adjacent pixel outputs and seriously degrade the visual quality of the captured images. However, it is not economically feasible to produce sensors with no defects for rendering images. A limited number of defects are usually allowed in an image sensor as long as the defective outputs can be corrected with post signal processing techniques. In this paper we present a robust sequential approach for detecting sensor defects from a sequence of images captured by the sensor. With this approach no extra non-volatile memory is required in the sensor device to store the locations of sensor defects. In addition, the detection and correction of image defective outputs can be performed efficiently in a computer host. Experimental results of this approach are reported in the paper.

IC993013.PDF (From Author) IC993013.PDF (Rasterized)

TOP


Adaptive Coding of Hyperspectral Imagery

Authors:

Glen P Abousleman,

Page (NA) Paper number 3021

Abstract:

Two systems are presented for compression of hyperspectral imagery. These systems utilize adaptive classification, trellis-coded quantization, and optimal rate allocation. In the first system, DPCM is used for spectral decorrelation, while an adaptive wavelet-based coding scheme is used for spatial decorrelation. The second system uses DPCM in conjunction with an adaptive DCT-based coding scheme. In each system, entropy-constrained trellis-coded quantization (ECTCQ) is used to quantize the transform coefficients. Entropy-constrained codebooks are designed for generalized Gaussian distributions by using a modified version of the generalized Lloyd algorithm. The wavelet-based system compresses an AVIRIS hyperspectral test sequence at 0.118 bits/pixel/band, while retaining an average peak signal-to-noise ratio (PSNR) of 41.24 dB. The DCT-based system achieves the same bit rate with an average PSNR of 40.72 dB.

IC993021.PDF (From Author) IC993021.PDF (Rasterized)

TOP


Temporally Scalable Motion Compensated Adaptive Temporal Subband Coding of Video

Authors:

Lowell L Winger,
Anastasios N Venetsanopoulos,

Page (NA) Paper number 1014

Abstract:

It has been asserted that temporal subband coding (TSB) is inferior to predictive coding for regionally motion compensated (e.g. block-based MC) temporally scalable compressed video (citation deleted). There are two major disadvantages of TSB coding: temporal filtering distortions, and `open-loop' predictive coding of covered and uncovered regions. The `open-loop' structure of TSB coding, however, affords two major advantages not enjoyed by MCP coding: simple optimal bit-allocation, non-existence of quantization error feedback. A new adaptive temporal subband (TSB) motion compensated predictive (MCP) coder is proposed. Hierarchical variable-sized block-matched regions with low predictive error are TSB coded, while poorly predicted regions are `open-loop' MCP coded. Simulation results demonstrate that the adaptive coder substantially improves the temporal scalability of TSB coding, retains an advantageous `open-loop' structure and provides comparable or superior PSNR to both MCP and TSB coding at MPEG-1 quality bitrates.

IC991014.PDF (From Author) IC991014.PDF (Rasterized)

TOP