Audio Visual Speech Recognition, Region of Interest, Colour Based Segmen- tation, CUAVE database.
Issue Date:
2009
Citation:
Craig Berry, Naomi Harte, Region of Interest Extraction using Colour Based Methods on the CUAVE Database, IET Irish Signals and Systems Conference ISSC, Dublin, 10-12 June, 2009
Abstract:
Region of interest (ROI) extraction is an important step in deriving vi-
sual features for an audio-visual speech recognition system. Colour based segmentation
oers the potential of computationally inexpensive algorithms for ROI selection. This
paper presents a comparative study of two colour based techniques, one using hue and
accumulated dierence, the other chrominance. Results are presented for the CUAVE
database. The two methods achieved 69% and 72% correct ROI extraction. The exper-
iment prompted investigation of a new method using a chrominance based accumulated
dierence image. The new method achieved 79% correct ROI identication. The overall
results suggest that a dual approach using chrominance to locate the mouth region and
only employing an accumulated dierence image when signicant motion is not present
would oer good robustness with lower computational cost.
Please note: There is a known bug in some browsers that causes an
error when a user tries to view large pdf file within the browser window.
If you receive the message "The file is damaged and could not be
repaired", please try one of the solutions linked below based on the
browser you are using.
Items in TARA are protected by copyright, with all rights reserved, unless otherwise indicated.