Publications
CMU Multi-Modal Activity Database ,
, 2009.
"User Verification by Combining Speech and Face Biometrics in Video",
International Symposium on Visual Computing 2008, 2008, vol. 5359, Las Vegas, NV, Springer, pp. 482-492, 12/2008.
"Structure Inference for Bayesian Multisensory Scene Understanding",
IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 30, no. 12, pp. 2140-2157, 12/2008.
"Exploring Co-Occurence Between Speech and Body Movement for Audio-Guided Video Localization",
IEEE Transactions on Circuits and Systems for Video Technology, vol. 18, no. 11, pp. 1608-1617, 11/2008.
"Robust face-voice based speaker identity verification using multilevel fusion",
Image and Vision Computing, vol. 26, no. 9, pp. 1249-1260, 09/2008.
"Audiovisual integration with Segment Models for tennis video parsing",
Computer Vision and Image Understanding, vol. 111, no. 2, pp. 142-154, 08/2008.
"Speaker detection using the timing structure of lip motion and sound",
First IEEE Workshop on CVPR for Human Communicative Behavior Analysis, Anchorage, AK, pp. 18, 06/2008.
"Multimodal person authentication using speech, face and visual speech",
Computer Vision and Image Understanding, vol. 109, no. 1, pp. 44-55, 01/2008.
"Feature space video stream consistency estimation for dynamic stream weighting in audio-visual speech recognition",
IEEE International Conference on Image Processing 2008, San Diego, CA, pp. 13161319, 2008.
"Video Augmentation for Improving Audio Speech Recognition under Noise",
British Machine Vision Conference 2008, Leeds, UK, 2008.
"Towards Audio-Visual On-line Diarization Of Participants In Group Meetings",
ECCV Workshop on Multi-camera and Multi-modal Sensor Fusion Algorithms and Applications, Marseille, France, 2008.
"Finding Speaker Face Region by Audiovisual Correlation",
ECCV Workshop on Multi-camera and Multi-modal Sensor Fusion Algorithms and Applications, Marseille, France, 2008.
"A Comparative Error Analysis of Audio-Visual Source Localization",
ECCV Workshop on Multi-camera and Multi-modal Sensor Fusion Algorithms and Applications, Marseille, France, 2008.
"Quality-Based Score Normalization for Audiovisual Person Authentication",
International Conference on Image Analysis and Recognition 2008, Portugal, 2008.
"Audio Visual Speaker Verification Based on Hybrid Fusion of Cross Modal Features",
International Conference on Pattern Recognition and Machine Intelligence, PReMI 2007, 2007, vol. 4815, Kolkata, India, Springer, pp. 469-478, 12/2007.

