A. Bedagkar-gala and S. K. Shah, A survey of approaches and trends in person re-identification, Image and Vision Computing, vol.32, issue.4, pp.270-286, 2014.
DOI : 10.1016/j.imavis.2014.02.001

A. Bhattacharyya, On a measure of divergence between two multinomial populations, Sankhya: The Indian Journal of Statistics, vol.7, issue.4, pp.401-406, 1933.

J. Bonastre, F. Wils, and S. Meignier, ALIZE, a free toolkit for speaker recognition, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., pp.737-740, 2005.
DOI : 10.1109/ICASSP.2005.1415219

URL : https://hal.archives-ouvertes.fr/hal-01434280

C. Busso, S. Hernanz, C. Chu, S. Il-kwon, S. Lee et al., Smart Room: Participant and Speaker Localization and Identification, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., p.1117, 1120.
DOI : 10.1109/ICASSP.2005.1415605

URL : http://iris.usc.edu/~icohen/./pdf/icassp05.pdf

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.886-893, 2005.
DOI : 10.1109/CVPR.2005.177

URL : https://hal.archives-ouvertes.fr/inria-00548512

A. P. Dempster, N. M. Laird, and D. B. Rubin, Maximum likelihood from incomplete data via the em algorithm, Journal of the Royal Statistical Society, Series B, vol.39, issue.1, pp.1-38, 1977.

T. Falk, C. Zheng, and W. Chan, A nonintrusive quality and intelligibility measure of reverberant and dereverberated speech. Audio, Speech, and Language Processing, IEEE Transactions on, vol.18, issue.7, pp.1766-1774, 2010.
DOI : 10.1109/tasl.2010.2052247

M. Farenzena, L. Bazzani, A. Perina, V. Murino, and M. Cristani, Person re-identification by symmetry-driven accumulation of local features, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.2360-2367, 2010.
DOI : 10.1109/CVPR.2010.5539926

P. Forssen, Maximally Stable Colour Regions for Recognition and Matching, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2007.
DOI : 10.1109/CVPR.2007.383120

URL : http://www.cs.ubc.ca/~perfo/papers/forssen_cvpr07.pdf

T. Germa, F. Lerasle, N. Ouadah, and V. Cadenat, Vision and RFID data fusion for tracking people in crowds by a mobile robot, Computer Vision and Image Understanding, vol.114, issue.6, pp.641-651, 2010.
DOI : 10.1016/j.cviu.2010.01.008

S. Graf, T. Herbig, M. Buck, and G. Schmidt, Features for voice activity detection: a comparative analysis, EURASIP Journal on Advances in Signal Processing, vol.15, issue.10, pp.1-15, 2015.
DOI : 10.1109/LSP.2008.917027

URL : https://asp-eurasipjournals.springeropen.com/track/pdf/10.1186/s13634-015-0277-z?site=asp-eurasipjournals.springeropen.com

A. J. Kolarik, B. C. Moore, P. Zahorik, S. Cirstea, and S. Pardhan, Auditory distance perception in humans: a review of cues, development, neuronal bases, and effects of sensory loss, Attention, Perception, & Psychophysics, vol.21, issue.7, pp.1-23, 2015.
DOI : 10.1038/82931

L. F. Lamel, J. Luc-gauvain, M. Eskenazi, and M. E. , Limsi-cnrs. Bref, a large vocabulary spoken corpus for french, pp.505-508
DOI : 10.1016/s0167-6393(99)00067-9

E. Larsen, N. Iyer, C. R. Lansing, and A. S. Feng, On the minimum audible difference in direct-to-reverberant energy ratio, The Journal of the Acoustical Society of America, vol.124, issue.1, pp.450-461, 2008.
DOI : 10.1121/1.2936368

URL : http://europepmc.org/articles/pmc2677334?pdf=render

R. Mazzon, S. F. Tahir, and A. Cavallaro, Person re-identification in crowd, Pattern Recognition Letters, vol.33, issue.14, pp.1828-1837, 2012.
DOI : 10.1016/j.patrec.2012.02.014

URL : http://www.eecs.qmul.ac.uk/~andrea/papers/2012_PRL_ReidentificationCrowd_Mazzon_Tahir_Cavallaro.pdf

D. H. Mershon and L. E. King, Intensity and reverberation as factors in the auditory perception of egocentric distance, Perception & Psychophysics, vol.62, issue.6, pp.409-415
DOI : 10.2307/1418558

URL : https://link.springer.com/content/pdf/10.3758%2FBF03204113.pdf

A. Mogelmose, C. Bahnsen, T. Moeslund, A. Clapes, and S. Escalera, Tri-modal Person Re-identification with RGB, Depth and Thermal Features, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp.301-307, 2013.
DOI : 10.1109/CVPRW.2013.52

URL : http://vbn.aau.dk/files/210202523/triModalPersonReId.pdf

D. A. Reynolds, Speaker identification and verification using Gaussian mixture speaker models, Speech Communication, vol.17, issue.1-2, pp.91-108, 1995.
DOI : 10.1016/0167-6393(95)00009-D

D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, Speaker Verification Using Adapted Gaussian Mixture Models, Digital Signal Processing, vol.10, issue.1-3, pp.19-41, 2000.
DOI : 10.1006/dspr.1999.0361

URL : http://www.cse.ohio-state.edu/~dwang/teaching/cse788/papers/Reynolds-dsp00.pdf

J. F. Santos, M. Senoussaoui, and T. H. Falk, An updated objective intelligibility estimation metric for normal hearing listeners under noise and reverberation, International Workshop on Acoustic Signal Enhancement (IWAENC), 2014.

R. Satta, Appearance descriptors for person reidentification: a comprehensive review, 2013.

A. Saxena and A. Ng, Learning sound location from a single microphone, 2009 IEEE International Conference on Robotics and Automation, pp.1737-1742, 2009.
DOI : 10.1109/ROBOT.2009.5152861

E. Scheirer and M. Slaney, Construction and evaluation of a robust multifeature speech/music discriminator ICASSP-97, Acoustics, Speech, and Signal Processing IEEE International Conference on, pp.1331-1334, 1997.
DOI : 10.1109/icassp.1997.596192

URL : http://rvl4.ecn.purdue.edu/~malcolm/interval/1996-085/SpeechMusicICASSP97.ps

W. Schwartz and L. Davis, Learning Discriminative Appearance-Based Models Using Partial Least Squares, 2009 XXII Brazilian Symposium on Computer Graphics and Image Processing, 2009.
DOI : 10.1109/SIBGRAPI.2009.42

URL : http://www.umiacs.umd.edu/users/lsd/papers/paperSibgrapi09.pdf

A. Tawari and M. Trivedi, Speech based emotion classification framework for driver assistance system, 2010 IEEE Intelligent Vehicles Symposium, pp.174-178, 2010.
DOI : 10.1109/IVS.2010.5547956

URL : http://cvrr.ucsd.edu/publications/2010/IV10_ATawari.pdf

Z. Zivkovic, Improved adaptive Gaussian mixture model for background subtraction, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004., pp.28-31, 2004.
DOI : 10.1109/ICPR.2004.1333992

URL : http://carol.science.uva.nl/~zivkovic/./Publications/zivkovic2004ICPR.pdf