skip to main content
10.1145/3089351.3089352acmconferencesArticle/Chapter ViewAbstractPublication PagesmobisysConference Proceedingsconference-collections
research-article
Public Access

Toward Accurate and Efficient Feature Selection for Speaker Recognition on Wearables

Published: 19 June 2017 Publication History
  • Get Citation Alerts
  • Abstract

    Due to the user-interface limitations of wearable devices, voice-based interfaces are becoming more common; speaker recognition may then address the authentication requirements of wearable applications. Wearable devices have small form factor, limited energy budget and limited computational capacity. In this paper, we examine the challenge of computing speaker recognition on small wearable platforms, and specifically, reducing resource use (energy use, response time) by trimming the input through careful feature selections. For our experiments, we analyze four different feature-selection algorithms and three different feature sets for speaker identification and speaker verification. Our results show that Principal Component Analysis (PCA) with frequency-domain features had the highest accuracy, Pearson Correlation (PC) with time-domain features had the lowest energy use, and recursive feature elimination (RFE) with frequency-domain features had the least latency. Our results can guide developers to choose feature sets and configurations for speaker-authentication algorithms on wearable platforms.

    References

    [1]
    Sourav Bhattacharya and Nicholas D. Lane. Sparsification and separation of deep learning layers for constrained resource inference on wearables. In Proceedings of the ACM Conference on Embedded Network Sensor Systems (SenSys), pages 176--189. ACM, 2016.
    [2]
    Frédéric Bimbot, Jean-Franc Bonastre, Corinne Fredouille, Guillaume Gravier, Ivan Magrin-Chagnolleau, Sylvain Meignier, Téva Merlin, Javier Ortega-García, Dijana Petrovska-Delacrétaz, and Douglas A. Reynolds. A tutorial on text-independent speaker verification. EURASIP Journal on Advances in Signal Processing, 2004(4):430--451, 2004.
    [3]
    Marko Borazio and Kristof Van Laerhoven. Using time use with mobile sensor data: a road to practical mobile activity recognition? In Proceedings of the International Conference on Mobile and Ubiquitous Multimedia, page 20. ACM, 2013.
    [4]
    Girish Chandrashekar and Ferat Sahin. A survey on feature selection methods. Computers and Electrical Engineering, 40(1):16--28, January 2014.
    [5]
    Cory Cornelius, Zachary Marois, Jacob Sorber, Ron Peterson, Shrirang Mare, and David Kotz. Vocal resonance as a biometric for pervasive wearable devices. Technical Report TR2014--747, Dartmouth Computer Science, February 2014. Online at http://www.cs.dartmouth.edu/reports/TR2014-747.pdf.
    [6]
    Fred Cummins, Marco Grimaldi, Thomas Leonard, and Juraj Simko. The CHAINS corpus: Characterizing individual speakers. In Proceedings of Speech and Computer (SPECOM), volume 6, pages 431--435, 2006. Online at http://chains.ucd.ie/docs/chains_corpus_specom2006.pdf.
    [7]
    A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, Series B (Methodological), 39(1):1--38, 1977.
    [8]
    Hua Huang and Shan Lin. Toothbrushing monitoring using wrist watch. In Proceedings of the ACM Conference on Embedded Network Sensor Systems (SenSys), pages 202--215. ACM, November 2016.
    [9]
    Tomi Kinnunen and Haizhou Li. An overview of text-independent speaker recognition: From features to supervectors. Speech Communication, 52(1):12--40, January 2010.
    [10]
    Ron Kohavi and George H. John. Wrappers for feature subset selection. Artificial Intelligence, 97(1-2):273--324, December 1997.
    [11]
    Hong Lu, A. J. Bernheim Brush, Bodhi Priyantha, Amy K. Karlson, and Jie Liu. Speakersense: Energy efficient unobtrusive speaker identification on mobile phones. In Kent Lyons, Jeffrey Hightower, and Elaine M. Huang, editors, Proceedings of the International Conference on Pervasive Computing, volume 6696, pages 188--205. Springer, June 2011.
    [12]
    Daniele Ravi, Charence Wong, Benny Lo, and Guang-Zhong Yang. A deep learning approach to on-node sensor data analytics for mobile or wearable devices. IEEE Journal of Biomedical and Health Informatics, 21(1):56--64, January 2017.
    [13]
    Reza Rawassizadeh, Chelsea Dobbins, Manouchehr Nourizadeh, Zahra Ghamchili, and Michael Pazzani. A natural language query interface for searching personal information on smartwatches. In IEEE International Conference on Pervasive Computing, WristSense workshop (Percom '17), 2017. Online at https://arxiv.org/pdf/1611.07139.
    [14]
    Reza Rawassizadeh, Elaheh Momeni, Chelsea Dobbins, Joobin Gharibshah, and Michael Pazzani. Scalable daily human behavioral pattern mining from multivariate temporal data. IEEE Transactions on Knowledge and Data Engineering, 28(11):3098--3112, November 2016.
    [15]
    Reza Rawassizadeh, Blaine A. Price, and Marian Petre. Wearables: Has the age of smartwatches finally arrived? Communications of the ACM, 58(1):45--47, December 2015.
    [16]
    Reza Rawassizadeh, Martin Tomitsch, Manouchehr Nourizadeh, Elaheh Momeni, Aaron Peery, Liudmila Ulanova, and Michael Pazzani. Energy-efficient integration of continuous context sensing and prediction into smartwatches. Sensors, 15(9):22616--22645, September 2015.
    [17]
    Douglas A. Reynolds. Speaker identification and verification using Gaussian mixture speaker models. Speech Communication, 17(1-2):91--108, August 1995.
    [18]
    Douglas A. Reynolds, Thomas F. Quatieri, and Robert B. Dunn. Speaker verification using adapted Gaussian mixture models. Digital Signal Processing, 10(1-3):19--41, January 2000.
    [19]
    Douglas A. Reynolds and Richard C. Rose. Robust text-independent speaker identification using gaussian mixture speaker models. IEEE Transactions on Speech and Audio Processing, 3(1):72--83, January 1995.
    [20]
    E. Variani, X. Lei, E. McDermott, I. L. Moreno, and J. Gonzalez-Dominguez. Deep neural networks for small footprint text-dependent speaker verification. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 4052--4056, 2014.
    [21]
    Xiaojia Zhao, Yuxuan Wang, and DeLiang Wang. Robust Speaker Identification in Noisy and Reverberant Conditions. IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), 22(4):836--845, 2014.

    Cited By

    View all
    • (2024)PA2BLO: Low-Power, Personalized Audio Badge2024 IEEE International Conference on Pervasive Computing and Communications (PerCom)10.1109/PerCom59722.2024.10494427(154-163)Online publication date: 11-Mar-2024
    • (2023)Lightweight and Non-Invasive User Authentication on EarablesProceedings of the 24th International Workshop on Mobile Computing Systems and Applications10.1145/3572864.3580332(36-41)Online publication date: 22-Feb-2023
    • (2023)ODSearchProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/35694886:4(1-25)Online publication date: 11-Jan-2023
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WearSys '17: Proceedings of the 2017 Workshop on Wearable Systems and Applications
    June 2017
    60 pages
    ISBN:9781450349598
    DOI:10.1145/3089351
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 19 June 2017

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. audio signal processing
    2. feature selection

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    MobiSys'17
    Sponsor:

    Acceptance Rates

    WearSys '17 Paper Acceptance Rate 9 of 9 submissions, 100%;
    Overall Acceptance Rate 28 of 36 submissions, 78%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)23
    • Downloads (Last 6 weeks)4

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)PA2BLO: Low-Power, Personalized Audio Badge2024 IEEE International Conference on Pervasive Computing and Communications (PerCom)10.1109/PerCom59722.2024.10494427(154-163)Online publication date: 11-Mar-2024
    • (2023)Lightweight and Non-Invasive User Authentication on EarablesProceedings of the 24th International Workshop on Mobile Computing Systems and Applications10.1145/3572864.3580332(36-41)Online publication date: 22-Feb-2023
    • (2023)ODSearchProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/35694886:4(1-25)Online publication date: 11-Jan-2023
    • (2022)Using a small amount of text-independent speech data for a BiLSTM large-scale speaker identification approachJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2020.03.01134:3(764-770)Online publication date: Mar-2022
    • (2018)Vocal ResonanceProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/31917512:1(1-23)Online publication date: 26-Mar-2018

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media