Electroglottographic Measures Based on GCI and GOI Detection Using Multiscale Product

Aicha Bouzid, Noureddine Elouze

Abstract


This paper deals with glottal parameter estimation such as local pitch and open quotient from electroglottographic signal (EGG). This estimation is based on glottal closing instants and glottal opening instants determined by a multi-scale product of this signal. Wavelet transform of EGG signal is made with a quadratic spline function. Wavelet coefficients calculated on different dyadic scales, show modulus maxima at localized discontinuities of EGG signal. The detected maxima and minima correspond to the glottal opening and closing instants called GOIs and GCIs. To improve the estimate precision, we operate the multi-scale product of wavelet transform coefficients of three successive dyadic scales. This processing enhances edge detection. A Multi-scale product is a nonlinear combination of successive scales; it reduces noise and spurious peaks. We apply cubic root amplitude on the product to improve the representation of weak amplitudes. The method has a good representation of GCI and a best detection of GOI. The method was tested on the Keele University database; it is effective and robust in multiple cases even for a typical signal showing undetermined GOIs and multiple peaks at GCIs. Finally precise measurement of these instants allows accurate estimation of prosodic parameters as local pitch and open quotient.

Keywords


wavelet transform, multi-scale product, electroglottographic signal, glottal closing instant, glottal opening instant

Full Text:

PDF

References


D. G. Childers, A. M. Smith and G. P. Moore, Relationships Between Electroglottograph, Speech, and Vocal Cord Contact, Folia Phoniatr., Vol. 36, pp. 105-118, 1984.
http://dx.doi.org/10.1159/000265727

S. Mallat, A Wavelet Tour of Signal Processing, Second Edition, Academic Press, San Diego 1999.

A. Bouzid, and N. Ellouze, Local Regularity Analysis at Glottal Opening and Closing Instants in Electroglottogram Signal Using Wavelet Transform Modulus Maxima, in Proc. Eurospeech 2003, Geneve, pp. 2837-2840, 2003.

N. Henrich, C. d'Alessandro, M. Castellongo, On the Use of the Derivative of Electroglottographic Signals for Characterization of Non-Pathological Phonation, Journal of Acoustical Society of America, Vol. 115, pp. 1321-1332, 2004.
http://dx.doi.org/10.1121/1.1646401

B. M. Sadler, T. Pham, and L. C. Sadler, Optimal and Wavelet Based Shock Wave Detection and Estimation, Journal of Acoustical Society of America, Vol. 104, no. 2, pp. 955-963, 1998.
http://dx.doi.org/10.1121/1.423312

B. M. Sadler, and A. Swami, Analysis of Multiscale Products for Step Detection and Estimation, IEEE Trans. Inform. Theory, Vol. 45, no. 3, pp. 1043-1051, 1999.
http://dx.doi.org/10.1109/18.761341

L. Zhang, and P. Bao, Edge Detection by Scale Multiplication in Wavelet Domain, Pattern Recognition Letters, Vol. 23, no. 14, pp. 1771-1784, 2002.
http://dx.doi.org/10.1016/S0167-8655(02)00151-4

P. Bao, L. Zhang, and X. Wu, Canny Edge Detection Enhancement by Scale Multiplication, IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 27, no. 9, pp. 1485-1490, 2005.
http://dx.doi.org/10.1109/TPAMI.2005.173

Y. Xu, J. B. Weaver, D. M. Healy, and J. Lu, Wavelet Transform Domain Filters: A Spatially Selective Noise Filtration Technique, IEEE Trans. Image Processing, Vol. 3, no. 6, pp. 747-758, 1994.
http://dx.doi.org/10.1109/83.336245

M. Rothenberg, and J. J. Mahshie, Monitoring Vocal Fold Abduction through Vocal Fold Contact Area, Journal of Speech and Hearing Research, Vol. 31, pp. 338-351, 1988.
http://dx.doi.org/10.1044/jshr.3103.338

D. M. Howard, Variation of Electrolaryngographically Derived Closed Quotient for Trained and Untrained Adult Female Singers, Journal of Voice, Vol. 9, no. 2, pp. 1212-1223, 1995.
http://dx.doi.org/10.1016/S0892-1997(05)80250-4

D. M. Howard, G. A. Lindsey, and B. Allen, Toward the Quantification of Vocal Efficiency, Journal of Voice, Vol. 4, no. 3, pp. 205-212, 1990.
http://dx.doi.org/10.1016/S0892-1997(05)80015-3

D. G. Childers, and A. K. Krishnamurthy, A Critical Review of Electroglottography, CRC Critical Reviews in Biomedical Engineering, Vol. 12, pp. 131-161, 1985.

D. G. Childers, D. M. Hooks, G. P. Moore, L. Eskenazi, and A. L. Lalwani, Electroglottography and Vocal Fold Physiology, Journal of Speech Hearing Research, Vol. 33, pp. 245-254, 1990.
http://dx.doi.org/10.1044/jshr.3302.245

D. G. Childers, and J. N. Lara, Electroglottography for Laryngeal Function Assessment and Speech Analysis, IEEE Trans. on Biomedical Engineering BME, Vol. 31, No. 12, pp. 807-817, 1985.

S. Anastaplo, and M. P. Karnell, Synchronized Videoscopic and Electroglottographic Examination of Glottal Opening, Journal of Acoustical Society of America, Vol. 83, no. 5, pp. 1883-1890, 1988.
http://dx.doi.org/10.1121/1.396472

M. H. Hess, and M. Ludwigs, Strobophotoglottographic Transillumination as a Method for the Analysis of Vocal Fold Vibration Patterns, Journal of Voice, Vol. 14, no. 2, pp. 255-271, 2000.
http://dx.doi.org/10.1016/S0892-1997(00)80034-X

F. Plante, G. F. Meyer, and W. A. Ainsworth, A Pitch Extraction Reference Database, in . Eurospeech 1995, pp. 837-840, 1995.

A. Bouzid, N. Ellouze, Contribution à la Détection des Instants d'Ouverture et de Fermeture de la Glotte sur les Signaux de Parole Voisé par Transformée en Ondelettes, Thése de doctorat, ENIT, Juillet 2004.

A. Witkin, Scale-Space Filtering, Proc. Int. Joint Conf. Artif. Intell., pp. 1019-1021, 1983.

J. Pérez, and A. Bonafonte, Automatic Voice-Source Parametrization of Natural Speech, in Proc. ICSLP 2005, Lisboa, Portugal, 2005.

A. Rosenfeld, A Non Linear Edge Detection, Proc. IEEE, Vol. 58, pp. 814-816, 1970.
http://dx.doi.org/10.1109/PROC.1970.7756

S. Mallat, and S. Zhong, Characterization of Signals from Multiscale Edges, IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 14, no. 7, pp. 710-732, 1992.
http://dx.doi.org/10.1109/34.142909




DOI: https://doi.org/10.15837/ijccc.2008.1.2371



Copyright (c) 2017 Aicha Bouzid, Noureddine Elouze

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

CC-BY-NC  License for Website User

Articles published in IJCCC user license are protected by copyright.

Users can access, download, copy, translate the IJCCC articles for non-commercial purposes provided that users, but cannot redistribute, display or adapt:

  • Cite the article using an appropriate bibliographic citation: author(s), article title, journal, volume, issue, page numbers, year of publication, DOI, and the link to the definitive published version on IJCCC website;
  • Maintain the integrity of the IJCCC article;
  • Retain the copyright notices and links to these terms and conditions so it is clear to other users what can and what cannot be done with the  article;
  • Ensure that, for any content in the IJCCC article that is identified as belonging to a third party, any re-use complies with the copyright policies of that third party;
  • Any translations must prominently display the statement: "This is an unofficial translation of an article that appeared in IJCCC. Agora University  has not endorsed this translation."

This is a non commercial license where the use of published articles for commercial purposes is forbiden. 

Commercial purposes include: 

  • Copying or downloading IJCCC articles, or linking to such postings, for further redistribution, sale or licensing, for a fee;
  • Copying, downloading or posting by a site or service that incorporates advertising with such content;
  • The inclusion or incorporation of article content in other works or services (other than normal quotations with an appropriate citation) that is then available for sale or licensing, for a fee;
  • Use of IJCCC articles or article content (other than normal quotations with appropriate citation) by for-profit organizations for promotional purposes, whether for a fee or otherwise;
  • Use for the purposes of monetary reward by means of sale, resale, license, loan, transfer or other form of commercial exploitation;

    The licensor cannot revoke these freedoms as long as you follow the license terms.

[End of CC-BY-NC  License for Website User]


INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL (IJCCC), With Emphasis on the Integration of Three Technologies (C & C & C),  ISSN 1841-9836.

IJCCC was founded in 2006,  at Agora University, by  Ioan DZITAC (Editor-in-Chief),  Florin Gheorghe FILIP (Editor-in-Chief), and  Misu-Jan MANOLESCU (Managing Editor).

Ethics: This journal is a member of, and subscribes to the principles of, the Committee on Publication Ethics (COPE).

Ioan  DZITAC (Editor-in-Chief) at COPE European Seminar, Bruxelles, 2015:

IJCCC is covered/indexed/abstracted in Science Citation Index Expanded (since vol.1(S),  2006); JCR2018: IF=1.585..

IJCCC is indexed in Scopus from 2008 (CiteScore2018 = 1.56):

Nomination by Elsevier for Journal Excellence Award Romania 2015 (SNIP2014 = 1.029): Elsevier/ Scopus

IJCCC was nominated by Elsevier for Journal Excellence Award - "Scopus Awards Romania 2015" (SNIP2014 = 1.029).

IJCCC is in Top 3 of 157 Romanian journals indexed by Scopus (in all fields) and No.1 in Computer Science field by Elsevier/ Scopus.

 

 Impact Factor in JCR2018 (Clarivate Analytics/SCI Expanded/ISI Web of Science): IF=1.585 (Q3). Scopus: CiteScore2018=1.56 (Q2);

SCImago Journal & Country Rank

Editors-in-Chief: Ioan DZITAC & Florin Gheorghe FILIP.