Speech Disorders and Technology Lab: Publications

  • Director: Jun Wang, PhD
  • Locations: CMB1 & HDB5
  • 512-471-6831
  • jun.wang@austin.utexas.edu

Selected Peer-Reviewed Publications


2020

Dash, D., Ferrari, P., Hernandez, A., Heitzman, D., Austin, S., & Wang, J. (2020). Neural speech decoding for amyotrophic lateral sclerosis, Proc. Interspeech,  October 25-29 (Accepted).

Dinh, T., Kain, A., Samlan, R., Cao, B., & Wang, J. (2020). Increasing the intelligibility and naturalness of alaryngeal speech using voice conversion and synthetic fundamental frequency, Proc. Interspeech, October 25-29 (Accepted).

Teplansky, K., Wisler, A., Cao, B., Liang, W., Whited, C. W., Mau, T., & Wang, J. (2020). Tongue and lip motion patterns in alaryngeal speech, Proc. Interspeech, October 25-29, 2020 (Accepted).

Dash, D., Ferrari, P., & Wang, J. (2020). Role of brainwaves in neural speech decoding, Proceedings of the 28th European Signal Processing Conference (EUSIPCO), Amsterdam, NL (accepted).

Dash, D., Ferrari, P., Dutta, S., & Wang, J. (2020).  NeuroVAD: Real-time voice activity detection from non-invasive neuromagnetic signals, Sensors, 20(8), 2248, 1-13.

Dash, D., Ferrari, P., & Wang, J. (2020). Decoding speech evoked jaw motion from non-invasive neuromagnetic oscillations, Proc. International Joint Conference on Neural Networks (IJCNN) (Accepted).

Wisler, A., Teplansky, K., Green, J. R., Austin, S. G., & Wang, J. (2020). Can machines learn continuous measures of speech severity from ordinal training labels? Proc. 33rd International Conference of the Florida Artificial Intelligence Research Society (FLAIRS),  pp. 550-555.  

Dash, D., Ferrari, P., & Wang, J. (2020). Decoding imagined and spoken phrases from non-invasive neural (MEG) signals, Frontiers in Neuroscience, 14(290), 1-15.

2019

Dash, D., Wisler, A., Ferrari, P., & Wang, J. (2019). Towards a speaker independent speech-BCI using speaker adaptation, Proc. Interspeech, pp. 864-868.

Dash, D., Ferrari, P., & Wang, J. (2019). Spatial and spectral fingerprint in the brain: Speaker identification from single trial MEG signals, Proc. Interspeech, pp. 1203-1207.

Gutz, S., Wang, J., Yunusova, Y., & Green, J. R. (2019). Early identification of speech changes due to amyotrophic lateral sclerosis using machine classification, Proc. Interspeech, pp. 604-608.

Teplansky, K., Tsang, B., & Wang, J. (2019). Tongue and lip motion patterns in voiced, whispered, and silent vowel production, Proc. International Congress of Phonetic Sciences, no. 832, pp. 1-5.

Cao, B., Tsang, B., & Wang, J. (2019). Comparing the performance of individual articulatory flesh points for articulation-to-speech synthesis, Proc. International Congress of Phonetic Sciences, no. 761, pp. 1-5.

Cao, B., Wisler, A., & Wang, J. (2019). Is articulation-to-speech synthesis language independent? A pilot study, Proc. International Congress of Phonetic Sciences, no. 759, pp. 1-5.

Dash, D., Ferrari, P., Heitzman, D., & Wang, J. (2019). Decoding speech from single trial MEG signals using convolutional neural networks and transfer learning, The 41st International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), pp. 5531-5535.

Wisler, A., Teplansky, K., Green, J. R., Yunusova, Y., Campbell, T. F., Heitzman, D., & Wang, J. (2019). Speech-based estimation of bulbar regression in amyotrophic lateral sclerosis, Workshop on Speech and Language Processing for Assistive Technologies, pp. 24-31.

Cao, B., Sebkhi, N., Mau, T., Inan, O. T., & Wang, J. (2019). Permanent magnetic articulograph (PMA) vs electromagnetic articulograph (EMA) in articulation-to-speech synthesis for silent speech interface, Workshop on Speech and Language Processing for Assistive Technologies, pp. 17-23.

Dash, D., Ferrari, P., Malik, S., & Wang, J. (2019). Automatic speech activity recognition from MEG signals using Seq2Seq learning, Proc. International IEEE EMBS Conference on Neural Engineering (NER), pp. 340-343.

2018

Dash, D., Ferrari, P., Malik, S., Montillo, A., Maldjian, J., & Wang, J. (2018). Determining the optimal number of MEG trials: A machine learning and speech decoding perspective, Proc. International Conference on Brain Informatics, in Lecture Notes in Computer Science (LNCS), Springer-Nature, vol. 11309, 163-172.

Dash, D.*, Biswal, B., Sao, A. K., & Wang, J. (2018). Automatic recognition of resting state fMRI networks with dictionary learning, Proc. International Conference on Brain Informatics, in Lecture Notes in Computer Science (LNCS), Springer-Nature, vol. 11309, 249-259. (* Best Student Paper Award)

Wang, J., Kothalkar, P., Kim, M., Bandini, A., Cao, B., Yunusova, Y., Campbell, T. F., Heitzman, D., & Green, J. R. (2018). Automatic prediction of intelligible speaking rate for individuals with ALS from speech acoustic and articulatory samples, International Journal of Speech-Language Pathology, 20(6), 669-679.

Farquharson, K., Hogan, T. P., Hoffman, L., Wang, J., Green, K. F., & Green, J. R. (2018). A longitudinal study of infants' early speech production and later letter identification, PLoS ONE, 13(10): e0204006.

Dash, D., Ferrari, P., Malik, S., & Wang, J. (2018). Overt speech retrieval from neuromagnetic signals using wavelets and artificial neural networks, Proc. IEEE Global Conference on Signal and Information Processing (GlobalSIP), pp. 489-493.

Dash, D., Sao, A. K., Wang, J., & Biswal, B. (2018). How many fMRI scans are necessary and sufficient for resting brain connectivity analysis? Proc. IEEE Global Conference on Signal and Information Processing (GlobalSIP), pp. 494-498.

Kim, M., Sebkhi, N., Cao, B., Ghovanloo, M., & Wang, J. (2018). Preliminary test of a wireless magnetic tongue tracking system for silent speech interface, Proc. IEEE Biomedical Circuits and Systems Conference (BioCAS), pp. 1-4.

Cao, B., Kim, M., Wang, J. R., van Santen, J., Mau, T., & Wang, J. (2018). Articulation-to-speech synthesis using articulatory flesh point sensors’ orientation information, Proc. Interspeech, pp. 3152-3156.

An, K., Kim, M., Teplansky, K. J., Green, J., Campbell, T., Yunusova, Y., Heitzman, D., & Wang, J. (2018). Automatic early detection of amyotrophic lateral sclerosis from intelligible speech using convolutional neural networks, Proc. Interspeech, pp. 1913-1917.

Dash, D., Kim, M., Teplansky, K. J., & Wang, J. (2018). Automatic speech recognition with articulatory information and a unified dictionary for Hindi, Marathi, Bengali, and Oriya, Proc. Interspeech, pp. 1046-1050.

Kim, M., Cao, B., An, K., & Wang, J. (2018). Dysarthric speech recognition using convolutional LSTM neural network, Proc. Interspeech, pp. 2948-2952.

Kim, M., Cao, B., & Wang, J. (2018). Multi-view Representation Learning via Canonical Correlation Analysis for Dysarthric Speech Recognition, Proc. International Conference on Mechatronics and Intelligent Robotics, pp. 1085-1095. (Best Paper Award)

Bandini, A., Green, J. R., Wang, J., Campbell, T. F., Zinman, L., & Yunusova, Y. (2018). Kinematic features of jaw and lips distinguish symptomatic from pre-symptomatic stages of bulbar decline in amyotrophic lateral sclerosis, Journal of Speech, Language, and Hearing Research, 61, 1118-1129.

2017

Kim, M., Cao, B., Mau, T., & Wang, J. (2017). Speaker-independent silent speech recognition from flesh point articulatory movements using an LSTM neural network, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25(12): 2323-2336.

Kim, M., Kim, Y., Yoo, J., Wang, J., & Kim, H. (2017). Regularized speaker adaptation of KL-HMM for dysarthric speech recognition, IEEE Transactions on Neural Systems & Rehabilitation Engineering, 25(9): 1581-1591.

Kim, M., Cao, B., Mau, T., & Wang, J. (2017). Multiview representation learning via deep CCA for silent speech recognition, Proc. Interspeech, pp. 2769-2773.

Cao, B., Kim, M., van Santen, J., Mau, T., & Wang, J. (2017). Integrating articulatory information into deep learning-based text-to-speech synthesis, Proc. Interspeech, pp. 254-258.

Allison, K., Yunusova, Y., Campbell, T., Wang, J., Berry, J., & Green, J. (2017). The diagnostic utility of patient-report and speech-language pathologists’ ratings for detecting the early onset of bulbar symptoms due to ALS, Amyotrophic Lateral Sclerosis and Frontotemporal Degeneration, 18(5-6):358-366.

Katz, W., Mehta, S., Wood, M., & Wang, J. (2017). Using electromagnetic articulography with a tongue lateral sensor to discriminate manner of articulation, The Journal of the Acoustical Society of America, 141(1), EL57-EL63.

Wang, J., Kim, M., Hernandez-Mulero, A. H., Heitzman, D., & Ferrari, P. (2017). Towards decoding speech production from single-trial Magnetoencephalography (MEG) signals, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 3036-3040.

Shokoohi-Yekta, M., Hu, B., Jin, H., Wang, J., & Keogh, E. (2017). Generalizing dynamic time warping to the multi-dimensional case requires an adaptive approach, Data Mining and Knowledge Discovery, 31(1), 1-31.

2016

Wang, J., Kothalkar, P., Kim, M., Yunusova, Y., Campbell, T., Heitzman, D., & Green, J. R. (2016). Predicting intelligible speaking rate of individuals with amyotrophic lateral sclerosis from a small number of speech acoustic and articulatory samples, ACL/ISCA Workshop on Speech and Language Processing for Assistive Technologies (SLPAT), pp. 91-97.

Cao, B., Kim, M., Mau, T., & Wang, J. (2016). Recognizing whispered speech produced by an individual with surgically reconstructed larynx using articulatory movement data, ACL/ISCA Workshop on Speech and Language Processing for Assistive Technologies (SLPAT), pp. 80-86.

Kim, M., Wang, J., & Kim, H. (2016). Dysarthric speech recognition using Kullback-Leibler divergence-based hidden Markov model, Proc. Interspeech, pp. 2671-2675.

Wang, J., Kothalkar, P. V., Cao, B., & Heitzman, D. (2016). Towards automatic detection of amyotrophic lateral sclerosis from speech acoustic and articulatory samples, Proc. Interspeech, pp. 1195-1199.

Rong, P., Yunusova, Y., Wang, J., Zinman, L., Pattee, G. L., Berry, J. D., Perry, B., & Green, J. R. (2016). Predicting speech intelligibility decline in amyotrophic lateral sclerosis based on the deterioration of individual speech subsystems, PLOS ONE, 11(5): e0154971, 1-19.

Balasubramanian, A., Wang, J., & Balakrishnan, P. (2016). Discovering multidimensional motifs in physiological signals for personalized healthcare, IEEE Journal of Selected Topics in Signal Processing, 10(5): 832-841.

Wang, J., Samal, A., Rong, P., & Green, J. R. (2016). An optimal set of flesh points on tongue and lips for speech-movement classification, Journal of Speech, Language, and Hearing Research, 59, 15-26.

2015

Wang, J., Hahm, S., & Mau, T. (2015). Determining an optimal set of flesh points on tongue, lips, and jaw for continuous silent speech recognition, Workshop on Speech and Language Processing for Assistive Technologies, pp. 79-85.

Hahm, S., Heitzman, D., & Wang, J. (2015). Recognizing dysarthric speech due to amyotrophic lateral sclerosis with across-speaker articulatory normalization, Workshop on Speech and Language Processing for Assistive Technologies, pp. 47-54.

Wang, J. & Hahm, S. (2015). Speaker-independent silent speech recognition with across-speaker articulatory normalization and speaker adaptive training, Proc. Interspeech, pp. 2415-2419.

Hahm, S. & Wang, J. (2015). Parkinson’s condition estimation using speech acoustic and inversely mapped articulatory data, Proc. Interspeech, pp. 513-517.

Begum, N., Ulanova, L., Wang, J., & Keogh, E. (2015). Accelerating dynamic time warping clustering with a novel admissible pruning strategy, ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 49-58.

Hahm, S. & Wang, J. (2015). Silent speech recognition from articulatory movements using deep neural network, Proc. of the 18th International Congress of Phonetic Sciences, no. 0524, pp. 1-5.

Shokoohi-Yekta, M., Wang, J., & Keogh, E. (2015). On the non-trivial generalization of dynamic time warping to the multi-dimensional case, SIAM International Conference on Data Mining, 289-297.

Rong, P., Yunusova, Y., Wang, J., & Green, J. R. (2015). Predicting early bulbar decline in amyotrophic lateral sclerosis: A speech subsystem approach, Behavioral Neurology, no. 183027, 1-11.

2014

Wang, J., Katz, W., & Campbell, T. F. (2014). Contribution of tongue lateral to consonant production, Proc. Interspeech, pp. 174-178.

Katz, W., Campbell, T. F., Wang, J., Farrar, E., Eubanks, C., Balasubramanian, A., Prabhakaran, B., & Rennaker, R. (2014). Opti-Speech: A real-time, 3D visual feedback system for speech training, Proc. Interspeech, pp. 1174-1178.

Wang, J., Samal, A., & Green, J. R. (2014). Across-speaker articulatory normalization for speaker-independent silent speech recognition, Proc. Interspeech, pp. 1179-1183.

Wang, J., Samal, A., & Green, J. R. (2014). Preliminary test of a real-time, interactive silent speech interface based on electromagnetic articulograph, Workshop on Speech and Language Processing for Assistive Technologies, pp. 38-45. (View demo using the silent speech interface)

2013-2009

Wang, J., Green, J. R., Samal, A., & Yunusova, Y. (2013). Articulatory distinctiveness of vowels and consonants: A data-driven approach, Journal of Speech, Language, and Hearing Research, 56, 1539-1551.

Yunusova, Y., Rosenthal, J. S., Green, J. R., Rong, P., Wang, J., & Zinman, L. (2013). Detection of bulbar ALS using a comprehensive speech assessment battery, Proc. of the 8th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, pp. 217-220.

Green, J. R., Yunusova, Y., Kuruvilla, M. S., Wang, J., Pattee, G. L., Synhorst, L., Zinman, L., & Berry, J. D. (2013). Bulbar and speech motor assessment in ALS: Challenges and future directions, Amyotrophic Lateral Sclerosis and Frontotemporal Degeneration, 14, 494-500.

Wang, J., Balasubramanian, A., Mojica de La Vega, L., Green, J. R., Samal, A., & Prabhakaran, B. (2013). Word recognition from continuous articulatory movement time-series data using symbolic representations, Workshop on Speech and Language Processing for Assistive Technologies, pp. 119-127.

Green, J. R., Wang, J., & Wilson, D. L. (2013). SMASH: A tool for articulatory data processing and analysis, Proc. Interspeech, pp. 1331-35.

Wang, J., Green, J. R., & Samal, A. (2013). Individual articulator’s contribution to phoneme production, Proc. of IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, pp. 7785-89.

Wang, J., Samal, A., Green, J. R., & Rudzicz, F. (2012). Whole-word recognition from articulatory movements for silent speech interfaces, Proc. Interspeech, pp. 1327-30.

Wang, J., Samal, A., Green, J. R., & Rudzicz, F. (2012). Sentence recognition from articulatory movements for silent speech interfaces, Proc. of IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, pp. 4985-88.

Yunusova, Y., Green, J. R., Greenwood, L., Wang, J., Pattee, G. L., & Zinman, L. (2012). Tongue movements and their acoustic consequences in amyotrophic lateral sclerosis, Folia Phoniatrica et Logopaedica (International Journal of Phoniatrics, Speech Therapy and Communication Pathology), 64(2):94-102.

Wang, J., Green, J. R., Samal, A., & Marx, D. B. (2011). Quantifying articulatory distinctiveness of vowels, Proc. Interspeech, pp. 277-280.

Wang, J., Green, J. R., Samal, A., & Carrell, T. D. (2010). Vowel recognition from continuous articulatory movements for speaker-dependent applications, Proc. of IEEE Intl. Conf. on Signal Processing and Communication Systems, pp. 1-7.

Wang, J., Samal, A., Green, J. R., & Carrell, T. D. (2009). Vowel recognition from articulatory position time-series data, Proc. of IEEE Intl. Conf. on Signal Processing and Communication Systems, pp. 1-6.


 

Note: “In computer science, papers in peer-reviewed conferences are accepted as high-quality scholarly articles”.


Non-Peer-Reviewed Publications


Wang, J. (2015). Silent speech interface: A potential speech communication tool with quality voice output for laryngectomees, The International Association of Laryngectomees (IAL) News, 60(4), 10-11. 

Yunusova, Y., Green, J. R., Wang, J., Pattee, G., & Zinman, L. (2011). A protocol for comprehensive assessment of bulbar (speech) dysfunction in amyotrophic lateral sclerosis (ALS), Journal of Visualized Experiments (JoVE), 48.


Upcoming or Recent Presentations


Heaton, J. & Wang, J. (2020). Research Symposium: Reconstructing Speech from Electromyographic Signals and Lip and Tongue Movements during Silent Speech, the Annual Research Symposium at the American Speech-Language-Hearing Association (ASHA) Convention, San Diego, CA, November 2020 (Invited).

Teplansky, K., Wisler, A., Green, J. R., Heitzman, D., Austin, S., & Wang, J. (2020). Articulatory compensation in early-ALS: A variability analysis, the Annual Convention of the American Speech-Language-Hearing Association (ASHA), November 19-21, San Diego, CA.

Kuruvilla-Dugdale, M., Rong, P., Wisler, A., & Wang, J. (2020). A novel speech motor-articulatory approach for phonetic complexity classification, the Annual Convention of the American Speech-Language-Hearing Association (ASHA), November 19-21, San Diego, CA.

Wang, J. (2020). Silent speech interface to assist the oral communication of laryngectomees, Head and Neck Cancer Symposium, Austin, TX (Invited).

Dash, D., Ferrari, P., & Wang, J. (2020). Neural speech decoding prior to speech onset, the 8th International Brain-Computer Interface Meeting, June 8-11, Brussels, Belgium.

Wisler, A., Teplansky, K., Green, J. R., Austin, S. & Wang, J. (2020). Analyzing residuals of machine learning prediction of speech decline due to amyotrophic lateral sclerosis, UTCare Research Day, April 10, Austin, TX.

Teplansky, K., Dutta, S., Cao, B., & Wang, J. (2020). Tongue and lip motion patterns of alaryngeal and silent speech, Conference on Motor Speech, Feb 20-23, Santa Barbara, California.

Wisler, A., Teplansky, K., Green, J., Yunusova, Y., Campbell, T. F., Heitzman, D., & Wang, J. (2020). ALS detection based on onset using correlation structural features from kinematic measurements, Conference on Motor Speech, Feb 20-23, Santa Barbara, California.

Wisler, A., Teplansky, K., Green, J., Yunusova, Y., Campbell, T. F., Heitzman, D., & Wang, J. (2020). Analyzing residuals of machine learning based ALS-FRS speech score predictions, Conference on Motor Speech, Feb 20-23, Santa Barbara, California.

Teplansky, K. J., Wisler, A., Shrode, A., Green, J. R., Yunusova, Y., Campbell, T.F., & Wang, J. (2019). Articulatory compensation in early amyotrophic lateral sclerosis, The Annual ASHA Convention, Orlando, FL, Nov 21-23. (Meritorious Poster)

Dash, D., Ferrari, P., & Wang, J. (2019). Laterality in the neural decoding of imagined and spoken speech, the 8th Graz Brain-Computer Interface Conference, Graz, Austria, September 16 – 20.

Stipancic, K., Yunusova, Y., Campbell, T. F., Wang, J., & Green, J. (2019). Robust Talkers: Natural Compensation for Impaired Speech Motor Control in a Cohort of Individuals with ALS, Boston Speech Motor Control Symposium, June 21.

Sack, L., Goffman, L., Dollaghan, C., Wang, J., & Sang, M. (2019). Predictors of language outcome in children with developmental language disorder, Symposium on Research in Child Language Disorders (SRCLD), Madison, WI, Jun 6-8.

Teplansky, K., Wisler, A., Shrode, A., Yunusova, Y., Green, J. R., Campbell, T. F., Heitzman, D., & Wang, J. (2019). Articulatory vowel distinctiveness space in early and late-stage ALS, UT Dallas Callier Center PACE poster symposium, Richardson, TX, Feb 22.