yan

Yan Huang

Ph.D. Student
Computer Science Division
Electrical Engineering and Computer Science Department
University of California, Berkeley

I am interested in machine learning and its applications in speech and natural language proceesing. My research advisor is Prof. Nelson Morgan.

I am currently working as an SDE Speech Scientist with Microsoft.


Education


MS, Computer Science Department, University of California, Berkeley
MSE, Electrical and Computer Engineering Department, Johns Hopkins University

Publication


[1] "An Auditory-Based Frequency Modulation Feature and Feature Combination for Robust Speaker Identification," in Submission to the IEEE Trans of audio, speech and language processing, 2010 (with Q. Li).

[2] "An Auditory-Based Feature and Its Application to Robust Speaker Identification," Submitted to the IEEE Transaction of audio, speech and language processing, 2009 (with Q. Li).

[3] "An Auditory-Based Feature and Its Application to Robust Speaker Identification," International Conference on Speech and Signal Processing, Dallas, 2010 (with Q. Li).

[4] "Fusing short term and long term features for improved speaker diarization," IEEE Transaction of audio, speech and language processing, 2009 (with G. Friedland, O. Vinyals, and C. Mueller).

[5] "Estimating Dominance In Multi-Party Conversations Using Automatically Generated Audio Cues," Submitted to the IEEE Transaction of audio, speech and language processing, 2009 (with H.Hung, G. Friedland, and D. Gatica-Perez)

[6] "Correlating audio-visual cues in a dominance estimation framework," CVPR workshop on human behavior, 2009 (with H.Hung, G. Friedland, and D. Gatica-Perez).

[7] "Fusing short term and long term features for improved speaker diarization," International Conference on Speech and Signal Processing, Taipei, 2009 (with G. Friedland, O. Vinyals, and C. Mueller).

[8] "Estimating The Dominant Person In Multi-Party Conversations Using Speaker Diarization Strategies," International Conference on Speech and Signal Processing, Las Vegas, 2008 (with H.Hung, G. Friedland, and D. Gatica-Perez).

[9] "Optimization of Latent Semantic Analysis Based Language Model Interpolation for Meeting Recognition", Fifth Slovenian and First International Language Technologies Conference, Slovenia, 2006 (with Michael Pucher, ?g? ?tin).

[10] "Vocabulary and Language Model Adaptation using Information Retrieval", International Conference on Spoken Language Processing, Jeju Island, Korea, 2004 (with Brigitte Bigi, Renato De Mori).

[11] "A Novel Model TD-PSPTP for Speech Synthesis", 6th European Conference on Speech Communication and Technology, Budapest, Hungary, 1999 (with Bo Xu).

[12] "Neural Learning Approach for Duration Parameter Generation in Mandarin Speech Synthesis", 1th International Symposium on Chinese Spoken Language Processing, Singapore, 1998 (with Taiyi Huang).

Working Experience

International Computer Science Institute, Berkeley, CA (Research Assistant)
Panasonic Speech Technologies Laboratory, Santa Barbara, CA (Research Engineer)
Center of Language and Speech Processing, the Johns Hopkins University, Batimore, MD (Researach Assistant)
Bell-labs, Lucent Technologies (Summer Internship)
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China (Research Assistant)

Misc

I am a huge fan of New York City Ballet Company. I also enjoy spending sometime on the barre in the studio when I have time.





web stats