People
Associate Professor

Yang Yi Ph.D, Associate Professor

Department of Electronic Engineering, Tsinghua University, Beijing, China, 100084

Tel: +86-10-62781443

Email:yangyy@tsinghua.edu.cn

Introduction:

Dr. Yang Yi is Associate Professor of Department of Electronic Engineering at the Tsinghua University. She joined at the Department of Electronic Engineering of Tsinghua University as postdoctoral fellows in 2009. She earned her Ph.D. degree of pattern recognition and artificial intelligence from Beijing University of Science and Technology in 2007. She previously worked at Huawei Technologies Co. Ltd and was an algorithm Engineer. She is interested in speech signal processing and human-computer interaction perception technology.

Education background

2001.09 - 2007.04 Ph.D. degree, Department of Automation, in Beijing University of science and technology, Beijing, China.

1995.09 - 1999.07 Bachelor degree, Department of Automation, in Beijing University of science and technology, Beijing, China.

Experience

2014.12-now Associate Professor, Department of Electronic Engineering

2011.06-2014.12 Assistant Researcher, Department of Electronic Engineering

2009.06-2011.06 Postdoctoral, Department of Electronic Engineering

2007.04-2009.05 Algorithm Engineer, Huawei Technologies Co. Ltd

Concurrent Academic

IEEE member, ACM member, CCF Senior member

Social service

None

Areas of Research Interests/ Research Projects

1. Speaker recognition technology under the complex channel conditions;

2. Speech enhancement and microphone array technology;

3. Internet cross-media analysis and retrieval technology;

4. Intelligent human-computer interaction perception technology;

5. Audio and video codec technology.

Research Status

1. NSFC: Speaker diarization based on distributed acoustic sensor networks and quantum optimization learning technology research. Project chief.

2. NSFC: Multiplayer multiparty dialogue speech separation, content analysis and understanding. Participated.

3. 863: Humane voice interactive technology research. Participated.

4. Beijing Natural Science Foundation Program: The complex scenes efficient voice enhancement technology. Participated.

5. ITU Standard: As the technology backbone to achieve China's first speech in the ITU ISO codec technology patents embedded success, won the gold medal team Huawei Medal. Participated.

6. AVS Standard: Responsible for domestic AVS codec technology standard technology embedded promote AVS audio application GB. One patent has been embedded to national standards. Group leader.

Honors And Awards

First prize, Outstanding Class Teacher, Tsinghua University, 2017.

Faculty Advisor, Honorable Mention of Mathematical Contest in Modeling (MCM), 2016.

Excellent Instructor, thirty-fourth Challenge Cup student competition, Tsinghua University, 2016.

Academic Achievement

Issued Patents:

1. Patent No. ZL201510808753.X. Robustness sound source space positioning method of distributed microphone array network.

2. Patent No. WO2015124006-A1. PCT/CN2014/091959. Audio detection classification method with a custom function.

3. Patent No. ZL201410209057.2. Gauss mix model parameter based scene audio index optimizing method, involves sensing input signal by using sensor, and arranging voice frequency section with classifier model corresponding to information classification.

4. Patent No. ZL201310298757.9. Speaker identifying method based on sparse dimension reduction, involves treating acoustic characteristics of speech signal recognition shadow sparse matrix according to judgment of classifier training.

5. Patent No. ZL201210548563.5. Partial learning based speaker recognition method, involves dividing training data into multi-class training set, extracting characteristic space features of each training set, and calculating average value of training data.

6. Patent No. 201110303580.8. Audio indexing method based on multi-distance sound sensor.

7. Patent No. 201010568360.3. Method for orienting sound source space of distributed asynchronous acoustic sensor, involves calculating space position coordinate of each sound source according to time delay estimation values.

8. Patent No. ZL201010568386.8. Speaker clustering method of distributed microphone, involves calculating time delay of sound source signal segment according to index value of certain frame of numbered sub-band.

9. Patent No. WO2009140896A1. PCT/CN2009/071708. Pitch post processing method involves performing filter process to pitch synthesized signal by obtaining and combining gain control values of local adjustment factors to acquire filer function, for performing pitch post process.

10. Patent No. ZL200910132345.1. Voice enhancing method, involves obtaining enhanced frequency domain noised voice signal according to prior signal-to-noise ratio of present frame in frequency domain clean voice signal based on least-mean-square-error criterion.

11. Patent No. ZL200810198772.5. Voice signal enhancement method, involves weighing corresponding voice signals by weighing coefficient of voice signals to acquire weighed multiple voice signals that are synthesized to acquire enhanced voice signal.

Recent Publications:

[1] Y. Yang, et al. Speech Activity Detection and Speaker Localization Based on Distributed Microphones[C]. International Conference on Human-Computer Interaction. Springer International Publishing, 2016: 392-400.

[2] Y. Yang, Design and Implementation of Advanced HCI Education[C]. International Conference on Human-Computer Interaction. Springer International Publishing, 2016: 84-90.

[3] Sun Jiasong, Zhang Jingyun, Yang Yi. Effective audio fingerprint retrieval based on the spectral sub-band centroid feature[J]. Journal of Tsinghua University (Science and Technology), 2017, 57(4): 382-387.

[4] Y. Yang, et al. Local Learning Multiple Probabilistic Linear Discriminant Analysis[C]. International Conference on Human-Computer Interaction. Springer International Publishing, 2015: 604-610.

[5] Y. Yang, et al. New Research Methods for Media and Cognition Experiment Course[C]. International Conference of Design, User Experience, and Usability. Springer International Publishing, 2015: 327-334.

[6]Yang Y., Liu J., Exploring the Large-Scale TDOA Feature Space for Speaker Diarization[C]. International Conference, HCI International 2014, Jun.22- Jun.29, pp: 551-556, 2014.

[7]Yang Y., Liu J., Dereverberation for Speaker Identification in Meeting[C]. International Conference, HCI International 2014, Jun.22- Jun.29, pp: 594-599, 2014.

[8]Yang Y., Introduction to Big Data Information Processing Technology[M]. Beijing: Electronic Industry Press, 2017.

[9]Yang Y., Cross-media Information Technology and Applications[M]. Beijing: Electronic Industry Press, 2013.

[10]Yang Y., Introduction of cross-media information technology[M]. Beijing: Electronic Industry Press, 2012.

[11]HE Liang,YANG Yi,LIU Jia,TLS-NAP Algorithm for Text-Independent Speaker Recognition[J]. PR&AI, No. 06, pp 916-921, 2012.