I proposed a new speech enhancement architecture called LPCSE that combines classic signal processing technologies, i.e., Linear Predictive Coding (LPC), with neural networks in the auto-differentiable machine learning frameworks, as shown in the figures below. The proposed architecture could leverage the strong inductive biases in the classic speech models in conjunction with the expressive power of neural networks. To achieve this work, I also studied the speech synthesis and enhancement technologies, such as speech vocoders (WaveNet WaveRNN, MelGAN HiFi-GAN, etc.), denoising (SEGAN, U-net, etc.), voice conversion (AutoVC, StarGAN-VC, etc.), and expert-rule inspired speech and audio synthesis (LPCNet, DDSP, etc.).
A brief introduction to this project is shown in the video below.
Based on the ranging method above, I proposed and implemented a fine-grained and low-cost acoustic motion-tracking method called PAMT for mobile interaction. The proposed method allows mobile users to interact with a computer by using a gesture interface in practical indoor environments. The system could provide economical and flexible navigation and gesture recognition for VR/AR users.
In practical indoor environments, it’s challenging to obtain accurate moving distance change based on the phase change due to the attenuation and reflection of acoustic signals. To address this challenge, I proposed a metric to identify the impact of multipath fading in real time. Based on the metric, I proposed a multipath interference mitigation method through the diversity of acoustic frequencies, in which the actual moving distance is calculated by combining the moving distances measured at different frequencies, as shown in the figures below. Experiment results show the measurement errors are less than 2 mm and 4 mm in one-dimensional and two-dimensional scenarios, respectively.
A demonstration is shown in the video below.
A brief introduction to this project is shown in the video below.
Engineers engaged in the development of battery power-supply devices are facing huge pressure to reduce the consumption of power. During the development, engineers require a flexible and low-cost way to know the real-time power computation of multiple Devices Under Tests (DUTs).
In this work, a low-cost and real-time power measurement platform called PTone is proposed to provide a more flexible way for low-power device development than cumbersome equipment, e.g., Keysight 34465A. Compared with traditional measuring equipment, such as Keysight 34465A (>$1500 on Amazon), our system is low-cost (<$20) and could be widely deployed. The median error of PTone is less than 0.37% of the benchmark, as shown in the figures below. Based on the real-time power consumption measured by the system, we propose an SVM-based method to diagnose the abnormal status of DUTs. Our key insight is that the real-time power consumption of low-power devices changes with the periodic system process, such as data collecting, simple signal processing, signal transmission, and sleep, and we used the features of the power consumption to identify whether the microcontroller in the DUT works as expected.
Marie Curie Research Fellow at The University of Sheffield (Mar. 2021 - present)
Research Associate at ShanghaiTech University, Shanghai, China. (Aug. 2019 - Jan. 2021)
Visiting Ph.D. Student at Shanghai Institute of Fog Computing Technology (SHIFT), Shanghai, China. (Apr. 2018 - July 2019)
Visiting Ph.D. Student at Shanghai Institute of Microsystem and Information Technology (SIMIT), Chinese Academy of Sciences (CAS), China. (Sept. 2016 - Apr. 2018)
Student Internship at Shanghai Research Center for Wireless Communications, Shanghai, China. (Oct. 2014 - Sep. 2016)
Research and Teaching Assistant at Beihang University, Beijing, China. (Sept. 2013 - Aug. 2014)
Intern at EDA Laboratory of Beihang University, Beijing, China. (June 2013 - Aug. 2013)
Summer student at the school’s innovation Laboratory of Anhui University. (July 2009 - Aug. 2009)