AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in the SUPERB Benchmark
AD-TUNING is an adaptive CHILD-TUNING approach for hyperparameter tuning of child networks. To address the issue of selecting an optimal hyperparameter set P, which often varies across tasks in CHILD-TUNING, we first analyze the distribution of parameter importance to determine the range of P. We then propose a simple yet efficient early-stop algorithm to select an appropriately sized child network for each speech task. When evaluated on seven speech processing tasks in the SUPERB benchmark, our framework requires fine-tuning only 0.1%∼10% of the pretrained model parameters per task and achieves state-of-the-art results on most of them. For instance, on speaker diarization the DER is relatively 9.22% lower than the previously reported best result, and the other benchmark results are also highly competitive.
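As a rough illustration of the CHILD-TUNING idea that AD-TUNING builds on, the sketch below estimates per-parameter importance from task gradients and keeps only the top-p fraction of parameters trainable. This is a minimal sketch, not the repository's actual implementation: the function names (`estimate_importance`, `build_child_mask`) and the commented candidate-p loop are hypothetical placeholders.

```python
# Minimal PyTorch sketch of CHILD-TUNING-style gradient masking (illustration only).
# Function names are hypothetical; the real implementation lives in the s3prl code of this repo.

import torch


def estimate_importance(model, dataloader, loss_fn, device="cpu"):
    """Approximate per-parameter importance with accumulated squared gradients
    (a Fisher-style estimate) over a few task batches."""
    importance = {n: torch.zeros_like(p) for n, p in model.named_parameters() if p.requires_grad}
    model.to(device).train()
    for inputs, targets in dataloader:
        model.zero_grad()
        loss = loss_fn(model(inputs.to(device)), targets.to(device))
        loss.backward()
        for n, p in model.named_parameters():
            if p.grad is not None:
                importance[n] += p.grad.detach() ** 2
    return importance


def build_child_mask(importance, p):
    """Keep the top-p fraction of parameters (by importance) trainable; mask the rest."""
    scores = torch.cat([v.flatten() for v in importance.values()])
    k = max(1, int(p * scores.numel()))
    threshold = torch.topk(scores, k).values.min()
    return {n: (v >= threshold).float() for n, v in importance.items()}


# During fine-tuning, gradients outside the child network are zeroed before each optimizer step:
#   for n, p in model.named_parameters():
#       if p.grad is not None:
#           p.grad *= mask[n]
#   optimizer.step()
#
# The adaptive part of AD-TUNING can be thought of as trying a small set of candidate p values
# (within the range suggested by the importance distribution) and stopping early once the
# development metric stops improving; see the paper for the exact procedure.
```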
git clone https://github.com/liyunlongaaa/AD-TUNING.git
cd AD-TUNING
conda create -n ad_tuning python=3.10
conda activate ad_tuning
pip install -e ".[all]"
cd s3prl
bash run.sh > log.txt
More information (about data, configs, training, and inference) can be found here
We studied many useful projects while writing our code, including:
https://github.com/alibaba/AliceMind/tree/main/ChildTuning
https://github.com/s3prl/s3prl
Thanks to these authors for open-sourcing their code!
If you find this repo useful for your research, please cite:
@inproceedings{yang23n_interspeech,
author={Gaobin Yang and Jun Du and Maokui He and Shutong Niu and Baoxiang Li and Jiakui Li and Chin-Hui Lee},
title={{AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in the SUPERB Benchmark}},
year=2023,
booktitle={Proc. INTERSPEECH 2023},
pages={421--425},
doi={10.21437/Interspeech.2023-1167}
}
@inproceedings{yang21c_interspeech,
author={Shu-wen Yang and Po-Han Chi and Yung-Sung Chuang and Cheng-I Jeff Lai and Kushal Lakhotia and Yist Y. Lin and Andy T. Liu and Jiatong Shi and Xuankai Chang and Guan-Ting Lin and Tzu-Hsien Huang and Wei-Cheng Tseng and Ko-tik Lee and Da-Rong Liu and Zili Huang and Shuyan Dong and Shang-Wen Li and Shinji Watanabe and Abdelrahman Mohamed and Hung-yi Lee},
title={{SUPERB: Speech Processing Universal PERformance Benchmark}},
year=2021,
booktitle={Proc. Interspeech 2021},
pages={1194--1198},
doi={10.21437/Interspeech.2021-1775}
}