4
>> start inference...
>> Reference audio length: 14.79 seconds
>> gpt_gen_time: 18.24 seconds
>> gpt_forward_time: 1.00 seconds
>> bigvgan_time: 0.96 seconds
>> Total inference time: 20.71 seconds
>> Generated audio length: 3.03 seconds
>> RTF: 6.8375
>> wav file saved to: outputs\spk_1750159244.wav
参考音频:sample_prompt.wav
文本:IndexTTS 正式发布1.0版本了,效果666
生成结果:spk_1750159244.wav
无论是波形图还是实际音频,都没有声音