6
➜ index-tts git:(main) ✗ uv run webui.py
>> GPT weights restored from: ./checkpoints/gpt.pth
GPT2InferenceModel has generative capabilities, as `prepare_inputs_for_generation` is explicitly overwritten. However, it doesn't directly inherit from `GenerationMixin`. From 👉v4.50👈 onwards, `PreTrainedModel` will NOT inherit from `GenerationMixin`, and this model will lose the ability to call `generate` and other related functions.
- If you're using `trust_remote_code=True`, you can get rid of this warning by loading the model with an auto class. See https://huggingface.co/docs/transformers/en/model_doc/auto#auto-classes
- If you are the owner of the model architecture code, please modify your model class such that it inherits from `GenerationMixin` (after `PreTrainedModel`, otherwise you'll get an exception).
- If you are not the owner of the model architecture class, please contact the model code owner to update it.
>> semantic_codec weights restored from: /Users/itgoyo/.cache/huggingface/hub/models--amphion--MaskGCT/snapshots/265c6cef07625665d0c28d2faafb1415562379dc/semantic_codec/model.safetensors
cfm loaded
length_regulator loaded
gpt_layer loaded
>> s2mel weights restored from: ./checkpoints/s2mel.pth
>> campplus_model weights restored from: /Users/itgoyo/.cache/huggingface/hub/models--funasr--campplus/snapshots/fb71fe990cbf6031ae6987a2d76fe64f94377b7e/campplus_cn_common.bin
Loading weights from nvidia/bigvgan_v2_22khz_80band_256x
Removing weight norm...
>> bigvgan weights restored from: nvidia/bigvgan_v2_22khz_80band_256x
>> TextNormalizer loaded
>> bpe model loaded from: ./checkpoints/bpe.model
* Running on local URL: http://0.0.0.0:7860
* To create a public link, set `share=True` in `launch()`.
Emo control mode:0,weight:0.8,vec:None
>> starting inference...
Use the specified emotion vector
Passing a tuple of `past_key_values` is deprecated and will be removed in Transformers v4.53.0. You should pass an instance of `Cache` instead, e.g. `past_key_values=DynamicCache.from_legacy_cache(past_key_values)`.
100%|██████████████████████████████████████████████████████████████████████████████████████████████████| 25/25 [00:42<00:00, 1.71s/it]
Traceback (most recent call last):
File "/Users/itgoyo/Downloads/index-tts/.venv/lib/python3.10/site-packages/gradio/queueing.py", line 667, in process_events
response = await route_utils.call_process_api(
File "/Users/itgoyo/Downloads/index-tts/.venv/lib/python3.10/site-packages/gradio/route_utils.py", line 349, in call_process_api
output = await app.get_blocks().process_api(
File "/Users/itgoyo/Downloads/index-tts/.venv/lib/python3.10/site-packages/gradio/blocks.py", line 2274, in process_api
result = await self.call_function(
File "/Users/itgoyo/Downloads/index-tts/.venv/lib/python3.10/site-packages/gradio/blocks.py", line 1781, in call_function
prediction = await anyio.to_thread.run_sync( # type: ignore
File "/Users/itgoyo/Downloads/index-tts/.venv/lib/python3.10/site-packages/anyio/to_thread.py", line 56, in run_sync
return await get_async_backend().run_sync_in_worker_thread(
File "/Users/itgoyo/Downloads/index-tts/.venv/lib/python3.10/site-packages/anyio/_backends/_asyncio.py", line 2476, in run_sync_in_worker_thread
return await future
File "/Users/itgoyo/Downloads/index-tts/.venv/lib/python3.10/site-packages/anyio/_backends/_asyncio.py", line 967, in run
result = context.run(func, *args)
File "/Users/itgoyo/Downloads/index-tts/.venv/lib/python3.10/site-packages/gradio/utils.py", line 915, in wrapper
response = f(*args, **kwargs)
File "/Users/itgoyo/Downloads/index-tts/webui.py", line 160, in gen_single
output = tts.infer(spk_audio_prompt=prompt, text=text,
File "/Users/itgoyo/Downloads/index-tts/indextts/infer_v2.py", line 569, in infer
wav = self.bigvgan(vc_target.float()).squeeze().unsqueeze(0)
File "/Users/itgoyo/Downloads/index-tts/.venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1773, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "/Users/itgoyo/Downloads/index-tts/.venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1784, in _call_impl
return forward_call(*args, **kwargs)
File "/Users/itgoyo/Downloads/index-tts/indextts/s2mel/modules/bigvgan/bigvgan.py", line 372, in forward
xs = self.resblocks[i * self.num_kernels + j](x)
File "/Users/itgoyo/Downloads/index-tts/.venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1773, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "/Users/itgoyo/Downloads/index-tts/.venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1784, in _call_impl
return forward_call(*args, **kwargs)
File "/Users/itgoyo/Downloads/index-tts/indextts/s2mel/modules/bigvgan/bigvgan.py", line 135, in forward
xt = a1(x)
File "/Users/itgoyo/Downloads/index-tts/.venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1773, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "/Users/itgoyo/Downloads/index-tts/.venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1784, in _call_impl
return forward_call(*args, **kwargs)
File "/Users/itgoyo/Downloads/index-tts/indextts/s2mel/modules/bigvgan/alias_free_activation/torch/act.py", line 26, in forward
x = self.upsample(x)
File "/Users/itgoyo/Downloads/index-tts/.venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1773, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "/Users/itgoyo/Downloads/index-tts/.venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1784, in _call_impl
return forward_call(*args, **kwargs)
File "/Users/itgoyo/Downloads/index-tts/indextts/s2mel/modules/bigvgan/alias_free_activation/torch/resample.py", line 33, in forward
x = self.ratio * F.conv_transpose1d(
NotImplementedError: Output channels > 65536 not supported at the MPS device.