[Feature]: AutoPitch - automatic pitch detection #786

Bebra777228 · 2024-10-05T14:22:45Z

Description

Recently, a question regarding this has already been asked that, in my opinion, was not formulated quite accurately.

I would like to know if you plan to implement a function for automatic pitch detection, similar to how it was implemented in SVC.

I am not sure if this feature is still available in SVC, but about a year and a half ago, during inference, you could check the 'autopitch' box. This allowed the model to automatically adjust to the pitch of the voice in the source recording, resulting in a more realistic voice than with manual pitch adjustment (even though this feature worked poorly, the results were good). At least, that's how it seemed to me.

Problem

Proposed Solution

Alternatives Considered

aris-py · 2024-10-05T22:33:28Z

SVC is no longer used, it is too old, instead we use RVC which is much better and more up to date.

blaisewf · 2024-10-05T22:43:37Z

SVC is no longer used, it is too old, instead we use RVC which is much better and more up to date.

pero has leído?

aris-py · 2024-10-05T23:07:09Z

perdona, no se mucho ingles

kro-ai · 2024-10-07T12:27:26Z

SVC is no longer used, it is too old, instead we use RVC which is much better and more up to date.

Creo que piden la misma función en RVC. Estoy de acuerdo con ellos, ¡sería muy útil! entonces, al hacer inferencias, no es necesario ajustar los semitonos, lo hace por sí solo detectando el tono del audio de entrada.

Sorry for the bad spanish, this is auto-translated, I just wanted to make sure you understand.

For those who don't speak spanish, this feature would be very useful because you wouldn't have to adjust the pitch, ie -12 +12 semitones. It would automatically detect the pitch so if your model is a male voice and your input is a female voice, it would make the output pitch lower to compensate and make it sounds more natural. I used this feature all the time in SVC and found it very useful! Although It didn't work that great all the time, it was still useful to have. Not sure It would work all that great with singing audio though, but when using speaking audio it's really great.

AznamirWoW · 2024-10-07T12:33:04Z

For such functionality to work, there should be some kind of record of what the model was trained on
detecting a max F0 value from inferred audio can be done, but adjusting the pitch down without knowing what the model is capable of, is not.

kro-ai · 2024-10-07T15:34:36Z

For such functionality to work, there should be some kind of record of what the model was trained on detecting a max F0 value from inferred audio can be done, but adjusting the pitch down without knowing what the model is capable of, is not.

Having looked into it a bit more, the way it works on SVC is that an f0 predictor is trained alongside the main model. Which explains why It wouldn't be possible with RVC. It is a shame as this would be very useful.

tomakorea · 2024-10-07T19:23:21Z

I also had really excellent experience with SVC Automatic pitch detection, it made the spoken voice really realistic, actually better than RVC or Applio where to be realistic, it's often necessary to do a lot of manual editing.

kro-ai · 2024-10-08T12:47:48Z

I also had really excellent experience with SVC Automatic pitch detection, it made the spoken voice really realistic, actually better than RVC or Applio where to be realistic, it's often necessary to do a lot of manual editing.

Me too, It was really useful for speaking audio. Maybe some brave soul can add this to Applio.

Chilluminati91 · 2024-10-09T10:43:26Z

Shouldnt this be pretty simple in general? Calculate a mean f0 from the training data. Then calculate mean f0 from your clip before inference and shift by the difference.

kro-ai · 2024-10-10T10:04:25Z

Shouldnt this be pretty simple in general? Calculate a mean f0 from the training data. Then calculate mean f0 from your clip before inference and shift by the difference.

Interesting.

blaisewf · 2024-10-15T11:29:26Z

@Bebra777228 could you share the code used on SVC to do that "AutoPitch" function?

Bebra777228 · 2024-10-15T20:52:24Z

I can't precisely determine which file this is implemented in, but I assume it might be the models.py file. This file contains the parameter use_automatic_f0_prediction, which might be what you need.

Overall, it's easiest to search through the code. You might find something useful if you use this search link.

Bebra777228 added enhancement New feature or request feature labels Oct 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature]: AutoPitch - automatic pitch detection #786

[Feature]: AutoPitch - automatic pitch detection #786

Bebra777228 commented Oct 5, 2024 •

edited

Loading

aris-py commented Oct 5, 2024

blaisewf commented Oct 5, 2024

aris-py commented Oct 5, 2024

kro-ai commented Oct 7, 2024 •

edited

Loading

AznamirWoW commented Oct 7, 2024

kro-ai commented Oct 7, 2024

tomakorea commented Oct 7, 2024

kro-ai commented Oct 8, 2024

Chilluminati91 commented Oct 9, 2024

kro-ai commented Oct 10, 2024

blaisewf commented Oct 15, 2024

Bebra777228 commented Oct 15, 2024

[Feature]: AutoPitch - automatic pitch detection #786

[Feature]: AutoPitch - automatic pitch detection #786

Comments

Bebra777228 commented Oct 5, 2024 • edited Loading

Description

Problem

Proposed Solution

Alternatives Considered

aris-py commented Oct 5, 2024

blaisewf commented Oct 5, 2024

aris-py commented Oct 5, 2024

kro-ai commented Oct 7, 2024 • edited Loading

AznamirWoW commented Oct 7, 2024

kro-ai commented Oct 7, 2024

tomakorea commented Oct 7, 2024

kro-ai commented Oct 8, 2024

Chilluminati91 commented Oct 9, 2024

kro-ai commented Oct 10, 2024

blaisewf commented Oct 15, 2024

Bebra777228 commented Oct 15, 2024

Bebra777228 commented Oct 5, 2024 •

edited

Loading

kro-ai commented Oct 7, 2024 •

edited

Loading