I've been looking into TTS and voice cloning, and I came across this video, which seems to be a song by X sung by singers other than X. I checked different TTS projects, but none of them seem capable of doing this. What kind of models can produce output like this, and how?
https://www.tiktok.com/@ai.fido.project/video/7256527953784229125