Workflow?

#6
by nored2222 - opened

Does anyone have a working workflow? I'm especially looking for one for covers.

Start with LLM 4B and XL-Turbo and a fairly basic, mainstream vocal track. Load your own music (a track with lyrics really helps with model conditioning). Use "understand" to get an approximate prompt. Paste the actual lyrics (found online, with the tags), and enable "Src audio" and "Timbre ref" with your song. Switch the task to cover-nofsq mode, set the "cover strength" between 0.3 and 0.6, and generate the music directly with the Synthesize button. You will get whatever clone the model is capable of generating. The closer the copy, the more the lyrics and style can be modified. The "cover" mode (without NOFSQ) will diverge even further from your original music. The model is extremely good at pop/electronic music (four-on-the-floor). All styles can be done, but that requires some familiarity with the model.
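If it helps to keep track of the settings between attempts, here is a minimal Python sketch that writes the options above down as a record and checks them against the values suggested in this post. The class and every field name are hypothetical labels for the UI options, not parameters of any real API.

```python
from dataclasses import dataclass


# Hypothetical record of the UI options named in the post.
# None of these names come from the tool itself.
@dataclass
class CoverSettings:
    lm_model: str = "4B"
    music_model: str = "XL-Turbo"
    task: str = "cover-nofsq"
    cover_strength: float = 0.4
    use_src_audio: bool = True
    use_timbre_ref: bool = True

    def problems(self) -> list[str]:
        """Return a list of deviations from the workflow described in the post."""
        issues = []
        if self.task not in ("cover", "cover-nofsq"):
            issues.append("task should be 'cover' or 'cover-nofsq'")
        if not 0.3 <= self.cover_strength <= 0.6:
            issues.append("cover strength is outside the suggested 0.3 to 0.6 range")
        if not (self.use_src_audio and self.use_timbre_ref):
            issues.append("both 'Src audio' and 'Timbre ref' should be enabled")
        return issues


settings = CoverSettings(cover_strength=0.5)
print(settings.problems() or "settings match the suggested workflow")
```

The check only mirrors the advice in this post; values outside the range are not errors, just a sign that you are leaving the suggested starting point.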

My screenshot tool doesn't capture the menus, but the only thing I did was use "LM understand" at the beginning to initialize the prompt and then choose the task type (cover-nofsq / cover). There is a lot to explore in this model. You have to feed it lots of different music tracks and test lots of different prompts and parameters... It's limitless. In the video I've just "forked" a track.
You should consider generating many takes with a random seed (-1), or running batches and cherry-picking the best results.
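To make the cherry-picking a bit more systematic, here is a small Python sketch that plans a batch of takes across the suggested cover strengths. It draws explicit seeds instead of using -1 so that a good take can be generated again later; that assumes the interface accepts an explicit seed value. The function name and dictionary keys are made up for illustration and are not part of the tool.

```python
import random


def make_batch(strengths, takes_per_strength, rng_seed=0):
    """Plan several takes per cover strength, each with its own explicit seed."""
    rng = random.Random(rng_seed)  # local generator, so the plan itself is repeatable
    batch = []
    for strength in strengths:
        for _ in range(takes_per_strength):
            # Keys are hypothetical labels, to be entered in the UI by hand.
            batch.append({"cover_strength": strength, "seed": rng.randrange(2**32)})
    return batch


batch = make_batch([0.3, 0.4, 0.5, 0.6], takes_per_strength=3)
for job in batch:
    print(job)
```

Write down the strength and seed of each take you like; that pair is what you need to come back to it.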
