Jump to content

Using AI to manipulate film score audio (dialogue removal, instrument isolation, etc)


Recommended Posts

Posted

Which model do you all like for all purpose separation of score from an old movie?

Posted
13 minutes ago, Andy said:

Which model do you all like for all purpose separation of score from an old movie?

Bandit Plus, or MVsep DNR

 

Demucs4HT DNR is good too, but only outputs mono, so only use that with mono sources.

Posted
On 06/02/2024 at 1:20 PM, Faleel said:

Audacity has Stem Seperation plugins now:

 

https://www.audacityteam.org/blog/openvino-ai-effects/

I've installed the plugins into a 2023 build of Audactiy.

I'm using a 4-minute music file that is 16-bit/44.1khz stereo file, but when I select 'OpenVINO Music Separation' , 'the 'Preview' button is grayed-out. 

 

I tried-out another desktop where I have both CPU and GPU Inference devices and the 'Preview' button is still grayed-out.

 

Is there a step that I may have missed?

Posted
Just now, SyncMan said:

I've installed the plugins into a 2023 build of Audactiy.

I'm using a 4-minute music file that is 16-bit/44.1khz stereo file, but when I select 'OpenVINO Music Separation' , 'the 'Preview' button is grayed-out. 

 

I tried-out another desktop where I have both CPU and GPU Inference devices and the 'Preview' button is still grayed-out.

 

Is there a step that I may have missed?

the preview button isn't really necessary since there are no real settings anyway. Just click okay and wait for it to finish

Posted
6 minutes ago, enderdrag64 said:

the preview button isn't really necessary since there are no real settings anyway. Just click okay and wait for it to finish

I've done that and it gave me this error message.

error.png

This is the 'Show Log'

log.png

Posted
1 hour ago, SyncMan said:

I've done that and it gave me this error message.

error.png

This is the 'Show Log'

log.png

Looks like you didn't install the openvino-models correctly

Posted
9 hours ago, Luke Skywalker said:

Would the AI work for this video?

 

X-Minus can largely lose the vocals that get in the way but less luck with all the sfx. I'd doubt you're going to be able to get a properly listenable version sadly.

Posted
46 minutes ago, Luke Skywalker said:

Shame 😔

Do you have the original audio file of the video MVsep's Bandit Plus model might have an effect.

Posted
On 18/02/2024 at 8:18 AM, Luke Skywalker said:

Would the AI work for this video?

 

https://www.youtube.com/watch?v=RS8A99ISvi4

 

is the only music piece not in the game files. And the composer only has the multi track tapes with no means of transfering them 😞

 

 

 

 

 

 

Posted

Thanks faleel. Too many sfx in the video unfortunately, but i may be able to do something as the last half sounds tracked so it could be reconstructed.

  • 2 weeks later...
  • 2 weeks later...
Posted

Has anyone tried using AI separation to get clean the remaining music from AOTC and ROTS that we need?

Thanks for your suggestion Faleel. I've started using MVSEP and I think it's cool and definitely worth it.

Posted
On 21/03/2024 at 8:10 PM, Manakin Skywalker said:

I've used AI very sparingly on my YouTube edits. However I prefer not to use it for the most part because it adds a shit load of distortion, and in some cases removes instruments. It's important not to overdo it.

 

Yes, for music you intend to listen to, it's best for either removal of very simple elements or to make an already polluted soundscape a bit more bearable, but the more it's got to do, the more distorted the result will be. I've come across a few instances where a speciality release presents a cue with its intended mix, and these models can almost entirely remove a synth or solo element that the film removed, but those are solo or frequency distinct elements.

 

It doesn't seem to me to be at that point (yet) where a polluted rip with effects and dialogue can be magically rescued to just isolated music.

Posted

Anyone got a tutorial on how to remove the score from the movie's dialogues and sound effects using these AI tools? It's something I'd like to try one of these days but I never did this before so I'd like to now the steps.

Posted

There's one step. You go to the page in my main post, and drag an audio file into it.  It spits out two tracks, a dialogue-only track and an everything-but-the-dialogue track.

 

These tools remove dialogue, they don't remove sound effects, sadly.

Posted

But some tools that remove percussion can fortunately mistake some sound effects for percussion.

Posted

It's worth trying for just about anything - you just don't get your hopes up if it's anything more complex than dialogue, percussion and high/low frequencies.

Posted

Thanks. I've been meaning to try to do some blockbuster movies whose score I wanted to hear in complete form but their sessions never leaked. But since these are huge blockbusters with lots of explosions and stuff it would probably yield some disappointing results. But who knows, I may try it one of these days.

Posted
1 hour ago, Edmilson said:

Anyone got a tutorial on how to remove the score from the movie's dialogues and sound effects using these AI tools? It's something I'd like to try one of these days but I never did this before so I'd like to now the steps.

MVSEP has a model, "BandIt" that puts out three tracks:

 

Dialogue: at best, it's usually pretty clean.

 

Music: Sometimes can get rid of 50-70% of the sfx, sometimes only 20-50%, depends, increasing the volume, and reducing the ducking can help.

 

Effects: usually clean, sometimes some music can bleed through (string sustains, etc.)

 

There is still some artifacts of course.

  • 2 weeks later...
Posted

NGL, I have not had much luck with Bandit on MVSEP.

Posted
1 minute ago, Giftheck said:

NGL, I have not had much luck with Bandit on MVSEP.

It does seem to work less well, with lossy formats (mp3) and low volumes.

  • 4 weeks later...
Posted

I've been using the MDX23C  separation from MVSEP and I'm really impressed with the results!! Even with very low quality input files (most of the audio inputs that I've used have been mono) and it does an excellent job of removing vocals. Purchasing the premium credits is worth it :)

Posted
3 hours ago, 4te said:

I've been using the MDX23C  separation from MVSEP and I'm really impressed with the results!! Even with very low quality input files (most of the audio inputs that I've used have been mono) and it does an excellent job of removing vocals. Purchasing the premium credits is worth it :)

BSRoformer (newest version) is pretty good too.

  • 3 weeks later...
Posted

I've been playing around with MVSEP for a while, utilizing the 5.1 mix without channel 3 to see if the remaining effects could be removed without much loss, it worked mostly ok, I wasn't pushing it. So I decided to try a scene from an episode of The Flash (2014), It is a very SFX-heavy show and even the dialogue gets in the other channels but the rear channels were surprisingly clean, only 2 channels remained in need of cleaning.

 

I used Demucs4 HT first, and the "instrumental" option hadn't removed any SFX, the "other" on the other hand, was 70% clean, so I went to Bandit Plus to get a real SFX clean version, and it was full of gaps and oscillations and the dialogue was still there, but no SFX. So I tried putting those two alongside the rear channels and this is the result:

Wally And Barry Race.flac

 

With the scene:

Spoiler

 

Posted
21 minutes ago, Gabriel Bezerra said:

I've been playing around with MVSEP for a while, utilizing the 5.1 mix without channel 3 to see if the remaining effects could be removed without much loss, it worked mostly ok, I wasn't pushing it. So I decided to try a scene from an episode of The Flash (2014), It is a very SFX-heavy show and even the dialogue gets in the other channels but the rear channels were surprisingly clean, only 2 channels remained in need of cleaning.

 

I used Demucs4 HT first, and the "instrumental" option hadn't removed any SFX, the "other" on the other hand, was 70% clean, so I went to Bandit Plus to get a real SFX clean version, and it was full of gaps and oscillations and the dialogue was still there, but no SFX. So I tried putting those two alongside the rear channels and this is the result:

Wally And Barry Race.flac 13.02 MB · 0 downloads

 

With the scene:

 

Did you raise the volume/envelope compress the audio? sometimes if the audio is too quiet it makes it hard to seperate.

  • 2 weeks later...
  • 4 weeks later...
Posted

Woah.  Now that's cool!

  • 2 weeks later...
Posted

Somebody try what the hell it can make of Beethoven's sketches!

Posted

Thought I would mention that MVsep has new and updated models:

 

MVSep Guitar (guitar, other)

Now there are two versions available: the updated version of MDX23C and MelRoformer

 

MVSep Multichannel BS (vocals, instrumental)”. This model is specially prepared for extracting vocals from multi-channel audio (5.1, 7.1, etc.). After processing, it returns multi-channel audio in the same format in which it was sent to the server with the same sample rate. 

 

MVSep Multispeaker

Posted

I love it!  Now we can have the percussion only tracks we always wanted!

 

EDIT - So the last two are different processors?  I have to give them a closer listen.  Do you have a preference @The Great Gonzales?

Posted

Hmm interesting.

 

So the Melrof one is a little more aggressive in removing things like piccolo shrieks, but it leaves behind a bit of a muffled sound remaining.  The Mvsep has a more consistent air to it, but you can hear a few of the instruments sneak through.

Posted
14 hours ago, Andy said:

I love it!  Now we can have the percussion only tracks we always wanted!

 

EDIT - So the last two are different processors?  I have to give them a closer listen.  Do you have a preference @The Great Gonzales?

If I understand it correctly, MVsep Drums model is basically taking two different models and "combining forces", melroformer was one of the two (htdemucs being the other)

Posted

WOAH!

Posted
25 minutes ago, Jay said:

WOAH!

Unfortunately after some tests, it doesn't really seem to work well for the music part, but in combination with BSRoformer vocal model, it does work well for isolating SFX and dialogue for rescores etc.

Posted

That’s awesome. It’ll get to where we want it eventually. 

  • 3 weeks later...

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
×
×
  • Create New...

Important Information

By using this site, you agree to our Guidelines.