How to separate voice and background music from a video file












6















Is it possible to separate voice and background music from a video file?



I only need the background music.



Type of video: MP4



Working on Windows XP 32 bit SP3










share|improve this question

























  • Perhaps not so easy youtube.com/watch?v=LtXrKo8Btfc

    – Dr. belisarius
    Feb 14 '11 at 17:00
















6















Is it possible to separate voice and background music from a video file?



I only need the background music.



Type of video: MP4



Working on Windows XP 32 bit SP3










share|improve this question

























  • Perhaps not so easy youtube.com/watch?v=LtXrKo8Btfc

    – Dr. belisarius
    Feb 14 '11 at 17:00














6












6








6


2






Is it possible to separate voice and background music from a video file?



I only need the background music.



Type of video: MP4



Working on Windows XP 32 bit SP3










share|improve this question
















Is it possible to separate voice and background music from a video file?



I only need the background music.



Type of video: MP4



Working on Windows XP 32 bit SP3







video music voice






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Nov 26 '17 at 11:46









slhck

162k47448470




162k47448470










asked Feb 14 '11 at 16:42









SpongeBob SquarePantsSpongeBob SquarePants

2644717




2644717













  • Perhaps not so easy youtube.com/watch?v=LtXrKo8Btfc

    – Dr. belisarius
    Feb 14 '11 at 17:00



















  • Perhaps not so easy youtube.com/watch?v=LtXrKo8Btfc

    – Dr. belisarius
    Feb 14 '11 at 17:00

















Perhaps not so easy youtube.com/watch?v=LtXrKo8Btfc

– Dr. belisarius
Feb 14 '11 at 17:00





Perhaps not so easy youtube.com/watch?v=LtXrKo8Btfc

– Dr. belisarius
Feb 14 '11 at 17:00










3 Answers
3






active

oldest

votes


















4














Unless they're separate audio tracks in your video, not easily. What you'll probably have to do is extract the audio track from the video into a separate file, edit the audio file with a dedicated tool, then remux the result back into the video.



The demux/remux part is easy. What's going to be difficult is attempting to isolate the background music. You'll probably have to experiment with different effects, all of which will most likely result in either a significant loss of fidelity in the audio or not entirely removing the dialogue, if not both. What's more is that you're going to be re-encoding that output into a new mp3/aac file, and between the re-encoding and audio processing, your output is going to sound much worse than the original.



You may have better results by trying to re-master the background music and replacing the audio track in the movie file entirely.






share|improve this answer
























  • replacing it would be crazy and hopeless, since I am just music lover, I don't know anything else about music. Is there any 3rd party program out there which can help me?

    – SpongeBob SquarePants
    Feb 14 '11 at 17:08











  • No, unless you have source material with separate tracks for the various audio pieces, separating out music from voice is extremely difficult and for the tools there are out there that claim to do it, quality is always poor.

    – BBlake
    Feb 14 '11 at 17:42





















5














Audacity – a free and open-source cross-platform audio editor – can do this, using the Vocal Reduction and Isolation effect. You should first extract the audio from the video file, e.g. using ffmpeg:



ffmpeg -i video.mp4 -c:a pcm_s16le audio.wav


And then load the audio.wav file into Audacity:





If you only want to get background music, select the Remove Vocals option; if you want the opposite, choose Isolate Vocals.



Note that this is never going to sound perfect. Vocal isolation is a hard task, as everything you hear is basically mixed into two tracks. An algorithm will never be as good as your brain in isolating different sound sources. Your audio source should be a stereo file with the vocals being panned dead-center. It might also produce false-positives, removing other instruments in the process.






share|improve this answer
























  • This was one of best tutorials I've read about Audacity. Underrated answer. Thank you very much :)

    – Zeta.Investigator
    Dec 2 '17 at 18:59











  • Is it just a band-pass filter? Then what is "Strength" meaning in this? Does is attenuate the amplitude of the out-of-band signals or does it have something to do with power (or log of power)?

    – Zeta.Investigator
    Dec 2 '17 at 19:01






  • 1





    @Zeta.Investigator Just briefly skimming the article and the source code, it appears to be the power of the band pass / band stop that will be added or subtracted from the center. Apparently it's normalized by the difference between 1) the power of the middle vs. overall signal and 2) the power of the side vs. overall signal. Then it's exponentiated – so I assume it's on the log scale. (But I'm not an expert on this kind of processing or LISP.)

    – slhck
    Dec 3 '17 at 11:30





















0














I managed to remove background conversation/noise from a video by running a low-pass filter that was built in to the editor that I was using. The whirrs of my robot, which was the target of my video, remained.



Note that I am not an audio expert, and I cannot guarantee this will work for your particular case, but it's worth a try.






share|improve this answer























    Your Answer








    StackExchange.ready(function() {
    var channelOptions = {
    tags: "".split(" "),
    id: "3"
    };
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function() {
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled) {
    StackExchange.using("snippets", function() {
    createEditor();
    });
    }
    else {
    createEditor();
    }
    });

    function createEditor() {
    StackExchange.prepareEditor({
    heartbeatType: 'answer',
    autoActivateHeartbeat: false,
    convertImagesToLinks: true,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: 10,
    bindNavPrevention: true,
    postfix: "",
    imageUploader: {
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    },
    onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    });


    }
    });














    draft saved

    draft discarded


















    StackExchange.ready(
    function () {
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fsuperuser.com%2fquestions%2f245741%2fhow-to-separate-voice-and-background-music-from-a-video-file%23new-answer', 'question_page');
    }
    );

    Post as a guest















    Required, but never shown

























    3 Answers
    3






    active

    oldest

    votes








    3 Answers
    3






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    4














    Unless they're separate audio tracks in your video, not easily. What you'll probably have to do is extract the audio track from the video into a separate file, edit the audio file with a dedicated tool, then remux the result back into the video.



    The demux/remux part is easy. What's going to be difficult is attempting to isolate the background music. You'll probably have to experiment with different effects, all of which will most likely result in either a significant loss of fidelity in the audio or not entirely removing the dialogue, if not both. What's more is that you're going to be re-encoding that output into a new mp3/aac file, and between the re-encoding and audio processing, your output is going to sound much worse than the original.



    You may have better results by trying to re-master the background music and replacing the audio track in the movie file entirely.






    share|improve this answer
























    • replacing it would be crazy and hopeless, since I am just music lover, I don't know anything else about music. Is there any 3rd party program out there which can help me?

      – SpongeBob SquarePants
      Feb 14 '11 at 17:08











    • No, unless you have source material with separate tracks for the various audio pieces, separating out music from voice is extremely difficult and for the tools there are out there that claim to do it, quality is always poor.

      – BBlake
      Feb 14 '11 at 17:42


















    4














    Unless they're separate audio tracks in your video, not easily. What you'll probably have to do is extract the audio track from the video into a separate file, edit the audio file with a dedicated tool, then remux the result back into the video.



    The demux/remux part is easy. What's going to be difficult is attempting to isolate the background music. You'll probably have to experiment with different effects, all of which will most likely result in either a significant loss of fidelity in the audio or not entirely removing the dialogue, if not both. What's more is that you're going to be re-encoding that output into a new mp3/aac file, and between the re-encoding and audio processing, your output is going to sound much worse than the original.



    You may have better results by trying to re-master the background music and replacing the audio track in the movie file entirely.






    share|improve this answer
























    • replacing it would be crazy and hopeless, since I am just music lover, I don't know anything else about music. Is there any 3rd party program out there which can help me?

      – SpongeBob SquarePants
      Feb 14 '11 at 17:08











    • No, unless you have source material with separate tracks for the various audio pieces, separating out music from voice is extremely difficult and for the tools there are out there that claim to do it, quality is always poor.

      – BBlake
      Feb 14 '11 at 17:42
















    4












    4








    4







    Unless they're separate audio tracks in your video, not easily. What you'll probably have to do is extract the audio track from the video into a separate file, edit the audio file with a dedicated tool, then remux the result back into the video.



    The demux/remux part is easy. What's going to be difficult is attempting to isolate the background music. You'll probably have to experiment with different effects, all of which will most likely result in either a significant loss of fidelity in the audio or not entirely removing the dialogue, if not both. What's more is that you're going to be re-encoding that output into a new mp3/aac file, and between the re-encoding and audio processing, your output is going to sound much worse than the original.



    You may have better results by trying to re-master the background music and replacing the audio track in the movie file entirely.






    share|improve this answer













    Unless they're separate audio tracks in your video, not easily. What you'll probably have to do is extract the audio track from the video into a separate file, edit the audio file with a dedicated tool, then remux the result back into the video.



    The demux/remux part is easy. What's going to be difficult is attempting to isolate the background music. You'll probably have to experiment with different effects, all of which will most likely result in either a significant loss of fidelity in the audio or not entirely removing the dialogue, if not both. What's more is that you're going to be re-encoding that output into a new mp3/aac file, and between the re-encoding and audio processing, your output is going to sound much worse than the original.



    You may have better results by trying to re-master the background music and replacing the audio track in the movie file entirely.







    share|improve this answer












    share|improve this answer



    share|improve this answer










    answered Feb 14 '11 at 16:55









    afrazierafrazier

    20.6k24682




    20.6k24682













    • replacing it would be crazy and hopeless, since I am just music lover, I don't know anything else about music. Is there any 3rd party program out there which can help me?

      – SpongeBob SquarePants
      Feb 14 '11 at 17:08











    • No, unless you have source material with separate tracks for the various audio pieces, separating out music from voice is extremely difficult and for the tools there are out there that claim to do it, quality is always poor.

      – BBlake
      Feb 14 '11 at 17:42





















    • replacing it would be crazy and hopeless, since I am just music lover, I don't know anything else about music. Is there any 3rd party program out there which can help me?

      – SpongeBob SquarePants
      Feb 14 '11 at 17:08











    • No, unless you have source material with separate tracks for the various audio pieces, separating out music from voice is extremely difficult and for the tools there are out there that claim to do it, quality is always poor.

      – BBlake
      Feb 14 '11 at 17:42



















    replacing it would be crazy and hopeless, since I am just music lover, I don't know anything else about music. Is there any 3rd party program out there which can help me?

    – SpongeBob SquarePants
    Feb 14 '11 at 17:08





    replacing it would be crazy and hopeless, since I am just music lover, I don't know anything else about music. Is there any 3rd party program out there which can help me?

    – SpongeBob SquarePants
    Feb 14 '11 at 17:08













    No, unless you have source material with separate tracks for the various audio pieces, separating out music from voice is extremely difficult and for the tools there are out there that claim to do it, quality is always poor.

    – BBlake
    Feb 14 '11 at 17:42







    No, unless you have source material with separate tracks for the various audio pieces, separating out music from voice is extremely difficult and for the tools there are out there that claim to do it, quality is always poor.

    – BBlake
    Feb 14 '11 at 17:42















    5














    Audacity – a free and open-source cross-platform audio editor – can do this, using the Vocal Reduction and Isolation effect. You should first extract the audio from the video file, e.g. using ffmpeg:



    ffmpeg -i video.mp4 -c:a pcm_s16le audio.wav


    And then load the audio.wav file into Audacity:





    If you only want to get background music, select the Remove Vocals option; if you want the opposite, choose Isolate Vocals.



    Note that this is never going to sound perfect. Vocal isolation is a hard task, as everything you hear is basically mixed into two tracks. An algorithm will never be as good as your brain in isolating different sound sources. Your audio source should be a stereo file with the vocals being panned dead-center. It might also produce false-positives, removing other instruments in the process.






    share|improve this answer
























    • This was one of best tutorials I've read about Audacity. Underrated answer. Thank you very much :)

      – Zeta.Investigator
      Dec 2 '17 at 18:59











    • Is it just a band-pass filter? Then what is "Strength" meaning in this? Does is attenuate the amplitude of the out-of-band signals or does it have something to do with power (or log of power)?

      – Zeta.Investigator
      Dec 2 '17 at 19:01






    • 1





      @Zeta.Investigator Just briefly skimming the article and the source code, it appears to be the power of the band pass / band stop that will be added or subtracted from the center. Apparently it's normalized by the difference between 1) the power of the middle vs. overall signal and 2) the power of the side vs. overall signal. Then it's exponentiated – so I assume it's on the log scale. (But I'm not an expert on this kind of processing or LISP.)

      – slhck
      Dec 3 '17 at 11:30


















    5














    Audacity – a free and open-source cross-platform audio editor – can do this, using the Vocal Reduction and Isolation effect. You should first extract the audio from the video file, e.g. using ffmpeg:



    ffmpeg -i video.mp4 -c:a pcm_s16le audio.wav


    And then load the audio.wav file into Audacity:





    If you only want to get background music, select the Remove Vocals option; if you want the opposite, choose Isolate Vocals.



    Note that this is never going to sound perfect. Vocal isolation is a hard task, as everything you hear is basically mixed into two tracks. An algorithm will never be as good as your brain in isolating different sound sources. Your audio source should be a stereo file with the vocals being panned dead-center. It might also produce false-positives, removing other instruments in the process.






    share|improve this answer
























    • This was one of best tutorials I've read about Audacity. Underrated answer. Thank you very much :)

      – Zeta.Investigator
      Dec 2 '17 at 18:59











    • Is it just a band-pass filter? Then what is "Strength" meaning in this? Does is attenuate the amplitude of the out-of-band signals or does it have something to do with power (or log of power)?

      – Zeta.Investigator
      Dec 2 '17 at 19:01






    • 1





      @Zeta.Investigator Just briefly skimming the article and the source code, it appears to be the power of the band pass / band stop that will be added or subtracted from the center. Apparently it's normalized by the difference between 1) the power of the middle vs. overall signal and 2) the power of the side vs. overall signal. Then it's exponentiated – so I assume it's on the log scale. (But I'm not an expert on this kind of processing or LISP.)

      – slhck
      Dec 3 '17 at 11:30
















    5












    5








    5







    Audacity – a free and open-source cross-platform audio editor – can do this, using the Vocal Reduction and Isolation effect. You should first extract the audio from the video file, e.g. using ffmpeg:



    ffmpeg -i video.mp4 -c:a pcm_s16le audio.wav


    And then load the audio.wav file into Audacity:





    If you only want to get background music, select the Remove Vocals option; if you want the opposite, choose Isolate Vocals.



    Note that this is never going to sound perfect. Vocal isolation is a hard task, as everything you hear is basically mixed into two tracks. An algorithm will never be as good as your brain in isolating different sound sources. Your audio source should be a stereo file with the vocals being panned dead-center. It might also produce false-positives, removing other instruments in the process.






    share|improve this answer













    Audacity – a free and open-source cross-platform audio editor – can do this, using the Vocal Reduction and Isolation effect. You should first extract the audio from the video file, e.g. using ffmpeg:



    ffmpeg -i video.mp4 -c:a pcm_s16le audio.wav


    And then load the audio.wav file into Audacity:





    If you only want to get background music, select the Remove Vocals option; if you want the opposite, choose Isolate Vocals.



    Note that this is never going to sound perfect. Vocal isolation is a hard task, as everything you hear is basically mixed into two tracks. An algorithm will never be as good as your brain in isolating different sound sources. Your audio source should be a stereo file with the vocals being panned dead-center. It might also produce false-positives, removing other instruments in the process.







    share|improve this answer












    share|improve this answer



    share|improve this answer










    answered Nov 26 '17 at 11:46









    slhckslhck

    162k47448470




    162k47448470













    • This was one of best tutorials I've read about Audacity. Underrated answer. Thank you very much :)

      – Zeta.Investigator
      Dec 2 '17 at 18:59











    • Is it just a band-pass filter? Then what is "Strength" meaning in this? Does is attenuate the amplitude of the out-of-band signals or does it have something to do with power (or log of power)?

      – Zeta.Investigator
      Dec 2 '17 at 19:01






    • 1





      @Zeta.Investigator Just briefly skimming the article and the source code, it appears to be the power of the band pass / band stop that will be added or subtracted from the center. Apparently it's normalized by the difference between 1) the power of the middle vs. overall signal and 2) the power of the side vs. overall signal. Then it's exponentiated – so I assume it's on the log scale. (But I'm not an expert on this kind of processing or LISP.)

      – slhck
      Dec 3 '17 at 11:30





















    • This was one of best tutorials I've read about Audacity. Underrated answer. Thank you very much :)

      – Zeta.Investigator
      Dec 2 '17 at 18:59











    • Is it just a band-pass filter? Then what is "Strength" meaning in this? Does is attenuate the amplitude of the out-of-band signals or does it have something to do with power (or log of power)?

      – Zeta.Investigator
      Dec 2 '17 at 19:01






    • 1





      @Zeta.Investigator Just briefly skimming the article and the source code, it appears to be the power of the band pass / band stop that will be added or subtracted from the center. Apparently it's normalized by the difference between 1) the power of the middle vs. overall signal and 2) the power of the side vs. overall signal. Then it's exponentiated – so I assume it's on the log scale. (But I'm not an expert on this kind of processing or LISP.)

      – slhck
      Dec 3 '17 at 11:30



















    This was one of best tutorials I've read about Audacity. Underrated answer. Thank you very much :)

    – Zeta.Investigator
    Dec 2 '17 at 18:59





    This was one of best tutorials I've read about Audacity. Underrated answer. Thank you very much :)

    – Zeta.Investigator
    Dec 2 '17 at 18:59













    Is it just a band-pass filter? Then what is "Strength" meaning in this? Does is attenuate the amplitude of the out-of-band signals or does it have something to do with power (or log of power)?

    – Zeta.Investigator
    Dec 2 '17 at 19:01





    Is it just a band-pass filter? Then what is "Strength" meaning in this? Does is attenuate the amplitude of the out-of-band signals or does it have something to do with power (or log of power)?

    – Zeta.Investigator
    Dec 2 '17 at 19:01




    1




    1





    @Zeta.Investigator Just briefly skimming the article and the source code, it appears to be the power of the band pass / band stop that will be added or subtracted from the center. Apparently it's normalized by the difference between 1) the power of the middle vs. overall signal and 2) the power of the side vs. overall signal. Then it's exponentiated – so I assume it's on the log scale. (But I'm not an expert on this kind of processing or LISP.)

    – slhck
    Dec 3 '17 at 11:30







    @Zeta.Investigator Just briefly skimming the article and the source code, it appears to be the power of the band pass / band stop that will be added or subtracted from the center. Apparently it's normalized by the difference between 1) the power of the middle vs. overall signal and 2) the power of the side vs. overall signal. Then it's exponentiated – so I assume it's on the log scale. (But I'm not an expert on this kind of processing or LISP.)

    – slhck
    Dec 3 '17 at 11:30













    0














    I managed to remove background conversation/noise from a video by running a low-pass filter that was built in to the editor that I was using. The whirrs of my robot, which was the target of my video, remained.



    Note that I am not an audio expert, and I cannot guarantee this will work for your particular case, but it's worth a try.






    share|improve this answer




























      0














      I managed to remove background conversation/noise from a video by running a low-pass filter that was built in to the editor that I was using. The whirrs of my robot, which was the target of my video, remained.



      Note that I am not an audio expert, and I cannot guarantee this will work for your particular case, but it's worth a try.






      share|improve this answer


























        0












        0








        0







        I managed to remove background conversation/noise from a video by running a low-pass filter that was built in to the editor that I was using. The whirrs of my robot, which was the target of my video, remained.



        Note that I am not an audio expert, and I cannot guarantee this will work for your particular case, but it's worth a try.






        share|improve this answer













        I managed to remove background conversation/noise from a video by running a low-pass filter that was built in to the editor that I was using. The whirrs of my robot, which was the target of my video, remained.



        Note that I am not an audio expert, and I cannot guarantee this will work for your particular case, but it's worth a try.







        share|improve this answer












        share|improve this answer



        share|improve this answer










        answered Aug 21 '15 at 3:48









        Bigbio2002Bigbio2002

        3,70411928




        3,70411928






























            draft saved

            draft discarded




















































            Thanks for contributing an answer to Super User!


            • Please be sure to answer the question. Provide details and share your research!

            But avoid



            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.


            To learn more, see our tips on writing great answers.




            draft saved


            draft discarded














            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fsuperuser.com%2fquestions%2f245741%2fhow-to-separate-voice-and-background-music-from-a-video-file%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            Popular posts from this blog

            Aardman Animations

            Are they similar matrix

            “minimization” problem in Euclidean space related to orthonormal basis