From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on dcvr.yhbt.net X-Spam-Level: X-Spam-Status: No, score=-3.9 required=3.0 tests=AWL,BAYES_00,DKIM_INVALID, DKIM_SIGNED,HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI, RCVD_IN_DNSWL_LOW,RCVD_IN_MSPIKE_H2,SPF_HELO_PASS,SPF_PASS, UNPARSEABLE_RELAY shortcircuit=no autolearn=ham autolearn_force=no version=3.4.2 Received: from lists.sourceforge.net (lists.sourceforge.net [216.105.38.7]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by dcvr.yhbt.net (Postfix) with ESMTPS id 834B91F66F for ; Fri, 20 Nov 2020 14:45:39 +0000 (UTC) Received: from [127.0.0.1] (helo=sfs-ml-2.v29.lw.sourceforge.com) by sfs-ml-2.v29.lw.sourceforge.com with esmtp (Exim 4.90_1) (envelope-from ) id 1kg7fI-0001C9-L1; Fri, 20 Nov 2020 14:45:32 +0000 Received: from [172.30.20.202] (helo=mx.sourceforge.net) by sfs-ml-2.v29.lw.sourceforge.com with esmtps (TLSv1.2:ECDHE-RSA-AES256-GCM-SHA384:256) (Exim 4.90_1) (envelope-from ) id 1kg7fG-0001By-LI for sox-users@lists.sourceforge.net; Fri, 20 Nov 2020 14:45:30 +0000 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=sourceforge.net; s=x; h=In-Reply-To:Content-Type:MIME-Version:Message-ID: Subject:To:From:Date:Sender:Reply-To:Cc:Content-Transfer-Encoding:Content-ID: Content-Description:Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc :Resent-Message-ID:References:List-Id:List-Help:List-Unsubscribe: List-Subscribe:List-Post:List-Owner:List-Archive; bh=c25xovSdUlt4tqVLXQIo8uN+yu85Qdp+0hTLJHpQvF4=; b=QOLgn6YqTRI0Xv/5ECi+3JZOcz b+hPCAXDQqhuvn9urUGANhrcgs9V4yFSEdRRvzAXFuSnmW6ny+qbgbqrTvENYXHmH/7usjlP9BTXf ax2A3Q4bklQZnlIKp4Wd5K6tkTXVnir8teINglJl20P5YG/Y+zE1cfA0HxJJAOe96vWQ=; DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=sf.net; s=x ; h=In-Reply-To:Content-Type:MIME-Version:Message-ID:Subject:To:From:Date: Sender:Reply-To:Cc:Content-Transfer-Encoding:Content-ID:Content-Description: Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID: References:List-Id:List-Help:List-Unsubscribe:List-Subscribe:List-Post: List-Owner:List-Archive; bh=c25xovSdUlt4tqVLXQIo8uN+yu85Qdp+0hTLJHpQvF4=; b=I TL68S4oY28+YQVoOjeDFwGisdGwxShryzYpEsPnqy5QSSPMiWiyTTd1hcCYNtoEWfpyCYHD9GMEPD cU76kSeb77vvL3f2cQSCAp6+rmLG6uo088zMpxnt75eu7G1aSKIz0H5tsU3ql0KEkguCS/It8rbjX zhD9vX5e5EhBdj0Q=; Received: from uvt.stare.cz ([185.63.96.79] helo=mx.stare.cz) by sfi-mx-1.v28.lw.sourceforge.com with esmtps (TLSv1.2:ECDHE-RSA-AES256-GCM-SHA384:256) (Exim 4.92.2) id 1kg7f9-0037pe-Ox for sox-users@lists.sourceforge.net; Fri, 20 Nov 2020 14:45:30 +0000 Received: from localhost (stare.cz [local]) by stare.cz (OpenSMTPD) with ESMTPA id cca6255f for ; Fri, 20 Nov 2020 15:18:25 +0100 (CET) Date: Fri, 20 Nov 2020 15:18:25 +0100 From: Jan Stary To: sox-users@lists.sourceforge.net Message-ID: MIME-Version: 1.0 Content-Disposition: inline In-Reply-To: <6ff4d95eaff1bea310dc294ec0903c3d@wingsandbeaks.org.uk> <5e992665db5dc96823d9ef6830430718@wingsandbeaks.org.uk> X-Headers-End: 1kg7f9-0037pe-Ox Subject: Re: Search and remove audio sections X-BeenThere: sox-users@lists.sourceforge.net X-Mailman-Version: 2.1.21 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: sox-users@lists.sourceforge.net Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Errors-To: sox-users-bounces@lists.sourceforge.net On Nov 17 15:52:52, Dani@softco.co.il wrote: > I have a bunch of old MP3 podcasts that have ads in them, > at the beginning and the end. Are the ads always at the beginning and at the end, and never anywhere else? > The ads are about 30 seconds long and usually > have a small familiar jingle before they start and after they end. Usually: so not all of them have the jingle(s), or the jingle is not always the same, right? > I was wondering if there is an ability using SoX (or other tool) > to do a "search and remove" on these, in a batch format >- that would apply to hundreds of these files. General audio search is quite hard. But if you intend to actually listen to the 10 minutes of podcast, removing the ads manualy from the beginning and end is a matter of seconds on top of those 10 minutes. On Nov 17 20:42:51, jn.ml.sxu.88@wingsandbeaks.org.uk wrote: > If the jingles at the start and end of each ad are binary equal (which > they might be if an automated system placed copies of their contents > in the files) then in theory one could use a conventional file search > utility to locate each one. A prerequisite of that would be that oll the files are in the very same binary format. We know they are mp3s - are they the same samplerate, bitrate, etc? Because even if the jingle is one and the same every tome, it won't be, encoded into the individual mp3s. > Recognising the jingles might be hard if any aspect of mp3 compression > of the audio means that successive parts of jingles don't appear in the > exact same bit- and byte- pattern in each file. Yes, and they probably won't. > If the files contain, say, continuous music (or maybe even speech) then > there's a tiny gap (hopefully of digital silence) then a jingle then a > second tiny gap then more content, I think you could possibly look for > the positions of the gaps. The ads are supposed to be at the beginning and end. But cutting at silence is what I would go fro first, if there is a telling silence around the ads of course. Jan _______________________________________________ Sox-users mailing list Sox-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/sox-users