From: fmiser <fmiser@gmail.com>
To: sox-users@lists.sourceforge.net
Subject: Re: sox to chatscript
Date: Sun, 4 Jan 2015 14:31:20 -0600 [thread overview]
Message-ID: <20150104143120.7a9ab878@Jasper.condray.lan> (raw)
In-Reply-To: <trinity-663d44e9-8253-4e3e-a5e0-fac5ab00a8f3-1420387354336@3capp-mailcom-bs14>
> paul wrote:
>
> The nonsense that this would output is not a problem,
> provided that the sound /p/ as spoken by me produces the
> same set of patterns each time, and no other sound produces
> the same patters. Then it will do what I want. Chatscript
> will print the letter p.
It won't.
I'm still not sure I fully understand all you are scheming.
However, I can confidently say that the audio sound of the
letter "p", as it shows up in any word, will NOT "translate"
to the same pattern of numbers every time - even if it is only
one person saying the words. Speech is just too complex.
If you do precision gain matching, and the person talks slowly
and clearly, and you do statistical analysis on the numbers,
you might be able to determine that a particular set of
numbers is has a high probability of being a "p".
What I think you are trying to do is build a transcription
tool, or speech recognition. SoX can process the audio for
you - but if you are hoping a simple pattern matching will be
able to identify all the letters associated with the sounds,
it won't work. Speech is just too complex. :)
There are a few very mature projects working on speech
recognition and/or transcription. Maybe you should look at
some of the challenges they have dealt with to give you a
better idea of the task.
http://en.wikipedia.org/wiki/List_of_speech_recognition_software
------------------------------------------------------------------------------
Dive into the World of Parallel Programming! The Go Parallel Website,
sponsored by Intel and developed in partnership with Slashdot Media, is your
hub for all things parallel software development, from weekly thought
leadership blogs to news, videos, case studies, tutorials and more. Take a
look and join the conversation now. http://goparallel.sourceforge.net
next prev parent reply other threads:[~2015-01-04 20:32 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-01-04 16:02 sox to chatscript paul fellows
2015-01-04 16:31 ` Chris Angelico
2015-01-04 20:31 ` fmiser [this message]
-- strict thread matches above, loose matches on Subject: below --
2015-01-04 17:47 Mike Hamilton
2015-01-04 13:23 Mike Hamilton
2015-01-03 15:57 paul fellows
2015-01-03 17:49 ` Jan Stary
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-list from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
List information: https://lists.sourceforge.net/lists/listinfo/sox-users
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20150104143120.7a9ab878@Jasper.condray.lan \
--to=sox-users@lists.sourceforge.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
Code repositories for project(s) associated with this public inbox
https://80x24.org/mirrors/sox.git
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).