sox-users@lists.sourceforge.net unofficial mirror
 help / color / mirror / code / Atom feed
From: "paul fellows" <4-werk@gmx.com>
To: sox <sox-users@lists.sourceforge.net>
Subject: make spoken words machine recognizable.
Date: Sun, 4 Jan 2015 16:02:34 +0100	[thread overview]
Message-ID: <trinity-84194944-0df6-4626-a59c-e77581a447fa-1420383754028@3capp-mailcom-bs14> (raw)

A better way to explain why I want to add echo to a voice to make spoken words machine recognizable. 
Current voice to text systems use very complex mathematical algorithms to work out what are words and what are not words, and then to work out what the word might be.

-d gain -n -D -b 32 -r 8k -e filename.raw filenaname.wav play 

If I am starting to get the hang of sox this should just play the input. You as a human being will have no problem recognizing the words. As mentioned above machines do have difficulty with that task.

Now play the input from the mic with these echoes added: echo 11 1 13 1 17 1 19 1 23 1 29 1 31 1 37 1 47 1 53 1.

It will sound like an horrendous mush, but keep listening. After a while you will start to recognize patterns, a while latter you may even be able to convinces your self that you could what is being said. Human beings are very good at learning to recognize patterns. Machines are also able to recognize patterns, when they have to be given examples to match against.

Languages contain many tens of thousands of words, all of them made up of fewer fifty spoken sounds. So use a piece of pattern matching software to match the sounds in sequence, build up the words one sound at a time. I intend to use chatscript for this, but if you are more familiar with aiml or some other language, it should work.

To make the machine understand the words is a bigger problem.

------------------------------------------------------------------------------
Dive into the World of Parallel Programming! The Go Parallel Website,
sponsored by Intel and developed in partnership with Slashdot Media, is your
hub for all things parallel software development, from weekly thought
leadership blogs to news, videos, case studies, tutorials and more. Take a
look and join the conversation now. http://goparallel.sourceforge.net

             reply	other threads:[~2015-01-04 15:03 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-01-04 15:02 paul fellows [this message]
2015-01-04 16:06 ` make spoken words machine recognizable Jan Stary
  -- strict thread matches above, loose matches on Subject: below --
2015-01-04 16:03 Mike Hamilton

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-list from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://lists.sourceforge.net/lists/listinfo/sox-users

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=trinity-84194944-0df6-4626-a59c-e77581a447fa-1420383754028@3capp-mailcom-bs14 \
    --to=sox-users@lists.sourceforge.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://80x24.org/mirrors/sox.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).