Questions tagged [phoneme]

A phoneme in linguistics is the smallest part of a word that can be pronounced. The word "the", for example, consists of phonemes /th/ and /e/. Use this tag to ask about how phonemes can be handled in natural language processing applications, or to identify them.

A phoneme in linguistics is the smallest part of a word that can be pronounced. The word "the", for example, consists of /th/ and /e/. Use this tag to ask about how phonemes can be handled in natural language processing applications, or to identify them.

For instance, in the Speech Synthesis Markup Language used by speech synthesisers to read text, the pronunciation of a phoneme can be specified:

<ssml:phoneme alphabet="x-microsoft-ups" ph="string"> </ssml:phoneme>

Here is what Wikipedia has on phonemes.

53 questions
0
votes
0 answers

using with Google TTS

I am trying to get individual phoneme sounds from Google Text to Speech. I'm getting the individual phonetic spellings and corresponding IPA from dictionary.com. For Ex) pizza (peet-suh) IPA: ('pit sə) I'm using Python to get the response from the…
0
votes
1 answer

Word/Phoneme Corpus for an Elman SRN (English)

I'm writing an Elman Simple Recurrent Network. I want to give it sequences of words, where each word is a sequence of phonemes, and I want a lot of training and test data. So, what I need is a corpus of English words, together with the phonemes…
Iskar Jarak
  • 5,136
  • 4
  • 38
  • 60
0
votes
1 answer

Can Mandarin pronunciation be coded by ARPABET phone set?

The code doesn't need to be exactly correct. A pronunciation in English can be used to represent a similar pronunciation in Chinese, e.g. /ʈ͡ʂ/ can use "CH" to represent.
0
votes
0 answers

String subclass for phonetic Levenshtein distance

I'm trying to create a string subclass, in order to represent Arpabet symbols as single characters. Here is what I got so far: import Levenshtein class ArpabetChar(str): """ Class that turn string into an Arpabet…
Michael
  • 1
  • 2
0
votes
1 answer

ibm-cloud speech-to-text: Is it possible to specify phonemes for custom vocabulary?

We need to build a custom model with a lot of already phonemically transcribed custom vocabulary, but the current API for specifying custom words has no published option for specifying a phonemic string rather than a manually generated, ad-hoc…
W. Sadkin
  • 261
  • 3
  • 8
0
votes
1 answer

how to extract phoneme data with its corresponding time-steps?

I could successfully extract phoneme data from an audio file using Pocketsphinx, but how can I also output the lasting time of each phoneme? def phonemes(filename): audio_path = os.path.join(dir_path, filename) # Create a decoder with…
Sam35
  • 11
  • 1
0
votes
0 answers

How to get speech recognition to detect SAPI emphasis markers?

It is possible to extract the default phonemes for a given word via SAPI by: Voice word with text-to-speech and store output in a .wav Use the .wav as input for speech recognition Upon recognition of the word extract the phonemes from…
Exergist
  • 157
  • 12
0
votes
0 answers

Phoneme extraction using Pocketsphinx: RuntimeError: new_Decoder returned -1

I am new to stack overflow. I am trying to achieve the results with the code provided in the citation below. https://stackoverflow.com/questions/30705028/convert-sound-to-list-of-phonemes-in-python However, this code isn't compiling successfully…
envi z
  • 677
  • 7
  • 13
0
votes
1 answer

can we programmatically call get audio phonemes instead of using command

I have used pocketsphinx command to extract phonemes and it is working fine pocketsphinx_continuous -infile test/data/goforward.raw -hmm model/en-us/en-us \ -allphone model/en-us/en-us-phone.lm.bin -backtrace yes \ …
Maria
  • 123
  • 8
0
votes
0 answers

Word to phonemes converter for slangs and made up words in Python

I currently use NLTK to convert words to phonemes in python. This works well for words in the library, but for slangs and made up words, NLTK doesn't work. E.g. words like "whasup" "dawg" Is there a library that does a best guess at the phonemes of…
Ian Lin
  • 384
  • 1
  • 5
  • 20
0
votes
0 answers

How to convert text to phonemes?

Is there a program that can convert words to their respective phonemes? It doesn't really matter which syntax/format is used as long as its consistent and reproducible. I have lots of data to convert but CMU's text to Arpabet converter only…
Jonathan
  • 51
  • 7
0
votes
1 answer

How to get a CTM File from CMU Sphinx?

I have decoded my speech into phonemes by using my language model.I need to segment the speech into sentences and words.I know , a ctm file will help .Can anyone tell me how to generate the ctm file for a particular speech instance ?
0
votes
1 answer

Python pattern matching of rules to parse text to phonemes

I have a set of rules that can be used to convert text to a set of phonemes. The application of these rules would result in conversions such as the following: a uh ability ae-b-ih-l-ih-t-ee aboard uh-b-oh-r-d abort …
d3pd
  • 7,935
  • 24
  • 76
  • 127
0
votes
0 answers

Phoneme Recognition with PocketSphinx

I need the real-time phoneme recognition from the microphone on Windows 8 Desktop. So I followed http://cmusphinx.sourceforge.net/wiki/phonemerecognition and built pocketsphinx_continuous from the subversion source in VS2013. Running it in the…
K-man
  • 117
  • 1
  • 10
0
votes
1 answer

categorizing short audio samples

I have a small number of similar types of sounds (I shall refer to these as DB_sounds) to which I need to match a recording (Rec_sounds). Each Rec_sound is short and unique and needs to be matched to its corresponding DB_sound. How do I go about…
Roman
  • 8,826
  • 10
  • 63
  • 103