Questions tagged [transcription]
160 questions
3
votes
0 answers
Speech Recognition: Alternate substrings always empty, confidence 0 most of the time
I'm trying to work with speech recognition in iOS, but I guess we're not getting friends...
I've set up an SFSpeechRegonizer and set its delegate accordingly. Now I implement the delegate method as follows:
func speechRecognitionTask(_ task:…

Mischa
- 15,816
- 8
- 59
- 117
3
votes
1 answer
How to automatically save PowerShell session
How can i redirect everything what happens within a PowerShell windows (inputs and outputs) automatically to a file? I don't mean single commands instead i would like to have something like a logger which captures everything and logs into a local…

STORM
- 4,005
- 11
- 49
- 98
3
votes
1 answer
Timestamping audio from any language given the audio source and an accurate transcription
I'm looking to get roughly accurate timestamps for each word in an audio file. I also have the original text to go with the audio file which could be used as a cross-reference source of sorts. This is similar to "audio mining," which I believe is…

Eli
- 61
- 1
- 4
2
votes
1 answer
OpenAI transcription API returns 400 bad request
Any idea why the API returns a 400 error? I verified that the MP3 file exists and the authorization is being added to the header of the request.
async function transcribeAudio() {
const configuration = new Configuration({
apiKey:…

aryaxt
- 76,198
- 92
- 293
- 442
2
votes
2 answers
How to add SFTranscriptionSegment data to Firebase Firestore?
I'm using SFTranscription in Swift to transcribe voice notes. I then want to add this transcription data to Firebase Firestore, but this is triggering an error, since Firestore only accepts these data types.
My question is:
How should I package up…

Meinong
- 31
- 2
2
votes
1 answer
Decryption and encryption algoritms funky
I made a little script for custom encryption and decryption algorithms, which are based on Caesar cipher and transcription cipher.
First a little bit of background for explanation on how it should work. Assume a message you want to encrypt before…

Polda18
- 162
- 14
2
votes
1 answer
Looking for a way to detect pauses in audio files, and then be able to set punctuation between sentences?
I have a transcript app that transcribes audio from a file to text.
The problem is the output text is one long sentence.
So I figured a solution could be, to look for pauses in the audio file and add punctuation's to the transcription.
If the audio…

Christian
- 21
- 3
2
votes
1 answer
Filter out certain voice Google Speech API
I am creating a voice bot using Google Speech streaming API and Google Text to Speech. I would like only the user's speech to be transcribed, even if the user "interrupts" the voicebot's response. How can I avoid the voicebot from transcribing its…

Harry Stuart
- 1,781
- 2
- 24
- 39
2
votes
1 answer
How to get chatbot conversation
I am working on azure bot service,Bot is working properly. Once the chat is completed i need to send entire chat conversation to email as a transcript-or. How to achieve this?

Sagar Patil
- 469
- 9
- 20
2
votes
1 answer
How to iterate over an audio file in 20s intervals?
I am trying to transcribe an audio file which is about 3 min long using SpeechRecognition, however, it seems to be unable to transcribe anything longer than 20 seconds. This is the code that I'm using:
r = sr.Recognizer()
audio = FLAC(output_name…

Ivan Novikov
- 558
- 6
- 13
2
votes
0 answers
Diffing algorithm: Obtaining timestamps of spoken syllables given actual transcript
I have an audio file that is a recording of a person speaking different letters in sequence, along with a correct, human created transcript of this audio file, e.g. a string ABCDEF.
This audio file is then passed into a speech-to-text transcription…

lreichold
- 755
- 12
- 27
2
votes
1 answer
Continuous speech recognition without restart after 1 minute
I'm trying to create an app that records the user's voice and at the same time it transcribes it. I'm using AVFoundation and Speech framework to do this work. The problem is that Apple limits the transcription time to one minute. So, after this…

BlueMind
- 31
- 1
- 5
2
votes
0 answers
Setting up Google Cloud Speech API to transcribe interviews
I've got over 100 hours of audio associated with video interviews for a documentary that need to be transcribed to text - hopefully with some kind of timecode markers every 30 seconds or so so the video can easily be matched up to the text in the…

rieka
- 21
- 1
2
votes
1 answer
Powershell start transcript error
I am running a powershell script from a batch file:
try {
Start-Transcript -path ("C:\PS\Logs\XXXX_Session_QA_" + (Get-Date).tostring("yyyyMMdd-hhmmss-tt") + ".txt")
}
catch {
stop-transcript
}
Every time I run the script,…

Vibhav MS
- 143
- 1
- 6
- 18
2
votes
1 answer
Convert Arpabet to IPA with stress
I create a dictionary and have a problem with converting Arpabet representation of a word to IPA (International Phonetic Alphabet) with stress.
In Arpabet, you can find which vowel has a stress with the number after the vowel. For example, the word…

Oleg
- 22,300
- 9
- 68
- 84