Questions tagged [transcription]

160 questions
3
votes
0 answers

Speech Recognition: Alternate substrings always empty, confidence 0 most of the time

I'm trying to work with speech recognition in iOS, but so far we're not getting along... I've set up an SFSpeechRecognizer and set its delegate accordingly. Now I implement the delegate method as follows: func speechRecognitionTask(_ task:…
3
votes
1 answer

How to automatically save PowerShell session

How can I automatically redirect everything that happens within a PowerShell window (inputs and outputs) to a file? I don't mean single commands; instead, I would like to have something like a logger which captures everything and logs into a local…
STORM
3
votes
1 answer

Timestamping audio from any language given the audio source and an accurate transcription

I'm looking to get roughly accurate timestamps for each word in an audio file. I also have the original text to go with the audio file which could be used as a cross-reference source of sorts. This is similar to "audio mining," which I believe is…
Eli
2
votes
1 answer

OpenAI transcription API returns 400 bad request

Any idea why the API returns a 400 error? I verified that the MP3 file exists and the authorization is being added to the header of the request. async function transcribeAudio() { const configuration = new Configuration({ apiKey:…
aryaxt
2
votes
2 answers

How to add SFTranscriptionSegment data to Firebase Firestore?

I'm using SFTranscription in Swift to transcribe voice notes. I then want to add this transcription data to Firebase Firestore, but this is triggering an error, since Firestore only accepts these data types. My question is: How should I package up…
2
votes
1 answer

Decryption and encryption algorithms funky

I made a little script for custom encryption and decryption algorithms, which are based on Caesar cipher and transcription cipher. First a little bit of background for explanation on how it should work. Assume a message you want to encrypt before…
Polda18
2
votes
1 answer

Looking for a way to detect pauses in audio files, and then be able to set punctuation between sentences?

I have a transcription app that transcribes audio from a file to text. The problem is that the output text is one long sentence. So I figured a solution could be to look for pauses in the audio file and add punctuation to the transcription. If the audio…
Christian
2
votes
1 answer

Filter out certain voice Google Speech API

I am creating a voice bot using the Google Speech streaming API and Google Text-to-Speech. I would like only the user's speech to be transcribed, even if the user "interrupts" the voice bot's response. How can I prevent the voice bot from transcribing its…
2
votes
1 answer

How to get chatbot conversation

I am working with Azure Bot Service, and the bot is working properly. Once the chat is completed, I need to send the entire chat conversation by email as a transcript. How can I achieve this?
Sagar Patil
2
votes
1 answer

How to iterate over an audio file in 20s intervals?

I am trying to transcribe an audio file which is about 3 min long using SpeechRecognition, however, it seems to be unable to transcribe anything longer than 20 seconds. This is the code that I'm using: r = sr.Recognizer() audio = FLAC(output_name…
Ivan Novikov
2
votes
0 answers

Diffing algorithm: Obtaining timestamps of spoken syllables given actual transcript

I have an audio file that is a recording of a person speaking different letters in sequence, along with a correct, human-created transcript of this audio file, e.g. a string ABCDEF. This audio file is then passed into a speech-to-text transcription…
2
votes
1 answer

Continuous speech recognition without restart after 1 minute

I'm trying to create an app that records the user's voice and at the same time it transcribes it. I'm using AVFoundation and Speech framework to do this work. The problem is that Apple limits the transcription time to one minute. So, after this…
2
votes
0 answers

Setting up Google Cloud Speech API to transcribe interviews

I've got over 100 hours of audio associated with video interviews for a documentary that need to be transcribed to text, ideally with some kind of timecode marker every 30 seconds or so, so that the video can easily be matched up to the text in the…
rieka
2
votes
1 answer

PowerShell Start-Transcript error

I am running a PowerShell script from a batch file: try { Start-Transcript -path ("C:\PS\Logs\XXXX_Session_QA_" + (Get-Date).tostring("yyyyMMdd-hhmmss-tt") + ".txt") } catch { stop-transcript } Every time I run the script,…
Vibhav MS
2
votes
1 answer

Convert Arpabet to IPA with stress

I am creating a dictionary and have a problem converting the Arpabet representation of a word to IPA (International Phonetic Alphabet) with stress. In Arpabet, the number after a vowel indicates whether that vowel is stressed. For example, the word…
Oleg