Google Cloud Speech-to-Text and Amazon Transcribe both offer punctuation and word timestamps. Can I get punctuation timestamps? Specifically, I want timestamps for sentence breaks (periods, question marks, exclamation points), e.g., at 0:33 seconds, 1:01, 1:23, 1:49, 2:05, etc.
I suppose that I could use Google or AWS to transcribe a file with punctuation, then break the transcript up into sentences, and then do a word timestamp for each sentence. It would be easier (and about 1/500 of the computer time, for a file with 500 sentences) if I could just set a parameter for getPunctuationTimestamps
.
IBM Watson Speech-to-Text offers keyword spotting but not punctuation.