Questions tagged [sound-recognition]
49 questions
1
vote
1 answer
What algorithm is used for audio feature extraction in Google's AudioSet?
I am getting started with Google's AudioSet. While the dataset is extensive, I find the information regarding the audio feature extraction very vague. The website mentions
128-dimensional audio features extracted at 1Hz. The audio features…

jerpint
- 399
- 2
- 16
1
vote
0 answers
Sound recognition for PhoneGap
I am creating an Android application using PhoneGap. I would like to record a sound (for example, a doorbell), and detect if that sound is heard again. Is there any sound recognition plugin for PhoneGap?
If not, how can I access the frequencies of…

PK123
- 43
- 8
1
vote
2 answers
How to compare .wav specimens in Delphi?
I would like to start studying .wav specimens by comparing two files, for example returning results such as duration, accent, and pronunciation as percentages, or any code that compares/analyzes .wav files.

XBasic3000
- 3,418
- 5
- 46
- 91
1
vote
1 answer
Does it matter which microphone is used for sound recognition?
If yes, then how do audio recognition apps work on different devices?
For example, Google Translate works on the iPhone 6, Samsung S6, HTC M9 ...

yutseho
- 1,639
- 1
- 15
- 27
1
vote
1 answer
How do I get the values of a specific frequency range?
I have a .wav file. I load it and get the following spectrogram showing the spectrum in dB:
https://i.stack.imgur.com/22TjY.png
Now I would like to know these values exactly because I want to compare them with another .wav file, to recognize whether these 4…

fler
- 33
- 4
1
vote
1 answer
Morse code audio decoder (Android)
I need to create a "Morse code decoder" for Android, very similar to this app: https://play.google.com/store/apps/details?id=org.jfedor.morsecode
My app must listen to sounds (Morse code) from the microphone and translate the code into the original…

Gioce90
- 554
- 2
- 10
- 31
1
vote
1 answer
What do the audio data bytes mean? (Android)
I am trying to create a scream and whistle detector in my Android application. I can now detect the user's whistle using the musicg Android library. I have to implement the scream detector myself because there is no library available.
In musicg,…

user2503808
- 71
- 1
- 10
1
vote
2 answers
Implementing a neural network for vowel recognition in MATLAB - input layer units and the structure?
I am doing a project on vowel recognition and I need to implement a neural network. I am new to this field so I am not entirely sure about how to do it right. I have a training set of 800 words with 8 types of vowels, and my first step was to check…

nanachan
- 1,051
- 1
- 15
- 26
1
vote
1 answer
Sound recognition on the web
I am building a web page where I need to do something when a particular sound is caught by the microphone. I searched a lot and found this link:
Write a Program Which Recognizes a Sound and Performs Action
I am stuck at two things:
how to use java…

mayurvir
- 337
- 4
- 14
1
vote
1 answer
Find out the title and artist of a song in Ruby
I've found an interesting gem that identifies which song an unknown file comes from, returning the title and the artist: https://github.com/tenderlove/earworm
The problem is that the last commit was 4 years ago and the website used is no longer…

Cydonia7
- 3,744
- 2
- 23
- 32
0
votes
1 answer
How to split a wave signal into frames
I'm working on a project about chord recognition. I'm using someone's journal article as a reference, but I still have little grasp of the field of DSP. According to her reference, the first thing I need to do is split the signal from the wav file into a number of frames. In my case, I…

Norman Pratama
- 67
- 2
- 12
0
votes
1 answer
How to analyze a voice and store the result?
I am thinking of creating an application.
The description is as follows:
1. People will upload a previously recorded conversation to the server.
2. An application on the server will detect the pitch, speed, emphasis, pronunciation, etc. of that voice and create…

Md Romel
- 1
- 1
0
votes
1 answer
.NET SDK for acquiring song info from an MP3 file (recognition)
Is there an SDK/Web service for acquiring music information (artist, album, year, etc.) using an MP3 file as input? Something like Android's SoundHound, but as an API for use with .NET.
EDIT: Maybe I wasn't clear enough. Input: an MP3 file without any…

Stefanos Kargas
- 10,547
- 22
- 76
- 101
0
votes
1 answer
Speech recognition returns unexpected error
So I get an error like the following:
Traceback (most recent call last):
  File "C:\Users\pc\AppData\Local\Programs\Python\Python38-32\lib\site-packages\speech_recognition\__init__.py", line 108, in get_pyaudio
    import pyaudio
…

LercDsgn
- 131
- 12
0
votes
2 answers
How to fit a scikit-learn model for feature vectors of varying lengths
I'm working on a sound classification project: given a set of audio recordings, I try to determine which class a certain recording falls into. You might compare this to a music genre or topic recognition (of a body of text) problem; my samples…

madprogramer
- 599
- 3
- 12
- 36