Highest Voted 'sound-recognition' Questions

1

vote

1 answer

What algorithm is used for audio feature extraction in Google's AudioSet?

I am getting started with Google's AudioSet. While the dataset is extensive, I find the information about the audio feature extraction very vague. The website mentions 128-dimensional audio features extracted at 1 Hz. The audio features…

audio machine-learning sound-recognition

asked May 19 '17 at 22:48

jerpint

1

vote

0 answers

Sound recognition for PhoneGap

I am creating an Android application using PhoneGap. I would like to record a sound (for example, a doorbell) and detect whether that sound is heard again. Is there any sound recognition plugin for PhoneGap? If not, how can I access the frequencies of…

android cordova phonegap-plugins fft sound-recognition

asked Sep 04 '16 at 08:45

PK123

1

vote

2 answers

How to compare .wav samples in Delphi?

I would like to start studying .wav samples by comparing two files, for example returning results for duration, accent, and pronunciation as percentages, or any code that compares or analyzes .wav files.

delphi audio wav pcm sound-recognition

asked Aug 13 '10 at 05:54

XBasic3000

1

vote

1 answer

Does it matter which microphone is used for sound recognition?

If yes, then how do audio recognition apps work on different devices? For example, Google Translate works on the iPhone 6, Samsung S6, HTC M9 ...

speech-recognition audio-recording google-search google-translate sound-recognition

asked Dec 29 '15 at 07:56

yutseho

1

vote

1 answer

How do I get the values of a specific frequency range?

I have a .wav file. I load it and get the following spectrogram showing the spectrum in dB: https://i.stack.imgur.com/22TjY.png Now I would like to know these values exactly because I want to compare them with another .wav file, to recognize whether these 4…

python matplotlib frequency spectrogram sound-recognition

asked Apr 02 '14 at 14:08

fler

1

vote

1 answer

Morse code audio decoder (Android)

I need to create a "Morse code decoder" for Android, very similar to this app: https://play.google.com/store/apps/details?id=org.jfedor.morsecode My app must listen to sounds (Morse code) from the microphone and translate the code into the original…

java android audio audio-fingerprinting sound-recognition

asked Dec 06 '13 at 11:22

Gioce90

1

vote

1 answer

What are the meanings of audio data bytes? Android

I am trying to create a scream and whistle detector in my Android application. I can now detect the user's whistle using the musicg Android library. I have to implement the scream detector myself because no library is available. In musicg,…

android algorithm audio signal-processing sound-recognition

asked Oct 25 '13 at 16:59

user2503808

1

vote

2 answers

Implementing a neural network for vowel recognition in MATLAB: input layer units and the structure?

I am doing a project on vowel recognition and I need to implement a neural network. I am new to this field so I am not entirely sure about how to do it right. I have a training set of 800 words with 8 types of vowels, and my first step was to check…

matlab neural-network sound-recognition

asked Sep 15 '13 at 07:42

nanachan

1

vote

1 answer

Sound recognition on the web

I am building a web page where I need to do something when a particular sound is picked up by the microphone. I searched a lot and found this link: "Write a Program Which Recognizes a Sound and Performs Action". I am stuck on two things: how to use java…

java audio web sound-recognition

asked Jan 14 '13 at 12:19

mayurvir

1

vote

1 answer

Find out the title and artist of a song in Ruby

I've found an interesting gem that identifies which song an unknown file comes from, returning the title and the artist: https://github.com/tenderlove/earworm However, the last commit was 4 years ago and the website it uses is no longer…

ruby audio sound-recognition

asked Apr 26 '12 at 23:05

Cydonia7

0

votes

1 answer

How to split a wave signal into frames

I'm working on a project about chord recognition. I'm using someone's journal article as a reference, but I still have little grasp of the field of DSP. According to her article, the first thing I need to do is split the signal from the wav file into a number of frames. In my case, I…

c# audio signal-processing sound-recognition music-notation

asked Feb 13 '12 at 00:43

Norman Pratama

0

votes

1 answer

How to analyze a voice and store the result?

I am thinking of creating an application. Here is the description: 1. People will upload previously recorded conversations to the server. 2. An application on the server will detect the pitch, speed, emphasis, pronunciation, etc. of that voice and create…

.net speech-recognition audio sound-synthesis sound-recognition

asked Aug 30 '11 at 17:43

Md Romel

0

votes

1 answer

.NET SDK for acquiring song info from an MP3 file (recognition)

Is there an SDK/Web service for acquiring music information (artist, album, year, etc.) using an MP3 file as input? Something like Android's SoundHound, but as an API for use with .NET. EDIT: Maybe I wasn't so clear. Input: an MP3 file without any…

.net audio sound-recognition

asked Aug 03 '11 at 13:22

Stefanos Kargas

0

votes

1 answer

Speech recognition returns unexpected error

So I get an error like the following: Traceback (most recent call last): File "C:\Users\pc\AppData\Local\Programs\Python\Python38-32\lib\site-packages\speech_recognition\__init__.py", line 108, in get_pyaudio import pyaudio …

python speech-recognition pyaudio sound-recognition

asked Oct 02 '20 at 16:44

LercDsgn

0

votes

2 answers

How to fit a scikit model for feature vectors of varying lengths

I'm working on a sound classification project: given a set of audio recordings, I try to determine which class a certain recording would fall into. You might compare this to a music genre or topic recognition (of a body of text) problem, my samples…

scikit-learn scipy audio-processing librosa sound-recognition

asked Jul 14 '20 at 11:23

madprogramer
