Questions tagged [sound-recognition]

49 questions
1 vote, 1 answer

What algorithm is used for audio feature extraction in Google's AudioSet?

I am getting started with Google's AudioSet. While the dataset is extensive, I find the information regarding the audio feature extraction very vague. The website mentions 128-dimensional audio features extracted at 1 Hz. The audio features…
jerpint
1 vote, 0 answers

Sound recognition for PhoneGap

I am creating an Android application using PhoneGap. I would like to record a sound (for example, a doorbell) and detect whether that sound is heard again. Is there any sound recognition plugin for PhoneGap? If not, how can I access the frequencies of…
1 vote, 2 answers

How to compare .wav specimens in Delphi?

I would like to start studying .wav specimens by comparing two files, for example returning results such as duration, accent, and pronunciation as percentages, or any code that compares or analyzes .wav files.
XBasic3000
1 vote, 1 answer

Does it matter which microphone is used for sound recognition?

If yes, then how do audio recognition apps work on different devices? For example, Google Translate works on the iPhone 6, Samsung S6, HTC M9 ...
1 vote, 1 answer

How do I get the values of a specific frequency range?

I have a .wav file. I load it and get the following spectrogram showing the spectrum in dB: https://i.stack.imgur.com/22TjY.png Now I would like to know these values exactly because I want to compare them with another .wav file, to recognize whether these 4…
1 vote, 1 answer

Morse code audio decoder (Android)

I need to create a "Morse code decoder" for Android, very similar to this app: https://play.google.com/store/apps/details?id=org.jfedor.morsecode My app must listen to sounds (Morse code) from the microphone and translate the code into the original…
Gioce90
1 vote, 1 answer

What are the meanings of audio data bytes? Android

I am trying to create a scream and whistle detector in my Android application. I can already detect the user's whistle using the musicg Android library. I have to implement the scream detector myself because there is no library available. In musicg,…
1 vote, 2 answers

Implementing a neural network for vowel recognition in MATLAB - input layer units and the structure?

I am doing a project on vowel recognition and I need to implement a neural network. I am new to this field so I am not entirely sure about how to do it right. I have a training set of 800 words with 8 types of vowels, and my first step was to check…
nanachan
1 vote, 1 answer

Sound recognition on the web

I am building a web page where I need to do something when a particular sound is caught by the microphone. I searched a lot and found this link: Write a Program Which Recognizes a Sound and Performs Action. I am stuck at two things: how to use java…
mayurvir
1 vote, 1 answer

Find out the title and artist of a song in Ruby

I've found an interesting gem that identifies which song an unknown file comes from, returning the title and the artist: https://github.com/tenderlove/earworm However, the last commit was 4 years ago and the website it uses is no longer…
Cydonia7
0 votes, 1 answer

How to split a wave signal into frames

I'm working on a project about chord recognition. I'm using someone's journal as a reference, but I still have little grasp of the field of DSP. According to her reference, the first thing I need to do is split the signal from the wav file into a number of frames. In my case, I…
0 votes, 1 answer

How to analyze a voice and store the result?

I am thinking of creating an application. The description follows: 1. People will upload a previously recorded conversation to the server. 2. An application on the server will detect the pitch, speed, emphasis, pronunciation, etc. of that voice and create…
0 votes, 1 answer

.NET SDK for acquiring song info from an MP3 file (recognition)

Is there an SDK or web service for acquiring music information (artist, album, year, etc.) using an MP3 file as input? Something like Android's SoundHound, but as an API for use with .NET. EDIT: Maybe I wasn't so clear. Input: an MP3 file without any…
Stefanos Kargas
0 votes, 1 answer

Speech recognition returns unexpected error

So I get an error like the following: Traceback (most recent call last): File "C:\Users\pc\AppData\Local\Programs\Python\Python38-32\lib\site-packages\speech_recognition\__init__.py", line 108, in get_pyaudio import pyaudio …
LercDsgn
0 votes, 2 answers

How to fit a scikit model for feature vectors of varying lengths

I'm working on a sound classification project: given a set of audio recordings, I try to determine which class a certain recording falls into. You might compare this to a music genre or topic recognition (of a body of text) problem; my samples…