Questions tagged [sample-data]

Sample data is a term used for publicly available sets of data in a variety of formats.

Sample data is used to get an application started quickly with data for demo purposes, or to load test an application or database platform. The data may accurately represent a real data set, such as a list of countries, or be completely manufactured.

The idea is that it is not used as the basis of a good data sample, but is merely useful as 'data'.

Examples include (but are not limited to) CSV files, database backups, Excel workbooks, and plain text files. The OP will usually specify the format they require in their question.
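
As a rough illustration of the "completely manufactured" case described above, the Python sketch below builds a few made-up rows and serializes them as CSV. The column names, the value ranges, and the helper names `make_sample_rows` and `rows_to_csv` are assumptions chosen for this example; nothing in the tag description prescribes them.

```python
import csv
import io
import random


def make_sample_rows(n, seed=0):
    """Return n manufactured rows; every value here is invented for demo use."""
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    return [
        {"id": i, "name": f"user{i}", "amount": round(rng.uniform(1, 500), 2)}
        for i in range(1, n + 1)
    ]


def rows_to_csv(rows):
    """Serialize the rows to CSV text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=["id", "name", "amount"])
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


csv_text = rows_to_csv(make_sample_rows(5))
```

Seeding the generator keeps the manufactured data identical between runs, which tends to help when the same sample is reused for demos or load tests.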

151 questions
3
votes
0 answers

How to remove silence from an audioFile

I have recorded an audio file using AVAudioEngine: [mainMixer installTapOnBus:0 bufferSize:4096 format:[mainMixer outputFormatForBus:0] block:^(AVAudioPCMBuffer *buffer, AVAudioTime *when) { NSError *error; // as AVAudioPCMBuffer's are…
Thomas
  • 177
  • 2
  • 9
3
votes
5 answers

Resample with replacement by group

This is my first post so please bear with me. Below is a small sample of my data. My actual dataset has over 4,000 individual IDs and each ID can have anywhere from one to two hundred separate dollar amounts assigned to it. ID Dollars 001 …
YimYames
  • 99
  • 1
  • 12
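
One possible reading of the question above is a per-ID bootstrap: for each ID, draw as many dollar amounts as that ID has, with replacement. The Python sketch below assumes that reading. The rows are hypothetical stand-ins, not the asker's data, and `resample_by_group` is a name made up for this example.

```python
import random
from collections import defaultdict


def resample_by_group(records, seed=None):
    """Bootstrap each ID separately: draw len(group) values with replacement."""
    rng = random.Random(seed)
    groups = defaultdict(list)
    for record_id, dollars in records:
        groups[record_id].append(dollars)
    # random.choices samples with replacement; k keeps each group's size.
    return {
        record_id: rng.choices(values, k=len(values))
        for record_id, values in groups.items()
    }


# Hypothetical (ID, dollars) rows; not taken from the question.
records = [("A", 10.0), ("A", 25.5), ("A", 7.25), ("B", 100.0), ("C", 3.0), ("C", 4.0)]
resampled = resample_by_group(records, seed=42)
```

An ID with a single amount always resamples to that same amount, so groups of size one add no variability under this scheme.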
3
votes
2 answers

How to set parameters in WEKA to balance data with SMOTE filter?

I'm using SMOTE filter in WEKA to balance data. I have doubts about the two parameters nearestNeighbors and percentage. nearestNeighbors -- The number of nearest neighbors to use. percentage -- The percentage of SMOTE instances to create. How…
user3290180
  • 4,260
  • 9
  • 42
  • 77
3
votes
1 answer

building positive samples for rotated images for cascaded training in OpenCV

I need to train a cascaded classifier to detect vehicles at different viewing angles. I'm using OpenCV. Some of the angles that I need to capture cause the placement of the vehicle within the image to be diagonal, as shown below: Now the problem…
user961627
  • 12,379
  • 42
  • 136
  • 210
3
votes
1 answer

Magento Sample data Installation: Table already exists

I have been trying to install Magento's sample data for a few days now and have tried numerous things, but nothing seems to work. I am using EasyPHP 5.3.3.1 on Windows 7. I downloaded the sample data files and ran the SQL to create tables for the sample…
Sohail
  • 45
  • 1
  • 6
2
votes
2 answers

call javascript function for every nth user

I am collecting some sample data on my site and for now just want to collect data from, say, 10% of my visitors using a JavaScript function. One way I can think of is to pick a number between 1 and 10 at random and, if it's 10, call the JS function to…
Rols
  • 153
  • 11
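
The idea in the question above, selecting roughly 10% of visitors at random, can be sketched as a single comparison against a uniform random number. The sketch is in Python rather than JavaScript, and `should_sample` is a hypothetical name; the same comparison carries over to JavaScript, where `Math.random()` also returns a value in [0, 1).

```python
import random


def should_sample(rate=0.10, rng=random):
    """Return True for roughly `rate` of calls; each call is independent."""
    return rng.random() < rate


# Rough check with a seeded generator: about 10% of simulated visitors are picked.
rng = random.Random(123)
visitors = 100_000
picked = sum(should_sample(0.10, rng) for _ in range(visitors))
fraction = picked / visitors
```

This selects about one visitor in ten on average, not exactly every tenth visitor; a strict "every nth" rule would need a shared counter instead.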
2
votes
2 answers

Sample data for IPv6?

Do any sample data sets exist for IPv6 traffic, e.g. Apache logs or traffic analysis logs? Alternatively does anyone have any ideas how to generate one or - optimistically - an existing tool to generate data? Ideally I would like sample addresses…
Rup
  • 33,765
  • 9
  • 83
  • 112
2
votes
3 answers

How can I disable the sample data in trac

I just created my first Trac project, but I don't want the sample data. How can I disable it? I read the documentation and there is the wiki remove command. Do I have to use this for all pages? Is there an easy way to disable the sample wiki?
Mokus
  • 10,174
  • 18
  • 80
  • 122
2
votes
1 answer

Tensorflow Resnet with unbalanced classes

I use Resnet with Tensorflow to train a model with 20 classes. My problem is that I have 6-7 classes with A LOT of samples, about the same number of classes with a medium number of samples and the rest of classes with few samples. With this given…
2
votes
2 answers

Sample Data Not Appearing During Runtime

I have a RadGrid with sample data being used. The problem I am facing is that, even after I changed the design-time data context mode (that is, changed "d:DataContext" to "DataContext"), I am unable to see the sample data during runtime, though the…
Sangeetha
  • 485
  • 2
  • 9
  • 24
2
votes
1 answer

Take a sample without group in dplyr, R

I know how to take a random sample from each group of a data frame using sample_n or sample_frac in dplyr, like this: dataset %>% group_by(user_id) %>% sample_n(10) However, I have a slightly different question. I want to take a random…
Ding Li
  • 673
  • 1
  • 7
  • 19
2
votes
1 answer

create a matrix of samples in R

I have a probability distribution X and I would like to create samples of 100 observations: I use sample(X,size=100,replace=TRUE) I would like to plot the sample mean PDF on 100,1000,10000 samples so I tried to create matrices of observations…
KJohn
  • 193
  • 1
  • 1
  • 15
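
The setup in the question above (many samples of 100 observations each, drawn with replacement, then one mean per sample) can be sketched as a list of rows. This is a Python illustration, not the asker's R code; the population below is a hypothetical stand-in for X, and `sample_matrix` is a name invented for the example.

```python
import random
from statistics import mean


def sample_matrix(population, n_samples, size=100, seed=None):
    """Return n_samples rows, each holding `size` draws with replacement."""
    rng = random.Random(seed)
    return [rng.choices(population, k=size) for _ in range(n_samples)]


# Hypothetical population standing in for X.
population = [1, 2, 3, 4, 5, 6]
matrix = sample_matrix(population, n_samples=1000, size=100, seed=0)
sample_means = [mean(row) for row in matrix]  # one mean per row
```

Changing `n_samples` to 100, 1000, or 10000 gives the three sets of sample means the question mentions; plotting them is left out of the sketch.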
2
votes
3 answers

Where to find free-to-use, complex test databases?

I need a database for testing a data masking solution. Any database would do (Oracle/MSSQL flavor). I just need the schema / db definition. I have tools to generate data records. The database I am looking for should have 30-odd tables with…
Jay
  • 2,394
  • 11
  • 54
  • 98
2
votes
2 answers

Oracle sample data problems

So, I have this Java-based data transformation / masking tool, which I wanted to test out on Oracle 10g. The good part with Oracle 10g is that you get a load of sample schemas with half a million records in some. The schemas are: SH, OE, HR, IX and…
Jay
  • 2,394
  • 11
  • 54
  • 98
2
votes
1 answer

How to obtain the unsampled samples after PROC SURVEYSELECT in SAS?

I use PROC SURVEYSELECT in SAS to sample a data set DT, OUT=DT_SAMPLED, but I would also like to have those samples in DT that didn't get selected. May I know if there is a way? Thanks in advance.
Kevin
  • 2,191
  • 9
  • 35
  • 49