Questions tagged [data-generation]
263 questions
5
votes
3 answers
Test data generator for JPA or Hibernate
Are there any tools or libraries that can be used to generate test data using JPA or entity beans? I believe this would be very helpful for unit testing, where we can have an in-memory database with data dynamically generated just when we start our…

Bala
5
votes
1 answer
Vs2010 Data Generation Plan fails with "Data generation failed because of the following exception: Column "xyz" does not allow DBNull.Value"
I'm fairly new to VS data capabilities, and this is my first data generation plan. I have implemented a database using a VS2010 database project, and used it to deploy to a SQL Server Express 2008 database. All the tables use identity columns as…

bwerks
5
votes
2 answers
How do I exclude auto generated columns from SQL Server Generate Script Data Only?
SQL Server Generate Script does a great job of creating a script for the data in the tables using the Data Only option for 'Types of data to script' in the advanced options. However, the generated script also includes all the identifiers such as…

dchitnis
5
votes
2 answers
Where to get data to compare cache algorithms
I want to compare cache algorithms like LRU, SLRU, LFU, etc. on real data.
That's why I need some method to generate realistic data to compare cache algorithms, or a way to get this data from some application.

chinskiy
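For the cache-comparison question above, one common stand-in for real traffic is a synthetic trace with skewed (Zipf-like) key popularity. The sketch below is only an illustration under that assumption: `zipf_trace`, `lru_hit_rate`, and the `skew` parameter are hypothetical names, not taken from the question, and only LRU is shown; SLRU or LFU would be measured by replaying the same trace.

```python
import random
from collections import OrderedDict
from itertools import accumulate


def zipf_trace(n_keys, n_requests, skew=1.0, seed=0):
    """Return a list of key ids whose popularity follows a Zipf-like law.

    Assumption: key of rank r is requested with weight 1 / r**skew.
    """
    rng = random.Random(seed)
    weights = [1.0 / (rank ** skew) for rank in range(1, n_keys + 1)]
    cum_weights = list(accumulate(weights))
    return rng.choices(range(n_keys), cum_weights=cum_weights, k=n_requests)


def lru_hit_rate(trace, capacity):
    """Replay a trace through an LRU cache and return the hit rate."""
    cache = OrderedDict()
    hits = 0
    for key in trace:
        if key in cache:
            hits += 1
            cache.move_to_end(key)  # mark as most recently used
        else:
            cache[key] = True
            if len(cache) > capacity:
                cache.popitem(last=False)  # evict least recently used
    return hits / len(trace) if trace else 0.0


if __name__ == "__main__":
    trace = zipf_trace(n_keys=1000, n_requests=10000, skew=1.0)
    for capacity in (10, 100, 500):
        print(capacity, lru_hit_rate(trace, capacity))
```

Using a fixed seed keeps the trace reproducible, so every algorithm under comparison sees exactly the same request sequence.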
5
votes
0 answers
SSMS 2012 - Data Only Script Generation With If Not Exists
I would like to generate a data-only insert script within SSMS 2012 that contains IF NOT EXISTS statements to make the script idempotent.
Right-clicking the database I wish to generate the script for and selecting Tasks -> Generate Scripts the…

Jpin
5
votes
1 answer
How to generate test data for a "group by data from other rows" algorithm
UPDATE: I'm looking for a technique to compute data for all edge cases of my algorithm (or an arbitrary algorithm, for that matter).
What I have tried so far is just thinking about what might be edge cases and producing some "random" data, but I don't know how…

Aprillion
4
votes
2 answers
PyTorch data generator for extracting 2D images from many 3D cubes
I'm struggling to create a data generator in PyTorch to extract 2D images from many 3D cubes saved in .dat format.
There is a total of 200 3D cubes each having a 128*128*128 shape. Now I want to extract 2D images from all of these cubes along…

Quamer Nasim
4
votes
1 answer
Generate new data based on existing dataset using Python
I have a dataset consisting of 21 unique data records. To benchmark the performance of certain algorithms like kNN and SVM by increasing the number of samples for each class, I would like to test on data with at least 20 or more unique records for each…

Rob Alamgir
4
votes
4 answers
How to repeat data with flow_from_directory in Keras
I am trying to use Keras flow_from_directory to train a model, but it does not repeat the data after the epoch (i.e. when all the data has been iterated). I could not find any option to do so either. Below is my code for data generation while…

danishansari
4
votes
2 answers
Generate data by using existing dataset as the base dataset
I have a dataset consisting of 100k unique data records. To benchmark the code, I need to test on data with 5 million unique records, and I don't want to generate random data. I would like to use the 100k data records which I have as the base dataset…

namrutha
4
votes
2 answers
Should I generate massive amounts of SQL data on the client or in SQL Server?
I am writing a program to generate a massive amount of data (~1 billion records spread across ~20 tables) and populate tables in SQL Server. This data spans multiple tables with potentially multiple foreign key constraints, as well as…

Andrew
4
votes
2 answers
Generate a data frame with three columns and each row with a constant sum
I want to generate a large data frame (100,000 rows) with three columns (A, B and C).
The data frame must satisfy two conditions:
in each row, A+B+C=1;
all the A have a triangular distribution (min=0.2,mod=0.3,max=0.4), all…

Songchao Chen
4
votes
2 answers
Keras Image Data Generator show labels
I am using an ImageDataGenerator to augment my images. I need to get the y labels from the generator.
Example: I have 10 training images; 7 are label 0 and 3 are label 1. I want to increase the training set size to 100.
total_training_images =…

Adit Sanghvi
4
votes
2 answers
Satellite Map Analysis for Building Generation
Has anyone ever heard of a program which analyses a satellite map and attempts to generate three-dimensional buildings that roughly match the length/width of their real-life counterparts?
The use in programs like Google Earth or FlightGear would be…

Tomas
4
votes
1 answer
Fast way to mock up hierarchical data easily
I'm seeking a quick and easy solution to help mock/populate/test an org chart (in ASP MVC) with tree-based or hierarchical data. I need the test data for this...
I have used both http://www.generatedata.com and mock-aro (both of which I like, but…

aggie
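For the org-chart question above, hierarchical test data can be mocked as a flat adjacency list (each row pointing at its parent), which most tree widgets and ORMs can consume. This is a minimal sketch, not tied to any of the tools named in the question; `make_org_chart`, `max_children`, and the row field names are hypothetical choices made for illustration.

```python
import random


def make_org_chart(n_nodes, max_children=4, seed=0):
    """Return rows of a random tree as dicts with id, parent_id and name.

    Node 1 is the root (parent_id is None). Every other node picks a random
    earlier node that still has fewer than max_children children.
    """
    if n_nodes < 1:
        return []
    rng = random.Random(seed)
    rows = [{"id": 1, "parent_id": None, "name": "Employee 1"}]
    child_count = {1: 0}
    open_ids = [1]  # nodes that can still accept children
    for node_id in range(2, n_nodes + 1):
        parent = rng.choice(open_ids)
        rows.append(
            {"id": node_id, "parent_id": parent, "name": f"Employee {node_id}"}
        )
        child_count[parent] += 1
        if child_count[parent] >= max_children:
            open_ids.remove(parent)
        child_count[node_id] = 0
        open_ids.append(node_id)
    return rows


if __name__ == "__main__":
    for row in make_org_chart(10):
        print(row)
```

Because parents always have a smaller id than their children, the rows can be inserted in order without violating a self-referencing foreign key.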