Questions tagged [dataset]

A dataset is a collection of data, generally represented in tabular form, with columns signifying different variables and rows signifying different members of the set. If you are looking for a freely available dataset for any purpose, please consider asking your question on https://opendata.stackexchange.com.

11414 questions
33 votes, 1 answer

Update DataSet structure in Visual Studio to match new SQL Database Structure

After making some changes to my SQL database structure (using scripts in SQL Server Management Studio), how can I update my DataSet.xsd file in Visual Studio to match the new structure? Must I do this manually? I considered deleting the DataSet and…
MattyG • 8,449 • 6 • 44 • 48

33 votes, 6 answers

Are there any example data sets for Python?

For quick testing, debugging, creating portable examples, and benchmarking, R has available to it a large number of data sets (in the Base R datasets package). The command library(help="datasets") at the R prompt describes nearly 100 historical…
a different ben • 3,900 • 6 • 35 • 45

31 votes, 1 answer

Handwritten English character data set: where to get one (openly available)?

We have created a handwritten character recognition system and now want to test it on English characters (both digits and letters). For digits, we have performed our testing on the MNIST data set. However, for the English alphabet we have not…
Ankit Rawat • 351 • 1 • 3 • 8

29 votes, 4 answers

Deleting hdf5 dataset using h5py

Is there any way to remove a dataset from an hdf5 file, preferably using h5py? Or alternatively, is it possible to overwrite a dataset while keeping the other datasets intact? To my understanding, h5py can read/write hdf5 files in 5 modes f =…
hsnee • 543 • 2 • 6 • 17

29 votes, 6 answers

Turn off IDENTITY_INSERT for Dataset insert

I am using a dataset to insert data being converted from an older database. The requirement is to maintain the current Order_ID numbers. I've tried using: SET IDENTITY_INSERT orders ON; This works when I'm in SQL Server Management Studio; I am able…
Nathan Koop • 24,803 • 25 • 90 • 125

28 votes, 4 answers

Feeding .npy (numpy files) into tensorflow data pipeline

TensorFlow seems to lack a reader for ".npy" files. How can I read my data files into the new tensorflow.data.Dataset pipeline? My data doesn't fit in memory. Each object is saved in a separate ".npy" file. Each file contains 2 different ndarrays as…
Sluggish Crow • 383 • 1 • 3 • 5

28 votes, 1 answer

Data sets for emotion detection in text

I'm implementing a system that could detect the human emotion in text. Are there any manually annotated data sets available for supervised learning and testing? Here are some interesting datasets: https://dataturks.com/projects/trending
ekka • 355 • 1 • 4 • 11

27 votes, 7 answers

Complex dataset split - StratifiedGroupShuffleSplit

I have a dataset of ~2m observations which I need to split into training, validation and test sets in the ratio 60:20:20. A simplified excerpt of my dataset looks like this: +---------+------------+-----------+-----------+ | note_id | subject_id |…
amin_nejad • 989 • 10 • 22

27 votes, 4 answers

What should I use to compare DBNull? Using DBNull.Value or ToString().IsNullOrEmpty()

I can check for a DBNull on a data row using either of two methods: if(dr[0][0]==DBNull.Value) //do something or if(dr[0][0].ToString().IsNullOrEmpty()) //do something. In both cases I get the same result. But which…
Shantanu Gupta • 20,688 • 54 • 182 • 286

27 votes, 3 answers

Adding rows to dataset

How can I create a DataSet that is filled manually, i.e., through code or by user input? I want to know the required steps and whether I need to create a DataTable or a DataRow first; I really don't know the steps to fill the DataSet.
sikas • 5,435 • 28 • 75 • 120

27 votes, 5 answers

How to add header to a dataset in R?

I need to read the 'wdbc.data' file in the following data folder: http://archive.ics.uci.edu/ml/machine-learning-databases/breast-cancer-wisconsin/ Doing this in R is easy using the read.csv command, but as the header is missing, how can I add it? I have the…
blueSurfer • 5,651 • 13 • 42 • 63

27 votes, 8 answers

Convert generic list to dataset in C#

I have a generic list of objects. Each object has 9 string properties. I want to turn that list into a dataset that I can pass to a DataGridView. What's the best way to go about doing this?
Goober • 13,146 • 50 • 126 • 195

27 votes, 4 answers

Looping through a DataTable

I have a DataTable with multiple columns and multiple rows. I want to loop through the DataTable dynamically; basically, the output should look as follows, excluding the braces: Name (DataColumn) Tom (DataRow) Peter (DataRow) Surname…
SpaceApple • 1,309 • 1 • 24 • 46

26 votes, 7 answers

Nullable types in strongly-typed datatables/datasets - workarounds?

Strongly-typed DataTables support "nullable" field types, except that the designer will not allow you to change the setting to "allow nulls" for any value-type fields (i.e., String types allow nulls, but ints do not). The workaround is to call…
Brady Moritz • 8,624 • 8 • 66 • 100

26 votes, 11 answers

Delphi: using TClientDataset as an in-memory dataset

According to this page, it's possible to use TClientDataset as an in-memory dataset, completely independent of any actual databases or files. It describes how to set up the dataset's table structure and how to load data into it at runtime. But when…
Mason Wheeler • 82,511 • 50 • 270 • 477