Questions tagged [dataset]

A dataset is a collection of data, generally represented in tabular form, with columns signifying different variables and rows signifying different members of the set. If you are looking for a freely available dataset for any purpose, please consider asking your question on https://opendata.stackexchange.com.

11414 questions
23
votes
2 answers

Need a data set for fraud detection

I have a fraud detection algorithm, and I want to check whether it works against a real-world data set. My algorithm says whether a claim is usual or not. Are there any data sets available?
saeed arash
  • 865
  • 2
  • 8
  • 14
23
votes
9 answers

How to fill a DataSet with multiple tables?

I'm trying to fill a DataSet that contains 2 tables with a one-to-many relationship. I'm using a DataReader to achieve this: public DataSet SelectOne(int id) { DataSet result = new DataSet(); using (DbCommand command =…
Andriy Zakharko
  • 1,623
  • 2
  • 16
  • 37
22
votes
4 answers

What is the difference between dataset and database?

What is the difference between a dataset and a database? If they are different, then how? Why is huge data difficult to manage using databases today? Please answer independently of any programming language.
Lokesh Sah
  • 2,283
  • 5
  • 23
  • 33
22
votes
3 answers

How do you alter the size of a PyTorch Dataset?

Say I am loading MNIST from torchvision.datasets.MNIST, but I only want to load 10000 images in total. How would I slice the data to limit it to only some number of data points? I understand that the DataLoader is a generator yielding data in the…
mikal94305
  • 4,663
  • 8
  • 31
  • 40
22
votes
3 answers

How to create a dataset similar to CIFAR-10

I want to create a dataset that has the same format as the CIFAR-10 data set to use with TensorFlow. It should have images and labels. I'd like to be able to take the CIFAR-10 code, swap in different images and labels, and run it.
BlackyTheCat
  • 343
  • 1
  • 2
  • 9
22
votes
3 answers

How to get the nth row from a data frame in R

For example, I have the following data frame:
> dataFrame <- read.csv(file="data.csv")
> dataFrame
  Ozone Solar.R Wind Temp Month Day
1    41     190  7.4   67     5   1
2    36     118  8.0   72     5   2
3    12     149 12.6   74     5 …
maksadbek
  • 1,508
  • 2
  • 15
  • 28
22
votes
5 answers

ERROR [IM014] [Microsoft][ODBC Driver Manager] The specified DSN contains an architecture mismatch between the Driver and Application

How can I fix this error? ERROR [IM014] [Microsoft][ODBC Driver Manager] The specified DSN contains an architecture mismatch between the Driver and Application. It is occurring in a new project that I am working on. I'm using VB 2008, MySQL as my…
bolek evora
  • 231
  • 1
  • 2
  • 8
21
votes
5 answers

Git + a large data set?

We're often working on a project where we've been handed a large data set (say, a handful of files that are 1GB each), and are writing code to analyze it. All of the analysis code is in Git, so everybody can check changes in and out of our central…
bk.
  • 6,068
  • 2
  • 24
  • 28
21
votes
5 answers

Load a dataset into R with data() using a variable instead of the dataset name

I am trying to load a dataset into R using the data() function. It works fine when I use the dataset name (e.g. data(Titanic) or data("Titanic")). What doesn't work for me is loading a dataset using a variable instead of its name. For example: #…
pazof
  • 944
  • 1
  • 12
  • 26
21
votes
13 answers

C#: What Else Do You Use Besides DataSet

I've found myself increasingly unsatisfied with the DataSet/DataTable/DataRow paradigm in .NET, mostly because it's often a couple of steps more complicated than what I really want to do. In cases where I'm binding to controls, DataSets are fine. …
saalon
  • 3,644
  • 3
  • 33
  • 40
20
votes
6 answers

Export a C# DataSet to a text file

There are a lot of examples online of how to fill a DataSet from a text file but I want to do the reverse. The only thing I've been able to find is this but it seems... incomplete? I want it to be in a readable format, not just comma delimited, so…
bobble14988
  • 1,749
  • 5
  • 26
  • 38
20
votes
1 answer

How to create a Spark Dataset from an RDD

I have an RDD[LabeledPoint] intended to be used within a machine learning pipeline. How do we convert that RDD to a Dataset? Note that the newer spark.ml APIs require inputs in the Dataset format.
WestCoastProjects
  • 58,982
  • 91
  • 316
  • 560
20
votes
11 answers

How to Change Date Format in .NET RDLC Report?

I need to format my date column as 01-Jan-2013. What is the format to achieve this in RDLC? I have used =CDate(Fields!IssuingDate.Value).ToString("dd-mmm-yyyy") but it's not working correctly. Can anyone post the format for 02-Jul-2013?
Stephen L
  • 351
  • 2
  • 6
  • 18
19
votes
3 answers

Dremel - repetition and definition level

Reading the Interactive Analysis of Web-Scale Datasets paper, I bumped into the concept of repetition and definition level. While I understand the need for these two, to be able to disambiguate occurrences, it attaches a repetition and definition level…
Tony Tannous
  • 14,154
  • 10
  • 50
  • 86
19
votes
2 answers

Replacing the obsolete System.Xml.XmlDataDocument?

I have a System.Web.UI.WebControls.Xml control (Xml1) in a WebForms app that I have upgraded from .NET 2.0 to .NET 4.0. I am getting two warnings from the code-behind page that I'd like to do something about. ... Dim ds As DataSet =…
Nick
  • 4,115
  • 10
  • 45
  • 57