Questions tagged [sample-data]

Sample data is a term used for publicly available sets of data in a variety of formats.

Sample data is used to get an application started quickly with data for demo purposes, or to load-test an application or database platform. The data held within may accurately represent a real data set, such as a list of countries, or be completely manufactured.

The idea is that it is not used as the basis of a good data sample, but is merely useful as 'data'.

Examples include (but are not limited to) CSV files, database backups, Excel workbooks, and plain text files. The OP would usually specify the format they require in their question.

151 questions
1
vote
3 answers

Sampling in Oracle

I'm trying to take a sample from an insurance claims database. For example, a 20% random sample from 1 million claims records where provider type is '25' and year is '2012'. The data is in SQL Developer. I am a statistician with basic SQL knowledge.
Kuusum
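The request in this question (filter first, then draw a 20% random sample) can be pictured with a small Python sketch rather than SQL. The field names `provider_type` and `year`, the helper `sample_claims`, and the toy records are assumptions made for illustration; none of them come from the asker's schema.

```python
import random


def sample_claims(claims, provider_type, year, fraction, seed=None):
    """Return a simple random sample (without replacement) of matching claims."""
    rng = random.Random(seed)
    # Step 1: keep only the rows that satisfy both conditions.
    matching = [
        c for c in claims
        if c["provider_type"] == provider_type and c["year"] == year
    ]
    # Step 2: draw the requested fraction of the filtered rows.
    k = round(len(matching) * fraction)
    return rng.sample(matching, k)


# Hypothetical toy records standing in for the claims table.
claims = [
    {
        "claim_id": i,
        "provider_type": "25" if i % 2 == 0 else "30",
        "year": "2012" if i % 5 else "2011",
    }
    for i in range(1000)
]

sample = sample_claims(claims, provider_type="25", year="2012", fraction=0.20, seed=42)
print(len(sample))
```

Filtering before sampling means the 20% applies to the matching claims only, not to the whole table. In Oracle itself, the `SAMPLE` clause or ordering by `DBMS_RANDOM.VALUE` would be natural starting points for the same idea.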
1
vote
1 answer

AdventureWorksDW2008R2: is there a data lineage documentation available?

AFAICS the Microsoft sample database AdventureWorksDW2008R2 is based on AdventureWorks2008R2. Is there an SSIS ETL package available which creates the AdventureWorksDW2008R2 database schema and data? And is there available a data mapping (data lineage)…
jrara
1
vote
1 answer

Creating Sample Data in MySQL Rails Database with Faker

I've had "fake" data loaded in my database for a while. Now I've made enough changes that I need to refresh the data. I have 3 tables that I've populated: Users, Stores, and Gears. My issue is populating the Gears table. I'm having two issues. First and…
DaveG
1
vote
1 answer

WP7 - access to sample data from code

In Expression Blend, I created a sample data source in the visual editor. When using a ListBox, I simply drag the collection onto it and the data is shown automatically. Now I would like to retrieve data from the data source in the code-behind. Is this…
Aliens
0
votes
1 answer

Sample data for testing binary linear classification code

I am looking for some sample binary data for testing my linear classification code. I need a data set where the data is 2D and each point belongs to one of two classes. If anyone has such data or any reference for the same, kindly reply. Any help is…
freeborn
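For a question like this one, 2D two-class test data can also be generated rather than downloaded. The sketch below draws two Gaussian clusters using only the standard library. The function name `make_two_class_2d` and its parameters are hypothetical, and the clusters are only approximately linearly separable: overlap grows as `spread` approaches `separation`.

```python
import random


def make_two_class_2d(n_per_class=50, separation=4.0, spread=1.0, seed=0):
    """Generate labelled 2D points from two Gaussian clusters.

    Class 0 is centred at (0, 0); class 1 at (separation, separation).
    """
    rng = random.Random(seed)
    points, labels = [], []
    for label, (cx, cy) in ((0, (0.0, 0.0)), (1, (separation, separation))):
        for _ in range(n_per_class):
            # Each coordinate is an independent normal draw around the centre.
            points.append((rng.gauss(cx, spread), rng.gauss(cy, spread)))
            labels.append(label)
    return points, labels


points, labels = make_two_class_2d(n_per_class=50, seed=0)
print(len(points), labels.count(0), labels.count(1))
```

Fixing `seed` keeps the data set reproducible, which is convenient when the same points are reused across test runs.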
0
votes
5 answers

Need sample data for Magento 1.5

I need sample data for Magento version 1.5.1.0. I couldn't find it on the Magentocommerce site. Kindly let me know where I can get it.
Sankar Subburaj
0
votes
1 answer

AS3: Extracted sampledata loops last chunk forever

I'm working on a virtual piano keyboard that processes imported mp3 sound samples. I've come to a solution with this question, but as I run this test code below, the sound loops its last data chunk of each sound source forever, and obviously, I want…
Eirún
0
votes
0 answers

How to adaptively sample a function with vector input and output?

I'm trying to adaptively sample a function using adaptive. Code: import adaptive def measure(setpoints: Union[float, np.ndarray]) -> Union[float, np.ndarray]: for setpoint in setpoints: measurement_schedule.add(setpoint) …
0
votes
0 answers

Data Model Sample for an Accounting Database (Credit/Debit)

I am looking for a sample database (or CSV/Excel files) of accounting journal entries. It needs to be reasonably close to an actual company's, preferably with a chart of accounts and departments/cost centers. I'll use it to teach data modelling in Power…
mshparber
0
votes
1 answer

Sampling with conditions using Pandas

Here is my…
0
votes
0 answers

Applying sample portfolio pricing to the overall portfolio

I have a large mortgage portfolio that includes mortgages from about 50 different financial institutions and this portfolio will receive pricing from one of the major valuation providers. I used stratified random sampling when selecting the sample,…
0
votes
0 answers

Incorrect number of dimensions error with bootstrap method using R

I am a newbie in programming with R, and am trying to reproduce the values from a research paper (Eur. Phys. J. C (2022) 82:1106 https://doi.org/10.1140/epjc/s10052-022-11071-9), where the maximum frequency values (MFV) statistic has been applied. The…
0
votes
0 answers

T-distributed Stochastic Neighbor Embedding (t-SNE)

I am trying to run T-distributed Stochastic Neighbor Embedding (t-SNE) in Jupyter but always face an issue with ValueError: could not convert string to float: '' Code: import numpy as np import pandas as…
0
votes
0 answers

What is the best way to sample data based on date range?

I have a weather dataset from 01 Nov 2007 until 18 May 2008. My data is date-dependent. I want to predict the temperature from 07 May 2008 until 18 May 2008 (which is maybe a total of 10-15 observations). My data size is around 200. I will be using…
nullUser
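For date-dependent data like this, one common approach is to split by date instead of sampling at random, so that the prediction window is held out as a block. The sketch below assumes one observation per day (which is consistent with the roughly 200 rows mentioned, but is still an assumption) and uses made-up placeholder temperatures. The helper `split_by_date` is hypothetical.

```python
from datetime import date, timedelta


def split_by_date(rows, test_start, test_end):
    """Split (date, value) rows into a training block and a held-out test block."""
    train = [r for r in rows if r[0] < test_start]
    test = [r for r in rows if test_start <= r[0] <= test_end]
    return train, test


# One placeholder row per day from 01 Nov 2007 to 18 May 2008 (200 days).
# The temperature values are invented purely so the example runs.
first_day = date(2007, 11, 1)
rows = [(first_day + timedelta(days=i), 10.0 + (i % 7)) for i in range(200)]

train, test = split_by_date(rows, date(2008, 5, 7), date(2008, 5, 18))
print(len(train), len(test))
```

Keeping the split chronological avoids training on observations that come after the period being predicted.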
0
votes
2 answers

Test data for spatial search unit tests

I'm looking for a small database (e.g. 50 addresses) of US postal addresses with ZIP codes, and if possible, a distance between each one for the spatial search engine unit tests. Do you know where I can get such a database?
Sfisioza
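If no ready-made address set is available, expected distances for spatial unit tests can be computed from coordinates. The sketch below uses the haversine great-circle formula with a mean Earth radius of 6371 km. The three labelled points are synthetic coordinates chosen so the distances are easy to check by hand; they are not real postal addresses, and the helper names are hypothetical.

```python
import math
from itertools import combinations

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, an approximation


def haversine_km(a, b):
    """Great-circle distance in kilometres between two (lat, lon) points in degrees."""
    lat1, lon1 = map(math.radians, a)
    lat2, lon2 = map(math.radians, b)
    dlat, dlon = lat2 - lat1, lon2 - lon1
    h = (math.sin(dlat / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin(dlon / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))


# Synthetic test points: one degree apart along the equator and a meridian.
points = {"A": (0.0, 0.0), "B": (0.0, 1.0), "C": (1.0, 0.0)}

# Pairwise distance table that a unit test could assert against.
distances = {
    (p, q): haversine_km(points[p], points[q])
    for p, q in combinations(sorted(points), 2)
}
for pair, km in distances.items():
    print(pair, round(km, 2))
```

One degree of arc on a sphere of this radius is about 111.19 km, so the A-B and A-C entries provide an easy sanity check before real geocoded addresses are substituted.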