Questions tagged [missing-data]

For questions relating to missing data problems, which can involve special data structures, algorithms, statistical methods, modeling techniques, visualization, and other considerations.

When working with data in regular data structures (e.g. tables, matrices, arrays, tensors), some data may not be observed, may be corrupted, or may not yet be observed. Treatment of such data requires additional annotation as well as methodological considerations when deciding how to impute or use such data in standard contexts. This becomes a problem in data-intensive contexts, such as large statistical analyses of databases.
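
As a minimal sketch of the "additional annotation" idea above: one common approach is to keep a boolean mask recording which entries were missing, so that imputed values can still be told apart from observed ones. The function names `missing_mask` and `impute_mean` are hypothetical, `None` is assumed to be the missing-value marker, and mean imputation is used only as the simplest illustration, not as a recommended method.

```python
from typing import List, Optional, Tuple


def missing_mask(values: List[Optional[float]]) -> List[bool]:
    """Return True at every position where the value is missing (None)."""
    return [v is None for v in values]


def impute_mean(values: List[Optional[float]]) -> Tuple[List[float], List[bool]]:
    """Fill missing entries with the mean of the observed entries.

    Returns the filled list together with the missingness mask, so the
    caller can still distinguish imputed values from observed ones.
    """
    observed = [v for v in values if v is not None]
    if not observed:
        raise ValueError("cannot impute: no observed values")
    mean = sum(observed) / len(observed)
    mask = missing_mask(values)
    filled = [mean if is_missing else v for v, is_missing in zip(values, mask)]
    return filled, mask


if __name__ == "__main__":
    filled, mask = impute_mean([1.0, None, 3.0])
    print(filled, mask)
```

Keeping the mask alongside the filled data is a design choice: downstream analyses can then treat imputed entries differently, or ignore them entirely.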

Missing data occur in many fields, from survey data to industrial data. There are many possible missing-data mechanisms (reasons why the data are missing). In survey data, for example, values might be missing due to drop-out, or because respondents ran out of time.

Rubin classified missing data into three types:

  1. missing completely at random;
  2. missing at random;
  3. missing not at random.

Note that some statistical analyses are valid only under certain of these classes.
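
The three mechanisms listed above can be illustrated with a small simulation. This is a sketch under stated assumptions, not a definitive treatment: the variables `x` (always observed) and `y` (sometimes missing), the logistic missingness model, and all function names are hypothetical choices made for illustration. It compares the mean of the observed `y` values (a complete-case analysis) with the mean of the full data under each mechanism.

```python
import math
import random


def sigmoid(t: float) -> float:
    return 1.0 / (1.0 + math.exp(-t))


def simulate(n: int = 20000, seed: int = 0):
    """Draw x ~ N(0, 1) and y = x + noise; x is fully observed."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in range(n)]
    y = [xi + rng.gauss(0.0, 1.0) for xi in x]
    return rng, x, y


def mask_mcar(rng, x, y, p: float = 0.3):
    """MCAR: each y is dropped with the same probability p."""
    return [None if rng.random() < p else yi for yi in y]


def mask_mar(rng, x, y):
    """MAR: the drop probability depends only on the observed x."""
    return [None if rng.random() < sigmoid(2 * xi) else yi
            for xi, yi in zip(x, y)]


def mask_mnar(rng, x, y):
    """MNAR: the drop probability depends on the (unobserved) y itself."""
    return [None if rng.random() < sigmoid(2 * yi) else yi for yi in y]


def observed_mean(values):
    """Complete-case mean: average over the non-missing entries only."""
    obs = [v for v in values if v is not None]
    return sum(obs) / len(obs)


if __name__ == "__main__":
    rng, x, y = simulate()
    print("full data:", sum(y) / len(y))
    print("MCAR:", observed_mean(mask_mcar(rng, x, y)))
    print("MAR: ", observed_mean(mask_mar(rng, x, y)))
    print("MNAR:", observed_mean(mask_mnar(rng, x, y)))
```

In this setup the complete-case mean stays close to the full-data mean only under MCAR. Under MAR and MNAR it is pulled downward, because larger values are more likely to be dropped.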

2809 questions
1
vote
1 answer

How to replace multiple types of values with NAs in a data frame in R

I have a data frame where multiple types of values need to be replaced with NAs, while in some other columns these values are valid data to be kept. For example, > df <- data.frame( …
David Z
1
vote
1 answer

Forecasting to fill missing values in time series

I have many time series objects that look like Jan Feb Mar Apr ...... 2007 1 2 3 NA 2008 3 4 5 7 2009 4 2 NA 1 . . I want to forecast to fill in the missing data with rforecast, by actively looking for the NAs and…
BaconDoggie
1
vote
1 answer

Fitting discrete (negative binomial) distribution for early data values

I'm having some difficulties with fitting a discrete distribution function (I'm specifically using the negative binomial distribution). Here's my setting: I have a source of incoming items, each with an unknown lifetime. Every day, some expire (a big…
1
vote
1 answer

WCF service receives incorrect parameter data

There is a service which has been migrated to WCF. Most methods have int, string, DateTime parameters and everything is working fine, but one method receives an array of custom class objects. [DataContract] public class Term { [DataMember] …
va.
1
vote
0 answers

First Data E4 Gateway Hosted Payment Pages Form Values Missing

I'm using the First Data E4 Gateway Hosted Payment Pages. Under Payment Pages - Receipt Page Settings I've selected: "Automatic linking to the receipt page including result parameters using HTTP POST method (AUTO-POST)". I'm not using Relay Response…
CoolBreeze
1
vote
2 answers

How to fill missing data set from Matlab or Excel

I have a data set (daily and monthly rainfall data), but many values are missing. In order to use the data for training I need to fill in the missing data. Can you tell me how I can do it? I used SPSS Statistics, but the values do not seem good, or maybe I…
Peter
1
vote
2 answers

Imputing missing values from respective column

I'm trying to fill in the missing values with values randomly sampled from the same column. For instance, if the data for variable "apple" looks like (1, 2, 3, NA), then I want NA to be imputed randomly from (1, 2, 3). I've come up with the following code, but I'm trying…
halo09876
1
vote
0 answers

Missing Data (Event Matching System)

I am working on an event matching system and I have a large dataset of pairs of events. Each pair of events has 2 titles, 2 venues, 2 start dates, 2 latitude/longitude coordinate pairs, etc. Some of the information is missing (for example, start…
1
vote
1 answer

Subtracting array values from another unequal array

I have the following which works just fine when the arrays are of an equal length: (example) $highNums = array(10,20,30,40,50,60); $lowNums = array(0,1,2,3,4,5); $result = array(); for($i=0;$i
WayneT
1
vote
0 answers

LDAP missing attributes

When I try to get all attributes of my LDAP InitialDirContext, some attributes are missing. _IDC = new javax.naming.directory.InitialDirContext("..."); javax.naming.directory.DirContext dc = (javax.naming.directory.DirContext)…
Charles Follet
1
vote
2 answers

Remove columns from data set in R having constant non-missing values

Consider the following data frame: data <- data.frame(a=c("Yes","Yes",NA),b=c(2,4,3),c=c(NA,3,3),d=c("B","B","B")) a b c d 1 Yes 2 NA B 2 Yes 4 3 B 3 3 3 B Desired output: b 1 2 2 4 3 3
Sup
1
vote
1 answer

Plotting different shapes for data points belonging to different groups in R

I am trying to plot the first two columns of a matrix Y against each other, assigning the data points different shapes and colors according to which group they belong to in the 12th column of my data set. Below is my code: X <-…
user2976477
1
vote
2 answers

How to na.locf in R without using additional packages

Given a vector such as (say) c(2,NA,5,NA,NA,1,NA) the problem is to "last observation carry forward" resulting in vector c(2,2,5,5,5,1,1). As answered here, na.locf from the zoo package can do this. However, given the simplicity of the problem, and…
Museful
1
vote
1 answer

SPSS Syntax - How to deal with missing values through SPSS Syntax

I'm new to this forum. I have to do a presentation on how SPSS deals with missing values. Specifically, our professor gave us the task to: 1) Find out if, besides the functions accessible through the menus, there are functions accessible via SPSS…
1
vote
1 answer

JSF not displaying first row, first column data

I have a very simple page that is displaying canned data. I experienced this problem with data from a DB so I made this simple example and still have the issue. When this page is navigated to for the first time for a session, the first column in…
Russ