Questions tagged [data-analysis]

Data Analysis involves extracting meaning and insights from raw data. It involves methods and algorithms that examine, clean, transform and model the data to obtain conclusions.

Typically, data analysis involves a series of steps: measuring some parameters of interest, collecting the data, cleaning it, storing it in meaningful ways, then summarizing and examining it, and testing various hypotheses about the data.
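As a rough illustration of the cleaning, summarizing, and hypothesis-testing steps, here is a minimal sketch using only the Python standard library. The sample values, the function names, and the reference mean of 5.0 are all made up for the example; real analyses usually rely on libraries such as pandas or SciPy.

```python
import statistics

# Hypothetical raw measurements; None and "n/a" stand in for missing values.
raw = ["4.8", "5.1", None, "5.3", "n/a", "4.9", "5.2"]


def clean(values):
    """Keep only the entries that can be parsed as floats."""
    cleaned = []
    for value in values:
        try:
            cleaned.append(float(value))
        except (TypeError, ValueError):
            # Skip missing or malformed entries.
            continue
    return cleaned


def summarize(values):
    """Return the sample size, mean, and sample standard deviation."""
    return {
        "n": len(values),
        "mean": statistics.mean(values),
        "stdev": statistics.stdev(values),
    }


def t_statistic(values, mu0):
    """One-sample t statistic for the null hypothesis mean == mu0."""
    s = summarize(values)
    return (s["mean"] - mu0) / (s["stdev"] / s["n"] ** 0.5)


data = clean(raw)               # cleaning
summary = summarize(data)       # summarizing
t = t_statistic(data, mu0=5.0)  # testing a hypothesis (assumed mean of 5.0)

print(summary)
print(f"t statistic: {t:.3f}")
```

The t statistic alone does not decide anything; it would still need to be compared against a t distribution with n - 1 degrees of freedom to obtain a p-value.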

More information can be found on Wikipedia's Data Analysis page.

4642 questions
1
vote
0 answers

CSV spreadsheet analysis

I'm trying to complete the assignment (Quiz 21) described below for the following course: https://classroom.udacity.com/courses/ud170/lessons/5430778793/concepts/53961386480923 The first code fragment is the one I wrote, which outputs the wrong…
1
vote
1 answer

python- concatenate and append a pandas dataframe in a for loop

I am sorry, I didn't really know how to word the title of this question. I do not work with Python too often and I am just starting to work with the pandas and numpy packages. I am getting unexpected results when trying to concatenate and append a…
JoeG
  • 512
  • 8
  • 19
1
vote
1 answer

How to split date and time from string?

I have a dataset imported from Excel into pandas. One column holds the date and time in string format. 16-MAR-16 11.35.27.000000000 AM 05-APR-16 05.21.14.000000000 PM 16-FEB-16 09.56.36.000000000 AM 16-MAR-16 11.35.27.000000000 AM 16-MAR-16…
kepy97
  • 988
  • 10
  • 12
1
vote
0 answers

Making decision depending on complicated factors without starting data

Currently I'm building an automatic decision-making system, which depends on many different factors. The problem is that I don't have any data to analyze and train on. My system has some factors, such as is on Holiday or not, is on…
1
vote
1 answer

Issue using Tweepy to pull data from Twitter Stream: Data Analysis

from tweepy import OAuthHandler from tweepy import StreamListener class listener(StreamListener): def on_data(self, data): print(data) return(True) def on_error(self, status): print (status) auth =…
Yepram Yeransian
  • 321
  • 1
  • 14
1
vote
1 answer

How to find the peak coordinate from dataset

I have a dataset. This is the graph I drew using this dataset. How can I find the coordinates of the peak value in this dataset? Does anyone have a good Java algorithm for this?
phoon
  • 369
  • 1
  • 6
  • 21
1
vote
1 answer

split pandas single column(List of dict) and append as new keys of dict as new columns

Input : df = pd.DataFrame({'a':[1,2], 'b':[[{'x1':1,'x2':3},{'x1':4,'x2':1}], [{'x1':5},{'x1':3,'x2':6}]], 'c':[5,6]}) If I apply the operation print(df['b'].apply(pd.Series)) Output…
Rakesh Bhagam
  • 117
  • 1
  • 2
  • 9
1
vote
1 answer

Python: MemoryError for .corr(method='pearson').applymap

I am trying to calculate the Pearson correlation for every possible pair of columns in a dataframe. I have 57997 columns, but I am getting a memory error. t_logs = logs.T print t_logs results =…
1
vote
0 answers

How to rescale signal intensity in an image in relation to spatial position?

Hi I have a 1D radial profile of a sample across a pipe (fig_1). One data point (along the orange straight line) is acquired at each "band" from the image. The resolution (x,y,z) of each data point is 100um x 100um x 1000um. (fig_1) However in…
1
vote
1 answer

Is it possible to project fields based on the field name in mongodb?

In a collection, the documents may contain fields like "name", "name:en", "name:de" { _id:123456 tag:{ name: HongKong name-zh: 香港 other_tag1: value1 other_tag2: value2 } } { _id:123457 tag:{ …
陈梓泉
  • 145
  • 2
  • 9
1
vote
2 answers

Pandas percentage change to a n value before

Hello I have a pandas column with n values 2018-01-03 01:30:00 14873.46 2018-01-03 01:45:00 14848.01 2018-01-03 02:00:00 14888.90 2018-01-03 02:15:00 14834.59 2018-01-03 02:30:00 14895.08 2018-01-03 02:45:00 15049.98 2018-01-03…
hopieman
  • 399
  • 7
  • 22
1
vote
1 answer

Speed up Python Loop append

Hello, I have a huge list of values, and I want to find all n-value patterns like list[0:30], list[1:31]. For each value I want to compare the percentage change to the first, like percentage_change(array[0],array[1]), percentage_change(array[0],array[2]), all the way…
hopieman
  • 399
  • 7
  • 22
1
vote
2 answers

How to 'pivot' a dataframe's values into columns

I'm trying to transform a dataframe df = pd.DataFrame({ 'c1': ['x','y','z'], 'c2': [[1,2,3],[1,3],[2,4]]}) which looks like c1 c2 0 x [1, 2, 3] 1 y [1, 3] 2 z [2, 4] into p = pd.DataFrame({ 'c1': ['x','y','z'], 1:…
1
vote
1 answer

R neural Networks

I am playing around with the Adult Dataset https://archive.ics.uci.edu/ml/datasets/adult and R. I am trying to use the neuralnet package to train a neural network with backpropagation. I have cleaned the data. Now I am trying to run this part : n <-…
Ioannis K
  • 95
  • 1
  • 9
1
vote
1 answer

How to convert or decode the Unicode characters in pandas DataFrame?

I was reading some data from an Excel file using pandas and did some row traversal to make a Python dictionary, which I then put into a JSON file. The problem is that I am getting Unicode characters in the JSON file itself: "C V M College of Fine…
Natesh bhat
  • 12,274
  • 10
  • 84
  • 125