Questions tagged [data-analysis]

Data Analysis involves extracting meaning and insights from raw data. It involves methods and algorithms that examine, clean, transform and model the data to obtain conclusions.

Data Analysis involves extracting meaning and insights from raw data.

It involves methods and algorithms that examine, clean, transform and model the data to obtain conclusions and insights.

Typically, data analysis involves a series of steps. Starting with measuring some parameters of interest, collecting the data, cleaning it, storing it in meaningful ways, then summarizing and examining it, and also testing various hyoptheses about the data.

More information can be found the Wikipedia's Data Analysis page.

4642 questions
1
vote
2 answers

Whats the logic behind 'impact' argument in ahp topsis function

d <- matrix(rpois(12, 5), nrow = 4) w <- c(1, 1, 2) i <- c("+", "-", "+") topsis(d, w, i) this is the function available in R for Ahp topsis, i am confused about how to assign "+" and "-" signs here for "impact" argument. how is it done here in…
Aisha atique
  • 551
  • 2
  • 5
  • 16
1
vote
1 answer

add a sentiment column onto a dataset in r

I have done some basic sentiment analysis in r and wanted to know if there was a way to have the sentiment of a sentence or row analyzed, and then have a column appended with the sentiment of the sentence. All analysis I have done up until now gives…
Josh
  • 35
  • 5
1
vote
1 answer

How Can I import a Data set in Jupiter notebook (AD_Data.xlsx) data got xlsx extention

Tried all the possible options like import pandas as pd df = pd.read_csv('AD_Data') data = pd.ExcelFile("AD_Data") xl_file = pd.ExcelFile(AD_Data) dfs = {sheet_name: xl_file.parse(AD_Data) for sheet_name in xl_file.AD_Data} dfs =…
Masum Billah
  • 133
  • 1
  • 4
  • 13
1
vote
2 answers

Calculating MSS and RSS in R

I am trying to calculate the MSS and RSS using the output and the components of the regression model that I have created (model.1) model.1<-glm(wbw.df$x.percap ~ wbw.df$y.percap,family=gaussian) Which part of the output do I need to be focusing…
Mary
  • 13
  • 1
  • 5
1
vote
1 answer

Grouping all rows of a pandas DataFrame(with many columns) with the same value in a given column

I have been searching for hours.I have a DataFrame like so :- col1. col2. col3. col4 row1. a. p u 0 row2. b. q v 1 row3. a. r w 2 row4. d. s x 3 row5. b. t y …
Devansh Singh
  • 53
  • 1
  • 10
1
vote
0 answers

Adding hover tool to datashader interactive image

I want to perform datashading on a plot created in bokeh. I encountered with this python notebook. But I want to know can I add hovertool to resultant image after datashading. If yes then how can I add tools like hovertool,taptool to the…
Cracken
  • 306
  • 2
  • 5
  • 17
1
vote
2 answers

Python : Function which works on initial/default values if no parameters are passed

I have defined a function which calculates the mean of a column Test 1 in my dataframe between two time limits. The dataframe is -:- df = pd.DataFrame({'Time':[0.0, 0.25, 0.5, 0.68, 0.94, 1.25, 1.65, 1.88, 2.05, 2.98, 3.45, 3.99, 4.06],'Test…
Arpit Sharma
  • 345
  • 6
  • 15
1
vote
2 answers

Pandas : finding the average of values within a particular limit in a column

I have imported the first three columns of a .csv file named as Time, Test 1 and Test 2 in my python program. import pandas as pd fields = ['Time', 'Time 1', 'Time 2'] df=pd.read_csv('file.csv', skipinitialspace=True, usecols=fields) Here is the…
Arpit Sharma
  • 345
  • 6
  • 15
1
vote
1 answer

Splitting dictionary/list inside a Pandas Column and convert as new dataframe

I have data saved in a excel. I am querying this data using Python2.7 and turning it into a Pandas DataFrame. i have a column called category in my dataframe.It has a dictionary (or list?) of values within it. The DataFrame looks like this: [1]…
Hrithu
  • 281
  • 4
  • 13
1
vote
2 answers

Pandas Data Frame Average Values

I have a large dataset that looks like this: Year Month Day Hour Count Value 2017 1 1 1 59 0 .. .. .. .. .. .. 2011 12 29 23 60 12 I am trying to use…
1
vote
2 answers

how to map two rows of different dataframe based on a condition in pandas

I have two dataframes, df1, Names one two three Sri is a good player Ravi is a mentor Kumar is a cricketer player df2, values sri NaN sri, is kumar,cricketer player I am trying to get the row in df1 which contains the all the items in…
Pyd
  • 6,017
  • 18
  • 52
  • 109
1
vote
2 answers

Python - Drop rows from a Pandas DataFrame that contain numbers

I have a dataframe with one column like this: Value xyz123 123 abc def I want to remove any rows that contain numbers so I end up with a dataframe like this: Value abc def I have tried df = df[df['Value'].str.contains(r'[^a-z]')] but I got this…
dasvootz
  • 413
  • 1
  • 5
  • 15
1
vote
1 answer

Is Data Mart Normalized ? if Yes then What is the normal form of Data Mart?

I read a lot about data mart and know that data mart uses star and snowflake schema. But I want to know that what is normal form of Data Mart.
Manoj Kumar Dhakad
  • 1,862
  • 1
  • 12
  • 26
1
vote
1 answer

How do I set up the values for the x axis in matplot in python?

plt.figure(figsize=(15,5)) plt.plot(data['Unemployment Rate']) plt.axis([1948,2017,0,15]) plt.show() These code returned an empty graph. The table contains 2 columns, data['Year'] and data['Unemployment Rate']. The year is between 1948 and…
1
vote
2 answers

How to combine 2 level column index of DataFrame into a single column index?

I have a DataFrame like below having 2 level column index, now I have to combine it into single level column index in a pattern like "Level1_Level2" CNT_LOAN CNT_RCHG month 201605 201606 201607 201608 …
Abhishek Jain
  • 65
  • 1
  • 2
  • 8