Questions tagged [kaggle]

Relating to Competitions, Datasets, Kernels, Learn, or Kaggle's API.

Relating to the following Kaggle data science categories:

1115 questions
0
votes
1 answer

Docker on Ubuntu 14.04 my browser is not starting after run command for Kaggle/python

In Kaggle/python docker on Ubuntu 14.04 my browser is not starting.Anyone has face this issue and resolution? I am using below command from terminal "(sleep 3 && sensible-browser "http://127.0.0.1:8888")& docker run -v $PWD:/tmp/working…
Kris
  • 47
  • 5
0
votes
1 answer

Python [[0]] meaning

I am running a Python script (Kaggle script). It works in a 3.4.5 virtualenv, but not in 3.5.2 I am not sure why and I am not familiar with the [[0]] syntax. Below is the snippet. import pandas as pd data =…
Chad Crowe
  • 1,260
  • 1
  • 16
  • 21
0
votes
0 answers

Errno 13 Permission denied: 'train.zip' upon downloading data using Kaggle CLI

I'm trying to download data from kaggle to my AWS p2.xlarge instance using Kaggle CLI. I've manually accepted the competition terms and conditions through the browser before trying to download data. But I'm getting the error as shown…
xpm
  • 353
  • 2
  • 10
0
votes
1 answer

Not getting expected output - Plotting Histograms in Pandas

I am new to Pandas and I am trying to get the histogram of my data set. When I am running the same query on Kaggle notebook I am getting my desired output i.e. The histograms appearing, while running the same query on Jupyter notebook shows nothing.…
0
votes
0 answers

Apply Feature Engineering on Both Training & Test Sets

I am getting slightly confused. I am attempting to do a Data Science competition (my first time, on a website similar to Kaggle). I need to do a classification. I have a training set and a test set. Very classic. I analyzed data and created some new…
ML_Enthousiast
  • 1,147
  • 1
  • 15
  • 39
0
votes
4 answers

How to finish code to replace NA with median in R

I am very new to R, so please please be gentle. I am working on the Kaggle Titanic competition, to get me into R and working things out. I am working my way through engineering a feature and I am a bit stuck with the logic of what to do next. So,…
Jade Reynolds
  • 201
  • 5
  • 14
0
votes
1 answer

Why does my implementation of linear regression in Tensorflow done on Ames Housing dataset converge very very slowly?

I am trying to use Linear Regression on the Ames Housing dataset available on Kaggle. I did some manual cleaning up of the data by removing many features first. Then, I used the following implementation to train. train_size =…
Apara
  • 375
  • 4
  • 14
0
votes
1 answer

converting csv to arff

I am working on a school project for data mining, where we were given CSV data from kaggle (this is how the data looks (2 lines out of 6970)): 4,1970,Female,150,DomesticPartnersKids,Bachelor's…
candy
  • 1
  • 1
0
votes
1 answer

How to get F1,Precision and Recall for a Cross Validated Data Set in R

I have two data sets. train <- read.csv("train.csv") test <- read.csv("test.csv") The data in train set look as below. > str(train) 'data.frame': 891 obs. of 12 variables: $ PassengerId: int 1 2 3 4 5 6 7 8 9 10 ... $ Survived : Factor w/…
Yash
  • 245
  • 1
  • 7
  • 19
0
votes
2 answers

How to implement next_batch() function for custom data in python

I am currently working on the cats vs dogs classification task on kaggle by implementing a deep convNet. The following lines of code is used for data preprocessing: def label_img(img): word_label = img.split('.')[-3] if word_label == 'cat':…
Kaustabh Kakoty
  • 87
  • 3
  • 15
0
votes
3 answers

How to convert format csv to kaggle submission format in rapidminer

I am trying to complete San Francisco Crime Classification kaggle excercise with Rapidminer but I don't find any help on how to create a csv in kaggle submission format from rapidminer.
Imran Shafqat
  • 508
  • 1
  • 9
  • 23
0
votes
1 answer

ValueError in creating submission csv

I am learning data science and reading other people's scripts. There is this one titanic algorithm (kaggle) has this code to apply the Logistic Regression then supposedly export to a .csv file as suggested in the code. However, it always generates…
anicehat
  • 45
  • 1
  • 1
  • 8
0
votes
1 answer

TensorFlow - Training accuracy not improving in MNIST data

I write a program with tensorflow to process Kaggle's digit-recognizer problem.Program can run normally,but the training accuracy is always low,about 10%,such as following : step 0, training accuracy 0.11 step 100, training accuracy 0.13 step 200,…
Lixudong
  • 9
  • 2
0
votes
1 answer

Python pandas read data of the form Label : :

For example, a row in the data looks like this -1 0:183.3575741549828 1:3.11164735151736 2:2.171277907851733 3:26.68849990272964 4:24.76677388937082 5:0.02710337995527495 The reason why index is specified is because attributes for which index is…
0
votes
0 answers

Numpy error: operands could not be broadcast together with shapes (65,) (77,)

I'm trying to generate a graph in IPython notebook for a .csv spreadsheet (easily found in kaggle = titanic_data.csv) and put below a summary of the steps I have done. First I imported the worksheet and took out some columns. titanic_data =…