Questions tagged [kaggle]

Relating to Competitions, Datasets, Kernels, Learn, or Kaggle's API.

1115 questions
0
votes
1 answer

Submission on Kaggle

Kaggle requires us to upload a CSV file, but I don't understand how to do it. I have complete code that gives me predictions, and I need to write those predictions against their ids and make a CSV file out of them (i.e. containing two columns, one…
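A minimal sketch of the two-column file the question describes, using only the standard library. The column names `Id` and `Prediction` and the sample values are placeholders, not taken from the question; each competition defines its own header.

```python
import csv
import io


def write_submission(ids, predictions, out, id_col="Id", pred_col="Prediction"):
    """Write a header row, then one row per id with its prediction.

    The default column names are placeholders (an assumption).
    """
    if len(ids) != len(predictions):
        raise ValueError("ids and predictions must have the same length")
    writer = csv.writer(out)
    writer.writerow([id_col, pred_col])
    writer.writerows(zip(ids, predictions))


# Write to an in-memory buffer so the sketch runs without touching disk.
buffer = io.StringIO()
write_submission([1, 2, 3], [0, 1, 1], buffer)
submission_text = buffer.getvalue()
```

To produce a file for upload, the same function can be given a real file handle, for example `open("submission.csv", "w", newline="")`.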
0
votes
1 answer

Use Python 2.7 on Kaggle kernel

I am new to Kaggle kernels and tried running my code on one. The only issue I am facing is that my script is in Python 2.7, but the kernel supports 3.5. Is there any way I can use Python 2.7 on the kernel? Any links would also be helpful!
user5722540
  • 590
  • 8
  • 24
0
votes
1 answer

kaggle dataset or python split CLI

I downloaded the dataset from Kaggle (https://www.kaggle.com/c/dogs-vs-cats/data), then tried to get the image label from the downloaded data using the cv2.split('.')[-3] command (code at the end). However, I got an "index out of range" error. I checked the…
b.mira
  • 143
  • 1
  • 8
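A hedged sketch of reading a label from a file name. It assumes the intended operation is Python's string method `str.split` applied to the file name (`cv2.split` separates image channels and does not parse names), and that names look like `<label>.<index>.<ext>`. The paths below are hypothetical. A name with fewer than three dot-separated parts would make `[-3]` raise an IndexError, so the helper returns `None` in that case.

```python
import os


def label_from_filename(path):
    """Return the third-from-last dot-separated part of the file name, or None."""
    parts = os.path.basename(path).split(".")
    if len(parts) < 3:
        # Indexing [-3] here would raise IndexError ("index out of range").
        return None
    return parts[-3]


# Hypothetical paths used only for illustration.
paths = ["train/cat.0.jpg", "train/dog.12.jpg", "test/7.jpg"]
labels = [label_from_filename(p) for p in paths]
```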
0
votes
1 answer

regression on images using keras

I want to use images to predict the positions of hands, face, and steering (source: Kaggle). To do this I have manually labelled this dataset as follows: image Head_x Head_y left_hand_x left_hand_y steering_wheel_x steering_wheel_y right_hand_x…
Ani
  • 2,848
  • 2
  • 24
  • 34
0
votes
1 answer

Titanic Kaggle dataset Naive Bayes classifier error R programming

I am trying to train a naive Bayes classifier for the Kaggle Titanic dataset (URL: https://www.kaggle.com/c/titanic/data for "train.csv" and "test.csv"). The code that I have come up with so far is as follows: library(e1071) train_d <-…
Arun
  • 2,222
  • 7
  • 43
  • 78
0
votes
0 answers

Increase margins between plot and dropdown widget in Jupyter notebook.

I'm basically just trying to add some space between my plots and a dropdown widget in a Jupyter notebook. Currently the plots are covering up the dropdown boxes. Here is a link to the notebook on Kaggle: …
Sam Dotson
  • 35
  • 1
  • 6
0
votes
1 answer

Reading large CSV files using pandas with specified dtypes gives a memory error?

I am trying to read the Expedia data from Kaggle, which contains a 4 GB CSV file. I tried reading it using pd.read_csv('filename') and got a memory error. As a second approach, I tried reading only particular columns using the code: pd.read_csv('train.csv',…
Samyak Upadhyay
  • 573
  • 1
  • 12
  • 24
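One common way to keep memory bounded is to combine `usecols`, explicit `dtype`s, and `chunksize` in `pd.read_csv`, processing one chunk at a time instead of holding the whole file. This is a sketch, not the asker's code: the column names and the tiny in-memory CSV are made up for illustration.

```python
import io

import pandas as pd

# Stand-in for a large file; these column names are assumptions.
csv_text = "user_id,is_booking,hotel_cluster\n1,0,5\n2,1,7\n3,0,5\n4,1,9\n"

usecols = ["user_id", "is_booking"]                   # read only what is needed
dtypes = {"user_id": "int32", "is_booking": "int8"}   # smaller than the int64 default

total_bookings = 0
rows_seen = 0
# With chunksize set, read_csv yields DataFrames of at most that many rows.
for chunk in pd.read_csv(io.StringIO(csv_text), usecols=usecols,
                         dtype=dtypes, chunksize=2):
    total_bookings += int(chunk["is_booking"].sum())
    rows_seen += len(chunk)
```

For a real file, the `io.StringIO(...)` argument would be replaced by the file path and `chunksize` raised to something like 100,000 rows.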
0
votes
1 answer

Shiny R randomForest Error

I am trying to use a Shiny app and found that the code below works. ui.R: data(Titanic) head(as.data.frame(Titanic),5) library(shiny) shinyUI(pageWithSidebar( headerPanel("Titanic Survival Calculator"), sidebarPanel( p("Select person…
Steve
  • 1
  • 2
0
votes
0 answers

Tensorflow Keras Regression model - Biased Results

I am trying to implement a Keras regression model on a dataset for learning purposes. I have taken the data from the Kaggle Loan Default Prediction Challenge and I am trying to predict whether or not a person will default on a loan. The target…
Anonymous
  • 11
  • 4
0
votes
2 answers

How to deal with data when making a decision tree

I am trying to make a decision tree for a dataset I got from Kaggle. Since I don't have any experience dealing with real-life datasets, I have no idea how to handle cleaning, integrating, and scaling the data (mainly scaling). For example,…
0
votes
0 answers

get_dummies causing column mismatch between train and test sets?

I'm working on the Titanic dataset from Kaggle and having an issue handling categorical variables and dealing with NA values. The following code produces this error: import pandas as pd from sklearn.preprocessing import Imputer nonpredictors =…
mowglis_diaper
  • 479
  • 1
  • 9
  • 18
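A column mismatch of this kind typically arises when a category appears in one frame but not the other, so `pd.get_dummies` produces different columns for each. One possible fix, sketched here on made-up data rather than the asker's code, is to reindex the encoded test frame to the training columns.

```python
import pandas as pd

# Toy frames (assumed data): the test set lacks the "C" and "Q" categories.
train = pd.DataFrame({"Embarked": ["S", "C", "Q"], "Age": [22.0, 38.0, 26.0]})
test = pd.DataFrame({"Embarked": ["S", "S"], "Age": [30.0, 41.0]})

train_encoded = pd.get_dummies(train, columns=["Embarked"])
test_encoded = pd.get_dummies(test, columns=["Embarked"])

# Give the test frame exactly the training columns, in the same order;
# dummy columns that never appeared in the test data are filled with 0.
test_aligned = test_encoded.reindex(columns=train_encoded.columns, fill_value=0)
```

Reindexing also drops any dummy column that exists only in the test frame, which matches what a model fitted on the training columns can accept.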
0
votes
1 answer

radar chart with plotly (python 3x)

I am trying to set up a radar chart with plotly but currently have two issues: (1) some spurious points appear on the graph; (2) I can't get the text of individual points to appear on hover. The whole script (with the graph) can be found…
0
votes
3 answers

What are training and test data sets

I am getting started on Kaggle. I have just gone through various data science and machine learning competitions, and I have seen that for every competition they have uploaded training data, test data and original data. Can someone explain to me what are…
Abhishek Sharma
  • 205
  • 2
  • 8
0
votes
1 answer

Why are Logistic Regression and SVM predictions multiplied by constants at the end?

I'm currently trying to understand certain high-level classification problems and have come across some code from a Kaggle competition that ran in 2012. The competition discussion board is (here) and the winning code is (here). Almost at the end of…
salvu
  • 519
  • 5
  • 14
0
votes
1 answer

How to reshape csv file into matrix form for Digit Recognition Machine Learning

I am new to machine learning and am trying to work on the Digit Recognizer on Kaggle. I have already loaded the CSV file with train_df=pd.read_csv('train.csv'), and the result is: label,pixel0,pixel1,pixel2,pixel3,...,pixel783 4, 0 ,0 ,0 ,10 ,...,0 I know…
Nicholas
  • 71
  • 9
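A sketch of turning such a frame into image matrices. It assumes each row holds a `label` followed by 784 pixel columns that form a 28x28 image stored row by row (784 = 28 * 28). The random data below stands in for `train.csv`.

```python
import numpy as np
import pandas as pd

# Tiny stand-in for train.csv: a label column followed by pixel0..pixel783.
rng = np.random.default_rng(0)
pixels = rng.integers(0, 256, size=(3, 784))
columns = ["label"] + [f"pixel{i}" for i in range(784)]
train_df = pd.DataFrame(np.column_stack([[4, 0, 7], pixels]), columns=columns)

# Separate the labels, then reshape each 784-long row into a 28x28 matrix.
labels = train_df["label"].to_numpy()
images = train_df.drop(columns="label").to_numpy().reshape(-1, 28, 28)
```

With this layout, `images[n, r, c]` corresponds to column `pixel{r * 28 + c}` of row `n`, under the row-major assumption stated above.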