Questions tagged [kaggle]

Relating to Competitions, Datasets, Kernels, Learn, or Kaggle's API.

1115 questions
0 votes, 1 answer

Resnet cannot be loaded

I have been trying to import resnet34 and resnet50 into a Kaggle Kernel with no success. When I run: from keras.applications.resnet50 import ResNet50 as resnet50 resnet = ResNet50(weights='imagenet') learn = ConvLearner.pretrained(resnet, data,…
Rahul Deora
  • 157
  • 1
  • 8
0 votes, 0 answers

ValueError: Cannot feed value of shape (891,) for Tensor 'Placeholder_37:0', which has shape '(?, 1)'

import pandas as pd import tensorflow as tf PREDICTORS = ["Pclass", "Sex", "Age", "SibSp", "Parch", "Fare", "Embarked"] TARGET = ["Survived"] def load_file(path): data = pd.read_csv(path) data["Age"] =…
0 votes, 2 answers

Can't convert float to int in Python DataFrame/Array

I'm new to both Kaggle and Python and can't figure out how to convert this data set. For anyone familiar, I'm trying to reproduce the gender based solution for the Titanic tutorial. I have: submission = pd.DataFrame({'PassengerId' :…
Frankie
  • 11,508
  • 5
  • 53
  • 60
0 votes, 1 answer

Link objects in nested list by dictionary key

I am currently using these datasets from the Kiva Kaggle Competition: https://www.kaggle.com/kiva/data-science-for-good-kiva-crowdfunding/data I want to link a float 'MPI' value (a 'Multidimensional Poverty Index') to their corresponding geographical…
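One possible way to approach the linking question above is a lookup dictionary keyed on the shared field. The sketch below is only an illustration: the field names (`region`, `MPI`, `id`), the helper `attach_mpi`, and the sample values are assumptions and are not taken from the Kiva files.

```python
# Hypothetical records; field names and values are illustrative assumptions,
# not the actual Kiva dataset schema.
mpi_rows = [
    {"region": "Region A", "MPI": 0.387},
    {"region": "Region B", "MPI": 0.154},
]
loan_groups = [
    [{"id": 1, "region": "Region A"}, {"id": 2, "region": "Region C"}],
    [{"id": 3, "region": "Region B"}],
]


def attach_mpi(groups, rows, key="region"):
    """Return a copy of the nested list with an 'MPI' value added to each item."""
    # Build the lookup once so each item is matched in constant time.
    lookup = {row[key]: row["MPI"] for row in rows}
    return [
        [{**item, "MPI": lookup.get(item[key])} for item in group]
        for group in groups
    ]


linked = attach_mpi(loan_groups, mpi_rows)
```

Items whose key has no match (here the hypothetical "Region C") get `None` rather than raising an error, and the original nested list is left unchanged.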
0 votes, 0 answers

Titanic Dataset. Logits and Labels of different shape error

I am currently building a DNN for the Titanic Dataset classification. During that process, I get the following error: InvalidArgumentError: logits and labels must have the same first dimension, got logits shape [1,5] and labels shape [5] My code…
0 votes, 0 answers

What is the difference between Statsmodels and LogisticRegression in Python?

I'm a novice in Python. I'm working on the Titanic task from kaggle.com. Could someone explain to me the difference between doing LogisticRegression this way: from sklearn.linear_model import…
0 votes, 0 answers

ValueError: "concat" mode can only merge layers with matching output shapes except for the concat axis

I am trying to classify, locate, and segment objects such as buildings, trees and roads in LAS data (drone data) using https://github.com/ternaus/kaggle_dstl_submission. When I ran the building prediction code (python…
Sridhar
  • 11
  • 2
  • 4
0 votes, 1 answer

Does the test set need data cleaning in machine learning?

I am working on an interesting machine learning project about the NYC taxi data (https://s3.amazonaws.com/nyc-tlc/trip+data/green_tripdata_2017-04.csv). The target is predicting the tip amount. The raw data looks like (2 data samples): VendorID…
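A common convention for the question above is to apply the same cleaning steps to the test set, using parameters learned from the training set only. The sketch below illustrates this with median imputation. The column values and the helper names `fit_median` and `fill_missing` are hypothetical and do not come from the taxi data.

```python
from statistics import median


def fit_median(values):
    """Learn the fill value from a training column, ignoring missing entries."""
    return median(v for v in values if v is not None)


def fill_missing(values, fill_value):
    """Replace missing entries (None) with a previously learned fill value."""
    return [fill_value if v is None else v for v in values]


# Hypothetical column with gaps; not real rows from the taxi file.
train_column = [5.0, None, 12.5, 8.0]
test_column = [None, 20.0]

fill = fit_median(train_column)            # computed from training rows only
train_clean = fill_missing(train_column, fill)
test_clean = fill_missing(test_column, fill)  # same fill value reused on test
```

Reusing the training-derived value keeps information from the test rows out of the preprocessing step.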
0 votes, 1 answer

How to extract specific values from a column in a Python/Kaggle dataset

I'm trying to extract the following fields from the "jobpost" column: 1. Job Title 2. Position Duration 3. Job Responsibilities 4. Required Qualifications in the https://www.kaggle.com/madhab/jobposts/data. I've tried slicing and regex but I'm…
nabskim
  • 9
  • 1
  • 7
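For the field-extraction question above, one option is to split the posting text on its section headers and collect the pieces in a dictionary. This is a minimal sketch under stated assumptions: the sample text, the header labels, and the rule that a header is an upper-case label ending in a colon are all hypothetical, and the real "jobpost" layout may differ.

```python
import re

# Hypothetical posting text; the real "jobpost" layout may differ.
sample = (
    "TITLE: Data Analyst\n"
    "DURATION: 6 months\n"
    "JOB RESPONSIBILITIES:\n"
    "- Build reports\n"
    "- Clean data\n"
    "REQUIRED QUALIFICATIONS:\n"
    "- Degree in statistics\n"
)

# Assumption: a header is an upper-case label at the start of a line,
# followed by a colon.
HEADER = re.compile(r"^([A-Z][A-Z ]+):", re.MULTILINE)


def split_sections(text):
    """Map each header label to the text that follows it, up to the next header."""
    matches = list(HEADER.finditer(text))
    sections = {}
    for i, m in enumerate(matches):
        # A section ends where the next header starts, or at the end of the text.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        sections[m.group(1)] = text[m.end():end].strip()
    return sections


fields = split_sections(sample)
```

With a pandas column, a function like this could be applied row by row, for example with `Series.apply`.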
0 votes, 2 answers

How to extract text at newline using regex in Python?

I am having trouble extracting text/values on a new line using regex. I'm trying to get the ("REQUIRED QUALIFICATIONS:") values. If I use: pattern = re.compile(r"JOB RESPONSIBILITIES: .*") matches = pattern.finditer(gh) The output would…
nabskim
  • 9
  • 1
  • 7
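A likely cause of the behaviour described above is that `.` does not match a newline by default, so `.*` stops at the end of the header line. The sketch below shows the difference with `re.DOTALL`. The text assigned to `gh` and the "APPLICATION PROCEDURES" header are hypothetical stand-ins, since the real data is not shown in the excerpt.

```python
import re

# Hypothetical text standing in for the variable `gh` from the question.
gh = (
    "JOB RESPONSIBILITIES:\n"
    "- Build reports\n"
    "REQUIRED QUALIFICATIONS:\n"
    "- Degree in statistics\n"
    "- Two years of experience\n"
    "APPLICATION PROCEDURES:\n"
    "Send a CV\n"
)

# Without re.DOTALL, "." stops at the end of the line, so only the
# header itself is matched.
single_line = re.search(r"REQUIRED QUALIFICATIONS:.*", gh).group(0)

# With re.DOTALL, "." also matches newlines. The lazy ".*?" plus a lookahead
# stops at the next upper-case header (assumed format) or at the end of the text.
pattern = re.compile(
    r"REQUIRED QUALIFICATIONS:\s*(.*?)(?=\n[A-Z][A-Z ]+:|\Z)", re.DOTALL
)
body = pattern.search(gh).group(1).strip()
```

The lookahead keeps the next header out of the captured text, so the same pattern shape can be reused for other sections by swapping the label.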
0 votes, 1 answer

Searching a Pandas DataFrame column for empty values gives contradictory results

I'm trying to clean test data from Kaggle's Titanic dataset, specifically the columns - sex, fare, pclass, and age. In order to do this, I'd like to find out if any of these columns have empty values. I load the data: import pandas as…
0 votes, 1 answer

What are the best methods to classify user gender based on names?

If you check my GitHub, I have successfully implemented CNN and KNN for classifying signal faults. For that, I have taken the signal with little preprocessing for dimensionality reduction and provided it to the network, using its class information I…
Raady
  • 1,686
  • 5
  • 22
  • 46
0 votes, 2 answers

Why is some portion of statistics not used in data science?

I have learned statistics, including mean, median, mode, and various tests such as the Z test, F test, and chi-square, but when participating in difficult numeric data prediction challenges on Kaggle and other platforms I hardly…
jatin singh
  • 123
  • 1
  • 1
  • 13
0 votes, 3 answers

C# script to log in and download from Kaggle

Recently, I came across a Python script to download files directly from Kaggle: https://ramhiser.com/2012/11/23/how-to-download-kaggle-data-with-python-and-requests-dot-py/ I am trying to do something similar using WebClients in C#. I've come the…
Jeremy Loh
  • 175
  • 1
  • 9
0 votes, 2 answers

Google Cloud DataLab + BigQuery: how to set region/zone/location

I'm using the Datalab for a Python notebook that loads data from Cloud Storage into BigQuery basically following this example. I then saw that my original data in the Cloud Storage bucket is in the EU (eu-west3-a), the VM that executes the Datalab…
Matthias
  • 5,574
  • 8
  • 61
  • 121