Questions tagged [kaggle]

Relating to Competitions, Datasets, Kernels, Learn, or Kaggle's API.

Relating to the following Kaggle data science categories:

1115 questions
-1
votes
1 answer

Random Forest Regression not give 0 or 1

I'm currently using RandomForestRegression for Titanic(Kaggle). %%timeit model = RandomForestRegressor(n_estimators=200, oob_score=False,n_jobs=1,random_state=42) model.fit(X,y) #y_oob = model.oob_prediction_ #print("c-stat:",…
user10733243
-1
votes
3 answers

Solving Kaggle's Titanic Machine Learning

I'm trying to solving Kaggle's Titanic with Python. But I have an error trying to fit my data. This is my code: import pandas as pd from sklearn import linear_model def clean_data(data): data["Fare"] =…
-1
votes
2 answers

Kaggle airbus ship detection challenge.How to deal with class imbalance?

My model always predict under probability 0.5 for all pixels. I dropped all images without ships and have tried focal loss,iou loss,weighted loss to deal with imbalance . But the result is same.After few batches the masks i predicted gradually…
-1
votes
1 answer

How to upload large image datasets from kaggle to google colab?

I wanted to work on a image dataset. I dont have local GPU, so i wanted to make use of free GPU on Google colab. But i don't know how to upload a large image dataset to colab.
abdkumar
  • 142
  • 1
  • 2
  • 10
-1
votes
1 answer

kaggle cli installation error

After installing kaggle, when I am passing kaggle command to command window it is throwing this error. Successfully installed kaggle-1.4.6 You are using pip version 10.0.1, however version 18.0 is available. You should consider upgrading via …
-1
votes
1 answer

How to use the test data against the trained model?

I'm a beginner in Machine Learning and I'm going through the Titanic competition. At first, my model gave me an accuracy of 1.0, which was too good to be true. Then I realized that I am comparing my trained model with the training data that I've…
Onur-Andros Ozbek
  • 2,998
  • 2
  • 29
  • 78
-1
votes
1 answer

download error Diabetic Retinopathy dataset from Kaggle

I was starting new project of Diabetic retinopathy but now it shows error in dataset download. Though it gets downloaded, unzipping results error. This consist of 5 training dataset consisting more than 8 GB each. I am in huge trouble. How can i…
Ashwin Dhakal
  • 31
  • 1
  • 5
-1
votes
2 answers

Trying to work out a python code

I am new to Machine Learning and python. Recently i have been working with Amazon fine food review data from kaggle and its code. What i don't understand is how is the 'partiton' method used here ? Moreover, What actually does last 3 lines of code…
-1
votes
1 answer

How to extract and format the string in a python data-frame

I am new in Python and I would like to extract a certain kind of string data from my data frame. Here is my data frame: I want to extract the first letter in the cabin column(like A,B,C,D...) and add it as a new feature column named as…
-1
votes
1 answer

pandas str.extractall on complete words

I have a column of tweets. I want to get a list of all mentions inside the tweet using the regex: \@(\w+) I tried using df.Tweets.str.extractall('\@(\w+)') but it doesn't succeed with matching the entire word as it wants (my guess) to separate each…
DeanLa
  • 1,871
  • 3
  • 21
  • 37
-1
votes
1 answer

unexpected line continuation character

i am learning how to do data science and was following the kaggle tutorial for titanic. However, women_only_stats = data[ \ #Which element (data[0::,4] == "female") \ #is a female …
aceminer
  • 4,089
  • 9
  • 56
  • 104
-1
votes
1 answer

Text preprocessing in Python

I would like to build a text corpus for a NLP project in Python. I've seen this text format in the LSHTC4 Kaggle challenge: 5 0:10 8:1 18:2 54:1 442:2 3784:1 5640:1 43501:1 The first number corresponds to the label. Each set of numbers separated…
-2
votes
0 answers

Does Colab stops runtime if it reaches to RAM limit?

I was using Kaggle notebook for running my NN models but whenever ram usage reaches 13GB it stops the runtime and i have to start all from the beginning. The dataset is huge, almost 450gb so i don't want to shift on colab as i need to import it…
Nirob
  • 29
  • 5
-2
votes
1 answer

Loop is slow when adding a @tf.function inside, why?

I have a train_step function like this: @tf.function def train_step(timestep_values,noised_image,noise): # calculate loss and update parameters with tf.GradientTape() as tape: prediction = model(noised_image, timestep_values) …
-2
votes
4 answers

Understanding the functioning of 'and' and 'or'

Here's a doubt I am facing here. The code with its purpose in docstring is below : This is the correct code but I am confused about the final 'if statement' in the blackjack_hand_greater_than(a,b) function. Here, the 'if statement' is not True for…
novice
  • 3
  • 2