Questions tagged [training-data]

A training set is a set of data used to discover potentially predictive relationships, used in fields like artificial intelligence, machine learning, and statistics.

1782 questions
15
votes
2 answers

How to improve accuracy of a FeedForward Neural Network?

I want to draw StackOverflow's logo with this Neural Network: The NN should ideally become [r, g, b] = f([x, y]). In other words, it should return RGB colors for a given pair of coordinates. The FFNN works pretty well for simple shapes like a…
15
votes
8 answers

Split image dataset into train-test datasets

I have a main folder containing sub-folders, which in turn contain images for the dataset as…
Ishan Dixit
  • 379
  • 1
  • 3
  • 11
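
For the folder layout described in the question above (one sub-folder per class), a train-test split can be sketched with the standard library alone. This is a minimal sketch under assumptions, not an accepted answer: the function name `split_dataset`, the `train`/`test` output folder names, and the 20% default test ratio are all hypothetical choices made for illustration.

```python
import random
import shutil
from pathlib import Path


def split_dataset(src_dir, dst_dir, test_ratio=0.2, seed=0):
    """Copy src_dir/<class>/* into dst_dir/train/<class>/ and dst_dir/test/<class>/.

    Returns a dict mapping each class name to (n_train, n_test).
    The folder names and the ratio are assumptions for this sketch.
    """
    src, dst = Path(src_dir), Path(dst_dir)
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    counts = {}
    for class_dir in sorted(p for p in src.iterdir() if p.is_dir()):
        files = sorted(f for f in class_dir.iterdir() if f.is_file())
        rng.shuffle(files)
        n_test = int(len(files) * test_ratio)
        splits = {"test": files[:n_test], "train": files[n_test:]}
        for split_name, split_files in splits.items():
            target = dst / split_name / class_dir.name
            target.mkdir(parents=True, exist_ok=True)
            for f in split_files:
                shutil.copy2(f, target / f.name)  # copy, so the source stays intact
        counts[class_dir.name] = (len(files) - n_test, n_test)
    return counts
```

Splitting per class keeps the class proportions roughly equal in both sets. Copying rather than moving leaves the original folder untouched, at the cost of extra disk space.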
14
votes
4 answers

Train Model fails because 'list' object has no attribute 'lower'

I am training a classifier over tweets for sentiment analysis purposes. The code is the following: df = pd.read_csv('Trainded Dataset Sentiment.csv', error_bad_lines=False) df.head(5) #TWEET X = df[['SentimentText']].loc[2:50000] #SENTIMENT…
Alex
  • 1,447
  • 7
  • 23
  • 48
14
votes
28 answers

CountNonDivisible - Codility training task

I'm practising on Codility now. Some tasks I can solve by myself, but I have problems with others. The difficulty of this task is <**>. It's medium, but I'm stuck. Problem: You are given a non-empty zero-indexed array A consisting of N integers. For…
Dima
  • 514
  • 2
  • 7
  • 18
13
votes
2 answers

AngularJS free video tutorials

I searched on Google but didn't find any good free video tutorials for AngularJS. This site looks fine (http://egghead.io) but requires payment. Do you know of any free video training for AngularJS?
Mohammad Faizan khan
  • 1,213
  • 3
  • 17
  • 32
12
votes
1 answer

Validation and Testing accuracy widely different

I am currently working on a dataset in Kaggle. After training the model on the training data, I tested it on the validation data and got an accuracy of around 0.49. However, the same model gives an accuracy of 0.05 on the testing data. I am using…
12
votes
1 answer

Should I keep/remove identical training examples that represent different objects?

I have prepared a dataset to recognise a certain type of object (about 2240 negative object examples and only about 90 positive object examples). However, after calculating 10 features for each object in the dataset, the number of unique training…
10
votes
4 answers

Can flow_from_directory get train and validation data from the same directory in Keras?

I got the following example from here. train_datagen = ImageDataGenerator( rescale=1./255, shear_range=0.2, zoom_range=0.2, horizontal_flip=True) test_datagen = ImageDataGenerator(rescale=1./255) train_generator =…
BAE
  • 8,550
  • 22
  • 88
  • 171
10
votes
1 answer

Ideal number of HoG features

There are many options for extracting HoG features: different orientations, different numbers of pixels per cell, and different block sizes. But is there a standard or optimal configuration? I have training images of size 50x100, and I'm…
user961627
  • 12,379
  • 42
  • 136
  • 210
9
votes
3 answers

Stop Keras training when the network has fully converged

How can I configure Keras to keep training until convergence or until the loss is 0? I intentionally want to overfit the network. I don't want to set a number of epochs; I just want training to stop when it converges.
alyssaeliyah
  • 2,214
  • 6
  • 33
  • 80
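
The stopping rule asked about above can be sketched without any framework. This is a minimal illustration, not Keras code: `step_fn` is a hypothetical callable that runs one epoch and returns the training loss, and `tol`, `patience`, and `max_epochs` are assumed parameter names. In Keras itself, the `EarlyStopping` callback with `monitor='loss'`, `min_delta`, and `patience` provides similar behaviour.

```python
def train_until_converged(step_fn, tol=1e-6, patience=5, max_epochs=100_000):
    """Run step_fn() once per epoch until the loss stops improving.

    step_fn is a hypothetical callable returning the loss for one epoch.
    Training stops when the loss hits exactly 0, or when it has failed to
    improve on the best loss by more than tol for `patience` epochs in a row.
    """
    best = float("inf")
    stale = 0          # consecutive epochs without a meaningful improvement
    history = []
    for _ in range(max_epochs):  # safety cap so the loop always terminates
        loss = step_fn()
        history.append(loss)
        if best - loss > tol:
            best = loss
            stale = 0
        else:
            stale += 1
        if loss == 0.0 or stale >= patience:
            break
    return history
```

A large `max_epochs` combined with the patience check effectively means "train until converged", while still guarding against a loss that oscillates forever.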
9
votes
1 answer

Normalization of data in continuous neural network training in R

I would like to train my neural network continuously as new input keeps arriving. However, as I get new data, the normalized values will change over time. Let's say that at time one I get: df <- "Factor1 Factor2 Factor3 Response 10…
user3091668
  • 2,230
  • 6
  • 25
  • 42
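
One way to handle the shifting normalization described above is to keep running statistics and update them as each value arrives. The sketch below uses Welford's online algorithm and is written in Python rather than the question's R. The class name `RunningScaler` and its methods are hypothetical, and it handles a single numeric column only.

```python
import math


class RunningScaler:
    """Tracks the mean and variance of a stream with Welford's algorithm."""

    def __init__(self):
        self.n = 0
        self.mean = 0.0
        self.m2 = 0.0  # running sum of squared deviations from the mean

    def update(self, x):
        """Fold one new observation into the running statistics."""
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self.m2 += delta * (x - self.mean)

    def std(self):
        """Population standard deviation; 0.0 until two values are seen."""
        return math.sqrt(self.m2 / self.n) if self.n > 1 else 0.0

    def transform(self, x):
        """Z-score x against the statistics seen so far."""
        s = self.std()
        return (x - self.mean) / s if s > 0 else 0.0
```

Note that values normalized early on were scaled with different statistics than later ones. Whether that drift is acceptable depends on the model; fixing the scaling from an initial representative sample is an alternative.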
9
votes
2 answers

Incremental training of random forest model using python sklearn

I am using the code below to save a random forest model. I am using cPickle to save the trained model. As I see new data, can I train the model incrementally? Currently, the training set has about 2 years of data. Is there a way to train on another 2…
ForeverLearner
  • 1,901
  • 2
  • 28
  • 51
9
votes
2 answers

Keras ImageDataGenerator Slow

I am looking for the best approach to train on larger-than-memory data in Keras, and I am currently noticing that the vanilla ImageDataGenerator tends to be slower than I would hope. I have two networks training on the Kaggle cats vs dogs dataset (25000…
John Cast
  • 1,771
  • 3
  • 18
  • 40
9
votes
1 answer

Self Training Algorithm

I'd like to develop a self-training algorithm for a specific problem. To keep things simple, I'll boil it down to a simple example. Update: I have added a working solution as an answer to this question below. Let's say I have a huge list of entities…
endeffects
  • 431
  • 4
  • 15
8
votes
1 answer

What is the way to use a TensorFlow 2.0 object in OpenCV (cv2) in Python, and why is it so circuitous?

I load an image using the TensorFlow API (2.0) like so: def load(image_file): image = tf.io.read_file(image_file) image = tf.image.decode_jpeg(image) Now that I have this object, I want to show this image. I can simply use matplotlib.pyplot, and…