Questions tagged [deep-learning]

Deep Learning is an area of machine learning whose goal is to learn complex functions using special neural network architectures that are "deep" (consist of many layers). This tag should be used for questions about implementation of deep learning architectures. General machine learning questions should be tagged "machine learning". Including a tag for the relevant software library (e.g., "keras", "tensorflow","pytorch","fast.ai" etc) is helpful.

Deep Learning is a branch of aimed at building to learn complex functions using special neural network architectures with many layers (hence the term "deep").

Deep neural network architectures allow for more complex tasks to be learned because, in addition to these neural networks having more layers to perform transformations, the larger number of layers and more complex architectures of the neural network allow a hierarchical organization of functionality to emerge.

Deep Learning was introduced into machine learning research with the intention of moving machine learning closer to artificial intelligence. A significant impact of deep learning lies in feature learning, mitigating much of the effort going into manual feature engineering in non-deep learning neural networks.

NOTE: If you want to use this tag for a question not directly concerning implementation, then consider posting on Cross Validated, Data Science, or Artificial Intelligence instead; otherwise your question is probably off-topic. Please choose one site only and do not cross-post to more than one - see Is cross-posting a question on multiple Stack Exchange sites permitted if the question is on-topic for each site? (tl;dr: no).

Resources

Papers

Books

Videos

Stack Exchange Sites

Other StackExchange sites with Deep Learning tag:

27406 questions
6
votes
1 answer

Deep multi-task learning with missing labels

I have a deep multi-task network that is designed to work on three separate classification tasks. While each training example has a label for two of the classification tasks, only about 10% to 15% of the training examples have a label for the third…
phil510
  • 111
  • 1
  • 11
6
votes
3 answers

How to split a model trained in keras?

I trained a model with 4 hidden layers and 2 dense layers and I have saved that model. Now I want to load that model and want to split into two models, one with one hidden layers and another one with only dense layers. I have splitted the model with…
Akhilesh
  • 1,024
  • 1
  • 9
  • 26
6
votes
1 answer

TensorFlow: TypeError: Using a `tf.Tensor` as a Python `bool` is not allowed

I'm trying to define a triplet loss using descriptor from a CNN's output, but this error showed up when I try to train the network. My definition of loss function: def compute_loss(descriptor, margin): diff_pos = descriptor[0:1800:3] -…
Yingqiang Gao
  • 939
  • 4
  • 16
  • 29
6
votes
1 answer

Why is ReLU applied after residual connection in ResNet?

In the ResNet architecture, why is the ReLU activation applied after the element-wise addition with the residual in a residual block, instead of before it?
Shen Zhuoran
  • 385
  • 3
  • 13
6
votes
1 answer

Keras training with batches: Is the training loss computed before or after each optimization step?

this is probably a very basic question, however I wasn't able to find an answer to it: When I train a network with Keras using batches, the console output shows and keeps updating a display of the current loss value of the training set during each…
KiraMichiru
  • 958
  • 6
  • 13
6
votes
1 answer

Keras: Confusion matrix at every epoch

I am able to log the loss at every epoch using Keras Callback after referring this. Is there any way I can compute Confusion Matrix and use it as a metric? Update I have tried to define the following function to return me the confusion matrix, but…
user5722540
  • 590
  • 8
  • 24
6
votes
1 answer

First training epoch is very slow

Hi… I’m running mnist code in my P3 AWS machine and the initialization process seems to be very long compared to my previous P2 machine (although P3>P2) Train on 60000 samples, validate on 10000 samples Epoch 1/10 60000/60000…
Jenia Golbstein
  • 374
  • 2
  • 12
6
votes
5 answers

How to upload and save large data to Google Colaboratory from local drive?

I have downloaded large image training data as zip from this Kaggle link https://www.kaggle.com/c/yelp-restaurant-photo-classification/data How do I efficiently achieve the following? Create a project folder in Google Colaboratory Upload zip file…
6
votes
1 answer

AttributeError: 'str' object has no attribute 'ndim'

I'm using Keras to implement a sentiment analysis code. I have my training data as follows: pos.txt : text file of all positive reviews separated by line neg.txt : text file of all negative reviews separated by line I build my code in a similar…
Amy
  • 79
  • 1
  • 2
  • 8
6
votes
3 answers

Keras MSE definition

I stumbled across the definition of mse in Keras and I can't seem to find an explanation. def mean_squared_error(y_true, y_pred): return K.mean(K.square(y_pred - y_true), axis=-1) I was expecting the mean to be taken across the batches, which…
Nima Mousavi
  • 1,601
  • 2
  • 21
  • 30
6
votes
2 answers

1d CNN audio in keras

I want to try to implement the neural network architecture of the attached image: 1DCNN_model Consider that I've got a dataset X which is (N_signals, 1500, 40) where 40 is the number of features where I want to do the 1d convolution on. My Y is…
6
votes
1 answer

Computational GPU (Tesla K40c) is slower than graphics GPU (GTX 960)

I am running deep learning CNN (4-CNN layers and 3 FNN layers) model (written in Keras with tensorflow as backend) on two different machines. I have 2 machines (A: with a GTX 960 graphics GPU with 2GB memory & clock speed: 1.17 GHz and B: with a…
SUB
  • 287
  • 3
  • 14
6
votes
1 answer

Tensorflow feature columns in Dataset map Table already initialized issue

I've run into an issue trying to use Tensorflow's feature_column mappings inside of a function passed in to the Dataset map method. This happens when trying to one hot encode categorical string features of a Dataset as part of the input pipeline…
ahmed
  • 61
  • 4
6
votes
1 answer

Get the loss that a given optimizer is minimizing in Tensorflow

I am working in a unit test system for my tensorflow workspace and I would like to know if there is any method or attribute, given a graph with an optimizer operation (after calling .minimize()), to obtain the final loss tensor that it is optimizing…
ivallesp
  • 2,018
  • 1
  • 14
  • 21
6
votes
1 answer

How to obtain the convex curve for weights vs loss in a neural network

In most of the literature of Neural networks the 3D plot of weights, bias and the loss function is shown as below, When I tried I obtained a plot like this one Here are more details, Here is the glimpse of the dataset, there are 15,000 training…