Questions tagged [deep-learning]

Deep Learning is an area of machine learning whose goal is to learn complex functions using special neural network architectures that are "deep" (consist of many layers). This tag should be used for questions about implementation of deep learning architectures. General machine learning questions should be tagged "machine learning". Including a tag for the relevant software library (e.g., "keras", "tensorflow","pytorch","fast.ai" etc) is helpful.

Deep Learning is a branch of machine-learning aimed at building neural-networks to learn complex functions using special neural network architectures with many layers (hence the term "deep").

Deep neural network architectures allow for more complex tasks to be learned because, in addition to these neural networks having more layers to perform transformations, the larger number of layers and more complex architectures of the neural network allow a hierarchical organization of functionality to emerge.

Deep Learning was introduced into machine learning research with the intention of moving machine learning closer to artificial intelligence. A significant impact of deep learning lies in feature learning, mitigating much of the effort going into manual feature engineering in non-deep learning neural networks.

NOTE: If you want to use this tag for a question not directly concerning implementation, then consider posting on Cross Validated, Data Science, or Artificial Intelligence instead; otherwise your question is probably off-topic. Please choose one site only and do not cross-post to more than one - see Is cross-posting a question on multiple Stack Exchange sites permitted if the question is on-topic for each site? (tl;dr: no).

Resources

Papers

Deep Learning in Neural Networks: An Overview

Books

Neural Networks and Deep Learning By Michael Nielsen - this is a free book with associated Python source code on Github
Deep Learning
Deep Learning Made Easy with R: A Gentle Introduction For Data Science
Deep Learning: Methods and Applications
Autonomous Robotics and Deep Learning
Deep Learning with Python
Probabilistic Deep Learning with Python

Videos

Neural Networks Demystified - accompanied by a set of Jupyter Notebooks

Stack Exchange Sites

Other StackExchange sites with Deep Learning tag:

27406 questions

votes

1 answer

Deep multi-task learning with missing labels

I have a deep multi-task network that is designed to work on three separate classification tasks. While each training example has a label for two of the classification tasks, only about 10% to 15% of the training examples have a label for the third…

python tensorflow neural-network deep-learning

asked Mar 11 '18 at 01:03

phil510

votes

3 answers

How to split a model trained in keras?

I trained a model with 4 hidden layers and 2 dense layers and I have saved that model. Now I want to load that model and want to split into two models, one with one hidden layers and another one with only dense layers. I have splitted the model with…

machine-learning deep-learning keras conv-neural-network

asked Mar 09 '18 at 12:08

Akhilesh

1,024
1
9
26

votes

1 answer

TensorFlow: TypeError: Using a `tf.Tensor` as a Python `bool` is not allowed

I'm trying to define a triplet loss using descriptor from a CNN's output, but this error showed up when I try to train the network. My definition of loss function: def compute_loss(descriptor, margin): diff_pos = descriptor[0:1800:3] -…

python numpy tensorflow neural-network deep-learning

asked Mar 08 '18 at 15:50

Yingqiang Gao

votes

1 answer

Why is ReLU applied after residual connection in ResNet?

In the ResNet architecture, why is the ReLU activation applied after the element-wise addition with the residual in a residual block, instead of before it?

computer-vision deep-learning resnet

asked Mar 01 '18 at 08:49

Shen Zhuoran

votes

1 answer

Keras training with batches: Is the training loss computed before or after each optimization step?

this is probably a very basic question, however I wasn't able to find an answer to it: When I train a network with Keras using batches, the console output shows and keeps updating a display of the current loss value of the training set during each…

python deep-learning keras loss-function

asked Feb 27 '18 at 12:35

KiraMichiru

votes

1 answer

Keras: Confusion matrix at every epoch

I am able to log the loss at every epoch using Keras Callback after referring this. Is there any way I can compute Confusion Matrix and use it as a metric? Update I have tried to define the following function to return me the confusion matrix, but…

python deep-learning keras

asked Feb 27 '18 at 09:59

user5722540

votes

1 answer

First training epoch is very slow

Hi… I’m running mnist code in my P3 AWS machine and the initialization process seems to be very long compared to my previous P2 machine (although P3>P2) Train on 60000 samples, validate on 10000 samples Epoch 1/10 60000/60000…

tensorflow deep-learning keras mnist

asked Feb 27 '18 at 09:58

Jenia Golbstein

votes

5 answers

How to upload and save large data to Google Colaboratory from local drive?

I have downloaded large image training data as zip from this Kaggle link https://www.kaggle.com/c/yelp-restaurant-photo-classification/data How do I efficiently achieve the following? Create a project folder in Google Colaboratory Upload zip file…

python google-drive-api deep-learning jupyter-notebook google-colaboratory

asked Feb 19 '18 at 06:27

GeorgeOfTheRF

8,244
23
57
80

votes

1 answer

AttributeError: 'str' object has no attribute 'ndim'

I'm using Keras to implement a sentiment analysis code. I have my training data as follows: pos.txt : text file of all positive reviews separated by line neg.txt : text file of all negative reviews separated by line I build my code in a similar…

python string tensorflow deep-learning keras

asked Feb 07 '18 at 18:34

Amy

votes

3 answers

Keras MSE definition

I stumbled across the definition of mse in Keras and I can't seem to find an explanation. def mean_squared_error(y_true, y_pred): return K.mean(K.square(y_pred - y_true), axis=-1) I was expecting the mean to be taken across the batches, which…

python machine-learning neural-network deep-learning keras

asked Feb 05 '18 at 08:37

Nima Mousavi

1,601
2
21
30

votes

2 answers

1d CNN audio in keras

I want to try to implement the neural network architecture of the attached image: 1DCNN_model Consider that I've got a dataset X which is (N_signals, 1500, 40) where 40 is the number of features where I want to do the 1d convolution on. My Y is…

python machine-learning neural-network deep-learning keras

asked Feb 04 '18 at 13:22

SilverMatt

votes

1 answer

Computational GPU (Tesla K40c) is slower than graphics GPU (GTX 960)

I am running deep learning CNN (4-CNN layers and 3 FNN layers) model (written in Keras with tensorflow as backend) on two different machines. I have 2 machines (A: with a GTX 960 graphics GPU with 2GB memory & clock speed: 1.17 GHz and B: with a…

tensorflow deep-learning keras gpu

asked Feb 02 '18 at 19:05

SUB

votes

1 answer

Tensorflow feature columns in Dataset map Table already initialized issue

I've run into an issue trying to use Tensorflow's feature_column mappings inside of a function passed in to the Dataset map method. This happens when trying to one hot encode categorical string features of a Dataset as part of the input pipeline…

python tensorflow machine-learning deep-learning

asked Jan 25 '18 at 19:49

ahmed

votes

1 answer

Get the loss that a given optimizer is minimizing in Tensorflow

I am working in a unit test system for my tensorflow workspace and I would like to know if there is any method or attribute, given a graph with an optimizer operation (after calling .minimize()), to obtain the final loss tensor that it is optimizing…

tensorflow neural-network deep-learning

asked Jan 23 '18 at 20:07

ivallesp

2,018
1
14
21

votes

1 answer

How to obtain the convex curve for weights vs loss in a neural network

In most of the literature of Neural networks the 3D plot of weights, bias and the loss function is shown as below, When I tried I obtained a plot like this one Here are more details, Here is the glimpse of the dataset, there are 15,000 training…

python machine-learning neural-network deep-learning gradient-descent

asked Jan 23 '18 at 11:30

Karthic Rao

3,624
8
30
44

Prev 1 2 3

…

99 100 Next