Questions tagged [deep-learning]

Deep Learning is an area of machine learning whose goal is to learn complex functions using special neural network architectures that are "deep" (consist of many layers). This tag should be used for questions about implementation of deep learning architectures. General machine learning questions should be tagged "machine learning". Including a tag for the relevant software library (e.g., "keras", "tensorflow","pytorch","fast.ai" etc) is helpful.

Deep Learning is a branch of machine-learning aimed at building neural-networks to learn complex functions using special neural network architectures with many layers (hence the term "deep").

Deep neural network architectures allow for more complex tasks to be learned because, in addition to these neural networks having more layers to perform transformations, the larger number of layers and more complex architectures of the neural network allow a hierarchical organization of functionality to emerge.

Deep Learning was introduced into machine learning research with the intention of moving machine learning closer to artificial intelligence. A significant impact of deep learning lies in feature learning, mitigating much of the effort going into manual feature engineering in non-deep learning neural networks.

NOTE: If you want to use this tag for a question not directly concerning implementation, then consider posting on Cross Validated, Data Science, or Artificial Intelligence instead; otherwise your question is probably off-topic. Please choose one site only and do not cross-post to more than one - see Is cross-posting a question on multiple Stack Exchange sites permitted if the question is on-topic for each site? (tl;dr: no).

Resources

Papers

Deep Learning in Neural Networks: An Overview

Books

Neural Networks and Deep Learning By Michael Nielsen - this is a free book with associated Python source code on Github
Deep Learning
Deep Learning Made Easy with R: A Gentle Introduction For Data Science
Deep Learning: Methods and Applications
Autonomous Robotics and Deep Learning
Deep Learning with Python
Probabilistic Deep Learning with Python

Videos

Neural Networks Demystified - accompanied by a set of Jupyter Notebooks

Stack Exchange Sites

Other StackExchange sites with Deep Learning tag:

27406 questions

votes

2 answers

What is a multi-headed model? And what exactly is a 'head' in a model?

What is a multi-headed model in deep learning? The only explanation I found so far is this: Every model might be thought of as a backbone plus a head, and if you pre-train backbone and put a random head, you can fine tune it and it is a good…

machine-learning neural-network deep-learning

asked May 06 '19 at 11:39

spacer.34

votes

9 answers

Keras model.summary() object to string

I want to write a *.txt file with the neural network hyperparameters and the model architecture. Is it possible to write the object model.summary() to my output file? (...) summary = str(model.summary()) (...) out = open(filename +…

python tensorflow machine-learning keras deep-learning

asked Jan 15 '17 at 20:14

lmpeixoto

votes

9 answers

Error when checking model input: expected convolution2d_input_1 to have 4 dimensions, but got array with shape (32, 32, 3)

I want to train a deep network starting with the following layer: model = Sequential() model.add(Conv2D(32, 3, 3, input_shape=(32, 32, 3))) using history = model.fit_generator(get_training_data(), samples_per_epoch=1,…

deep-learning keras keras-layer

asked Jan 10 '17 at 07:51

Oblomov

8,953
22
60
106

votes

4 answers

Keras - Difference between categorical_accuracy and sparse_categorical_accuracy

What is the difference between categorical_accuracy and sparse_categorical_accuracy in Keras? There is no hint in the documentation for these metrics, and by asking Dr. Google, I did not find answers for that either. The source code can be found…

python tensorflow machine-learning keras deep-learning

asked Jun 10 '17 at 19:55

jcklie

4,054
3
24
42

votes

4 answers

How to find Number of parameters of a keras model?

For a Feedforward Network (FFN), it is easy to compute the number of parameters. Given a CNN, LSTM etc is there a quick way to find the number of parameters in a keras model?

deep-learning keras

asked Mar 04 '16 at 09:25

Anuj Gupta

6,328
7
36
55

votes

3 answers

Evaluating pytorch models: `with torch.no_grad` vs `model.eval()`

When I want to evaluate the performance of my model on the validation set, is it preferred to use with torch.no_grad: or model.eval()?

python machine-learning deep-learning pytorch autograd

asked Apr 11 '19 at 08:16

Tom Hale

40,825
36
187
242

votes

4 answers

Unbalanced data and weighted cross entropy

I'm trying to train a network with an unbalanced data. I have A (198 samples), B (436 samples), C (710 samples), D (272 samples) and I have read about the "weighted_cross_entropy_with_logits" but all the examples I found are for binary…

python machine-learning tensorflow deep-learning

asked Jun 15 '17 at 06:51

Sergiodiaz53

1,268
2
14
23

votes

5 answers

Dimension of shape in conv1D

I have tried to build a CNN with one layer, but I have some problem with it. Indeed, the compilator says me that ValueError: Error when checking model input: expected conv1d_1_input to have 3 dimensions, but got array with shape (569, 30) This is…

python tensorflow keras deep-learning conv-neural-network

asked Apr 13 '17 at 15:44

protti

votes

2 answers

How to append data to one specific dataset in a hdf5 file with h5py

I am looking for a possibility to append data to an existing dataset inside a .h5 file using Python (h5py). A short intro to my project: I try to train a CNN using medical image data. Because of the huge amount of data and heavy memory usage during…

python numpy deep-learning hdf5 h5py

asked Nov 02 '17 at 10:23

Midas.Inc

1,730
3
13
25

votes

3 answers

TensorFlow - regularization with L2 loss, how to apply to all weights, not just last one?

I am playing with a ANN which is part of Udacity DeepLearning course. I have an assignment which involves introducing generalization to the network with one hidden ReLU layer using L2 loss. I wonder how to properly introduce it so that ALL weights…

machine-learning neural-network tensorflow deep-learning regularized

asked Jul 09 '16 at 22:00

Maksim Khaitovich

4,742
7
39
70

votes

3 answers

Pytorch: nn.Dropout vs. F.dropout

There are two ways to perform dropout: torch.nn.Dropout torch.nn.functional.Dropout I ask: Is there a difference between them? When should I use one over the other? I don't see any performance difference when I switched them around.

python deep-learning neural-network pytorch dropout

asked Nov 21 '18 at 19:44

CutePoison

4,679
5
28
63

votes

6 answers

Keras Text Preprocessing - Saving Tokenizer object to file for scoring

I've trained a sentiment classifier model using Keras library by following the below steps(broadly). Convert Text corpus into sequences using Tokenizer object/class Build a model using the model.fit() method Evaluate this model Now for scoring…

machine-learning neural-network nlp deep-learning keras

asked Aug 17 '17 at 12:25

Rajkumar Kaliyaperumal

votes

8 answers

OpenCL / AMD: Deep Learning

While "googl'ing" and doing some research I were not able to find any serious/popular framework/sdk for scientific GPGPU-Computing and OpenCL on AMD hardware. Is there any literature and/or software I missed? Especially I am interested in deep…

sdk opencl neural-network gpgpu deep-learning

asked Jun 03 '15 at 14:20

daniel451

10,626
19
67
125

votes

4 answers

2-D convolution as a matrix-matrix multiplication

I know that, in the 1D case, the convolution between two vectors, a and b, can be computed as conv(a, b), but also as the product between the T_a and b, where T_a is the corresponding Toeplitz matrix for a. Is it possible to extend this idea to…

neural-network deep-learning conv-neural-network matrix-multiplication convolution

asked May 28 '13 at 18:24

no_name

1,315
2
16
20

votes

3 answers

Saving best model in keras

I use the following code when training a model in keras from keras.callbacks import EarlyStopping model = Sequential() model.add(Dense(100, activation='relu', input_shape = input_shape)) model.add(Dense(1)) model_2.compile(optimizer='adam',…

python keras deep-learning neural-network

asked Jan 16 '18 at 15:51

dJOKER_dUMMY

Prev 1 2 3

…

99 100 Next