Questions tagged [mxnet]

Apache MXNet is a fast, scalable, open-source deep learning framework that supports Python, R, Scala, C++, Julia, Go, and JavaScript. MXNet can run on multiple CPUs and multiple GPUs distributed across multiple hosts, with almost linear scalability.

Apache MXNet: Flexible and Efficient Library for Deep Learning

For more details see http://mxnet.io/

Flexible

Supports both imperative and symbolic programming

Portable

Runs on CPUs or GPUs, on clusters, servers, desktops, or mobile phones

Multiple Languages

Supports over 7 programming languages, including C++, Python, R, Scala, Julia, MATLAB, JavaScript, and Perl.

Auto-Differentiation

Calculates the gradient automatically for training a model

Distributed on Cloud

Supports distributed training on multiple CPU/GPU machines, including AWS, GCE, Azure, and Yarn clusters

Performance

Optimized C++ backend engine parallelizes both I/O and computation

701 questions
0
votes
1 answer

How to resume training with optimizer status in MXNet

I want to implement a “resume training” feature for my training program, but I don’t know how to correctly restore the optimizer state. My program looks like this: opt = mx.optimizer(learning_rate=lr, ....) ctx = [...] sym = get_symbol() # The…
0
votes
1 answer

Training MXNet with large batch size and small number of GPUs

Suppose that, when training with MXNet, the batch size is large (say 128), the number of GPUs is small (say 2), and each GPU can only handle a few samples per iteration (say 16). By default, the maximum batch size of this configuration is 16 * 2 = 32. In theory,…
Targo
  • 67
  • 6
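The arithmetic in the excerpt above can be illustrated with a small sketch. It assumes gradient accumulation as one common way to reach a larger effective batch size; the truncated excerpt does not say whether this is what the asker had in mind. The code is plain Python rather than MXNet, and every name in it (`mean_gradient`, `accum_steps`, the toy data) is hypothetical.

```python
# Illustrative only: plain Python, no MXNet. All names are hypothetical.

def mean_gradient(w, samples):
    """Gradient of the mean squared error of y = w * x over `samples`."""
    return sum(2.0 * (w * x - y) * x for x, y in samples) / len(samples)

# Numbers taken from the question excerpt.
target_batch = 128
per_gpu = 16
num_gpus = 2
step_batch = per_gpu * num_gpus            # 32 samples per forward/backward pass
accum_steps = target_batch // step_batch   # 4 passes per weight update

# Toy data: 128 (x, y) pairs.
data = [(float(i % 7), float(3 * (i % 7) + 1)) for i in range(target_batch)]
w = 0.5

# Accumulate the gradients of the smaller passes, then average them.
accumulated = 0.0
for step in range(accum_steps):
    chunk = data[step * step_batch:(step + 1) * step_batch]
    accumulated += mean_gradient(w, chunk)
accumulated /= accum_steps

# Gradient computed over all 128 samples at once, for comparison.
full = mean_gradient(w, data)
```

For a loss that is a plain mean over samples and equally sized passes, the averaged gradient matches the full-batch gradient. Layers whose behavior depends on the batch itself (batch normalization, for example) would not be covered by this equivalence.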
0
votes
1 answer

How to do linear algebraic functions with mxnet symbols to write a custom loss function (e.g. online triplet mining)?

I referred to this implementation in TensorFlow. It requires the shape of the output batch embeddings, but I can't get the actual shape of an mxnet symbol. Any ideas how?
swmeng
  • 1
0
votes
2 answers

Error in library(mxnet) : there is no package called ‘mxnet’

I started installing the mxnet package for R on Ubuntu with the following commands: install.packages("drat", repos="https://cran.rstudio.com") drat:::addRepo("dmlc") install.packages("mxnet") While executing them I got the following…
Vishnu Mankulam
  • 37
  • 1
  • 1
  • 9
0
votes
1 answer

MXNet: using NDArray scatter_nd on multiple GPUs

When I train on multiple GPUs with MXNet (CUDA 8.0 + cuDNN 7), I first initialize the parameters on the different contexts and then perform scatter_nd on each context. The first scatter_nd call works perfectly, but when computing for the second GPU card,…
Poodar
  • 145
  • 1
  • 2
  • 7
0
votes
1 answer

Serving multiple deep learning models from cluster

I was thinking about how one should deploy multiple models for use. I am currently working with TensorFlow. I was referring to this and this article, but I am not able to find any article that addresses the need to serve several models in a distributed manner.…
MsA
  • 2,599
  • 3
  • 22
  • 47
0
votes
1 answer

mxnet: how to set up a custom mx.io.DataIter to prefetch?

My mxnet script is likely limited by the I/O of loading data onto the GPU, and I am trying to speed this up by prefetching. The trouble is I can't figure out how to prefetch with a custom data iterator. My first hypothesis/hope was that it would be…
TFdoe
  • 571
  • 5
  • 16
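The general idea of prefetching mentioned in the excerpt above can be sketched independently of MXNet. This is a minimal illustration that assumes a background thread filling a bounded queue; it is not MXNet's own prefetching mechanism and not the asker's code, and `PrefetchIterator` and `slow_batches` are hypothetical names.

```python
import queue
import threading

class PrefetchIterator:
    """Wrap any iterable and load up to `depth` items ahead in a background thread."""

    _DONE = object()  # sentinel marking the end of the source iterable

    def __init__(self, source, depth=2):
        self._queue = queue.Queue(maxsize=depth)
        self._thread = threading.Thread(
            target=self._fill, args=(source,), daemon=True
        )
        self._thread.start()

    def _fill(self, source):
        for item in source:
            self._queue.put(item)   # blocks while the buffer is full
        self._queue.put(self._DONE)

    def __iter__(self):
        return self

    def __next__(self):
        item = self._queue.get()    # blocks until the next item is ready
        if item is self._DONE:
            raise StopIteration
        return item

def slow_batches(n):
    """Hypothetical stand-in for an expensive data-loading loop."""
    for i in range(n):
        yield [i, i + 1]

batches = list(PrefetchIterator(slow_batches(4)))
```

The bounded queue keeps memory use predictable: the loader thread runs at most `depth` items ahead of the consumer. This sketch does not forward exceptions raised inside the loader thread, which a production version would need to handle.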
0
votes
1 answer

Define a custom function with mxnet model

I am trying to use a custom function with an mxnet neural network model. This custom function is supposed to create a fuzzy representation of the final layer activation vector. I am confused about how to make this work, as regular Python functions are…
DigitalFox
  • 1,486
  • 1
  • 13
  • 17
0
votes
1 answer

mxnet: when using an RNN in model, model initializer crashes with context of multiple GPUs

I am new to mxnet and running a script lightly modified from the documentation on RNN with gluon. I modified the code so that I am working with strictly numerical time series rather than an NLP problem. Everything was running great until I modified…
TFdoe
  • 571
  • 5
  • 16
0
votes
1 answer

mxnet infer_shape_entry TypeError: an integer is required

I hit an infer_shape error when testing my custom operator. Error Message: Error in .infer_shape: Traceback (most recent call last): File "~/anaconda2/envs/mx110-py27/lib/python2.7/site-packages/mxnet/operator.py", line 658, in…
Weijie Su
  • 1
  • 1
0
votes
1 answer

Advice on when to stop training a resnet50 network (mxnet on aws)

I have a custom dataset of approximately 20k images (10% used for validation). Roughly 1/3 are in label class 0, 1/3 are in label class 1, and 1/3 contain no class 0 or class 1 objects and carry a -1 label. I have run approximately 400 epochs, the last 40…
jlwebuser
  • 49
  • 7
0
votes
1 answer

Theoretical underpinning behind Hardmax operator

In the TensorFlow GitHub repository, in the file attentionwrapper.py, a hardmax operator is defined. The docs mention it as tf.contrib.seq2seq.hardmax. I want to know what's the theoretical underpinning behind providing this…
Chaitanya Bapat
  • 3,381
  • 6
  • 34
  • 59
0
votes
1 answer

The problem with the allocation of memory for labels in mxnet, python

I am classifying 128x128-pixel color images into five classes with a batch size of 64, using a Generative Adversarial Network. When creating the discriminator module, on executing discriminator.bind(data_shapes = image_iter.provide_data, label_shapes =…
0
votes
2 answers

gluon model shape inconsistency

I am trying the Gluon model zoo. import mxnet as mx from mxnet.gluon.model_zoo import vision import cv2 import numpy as np ctx = mx.gpu(6) # successful net = vision.alexnet(pretrained=True, ctx=ctx) # preparing input image. # You may ignore this…
plhn
  • 5,017
  • 4
  • 47
  • 47
0
votes
1 answer

MXNET CNN+LSTM save/serialize to json

I'm having a hard time figuring out how to correctly define an mxnet net so that I can serialize/convert this model to a JSON file. The pipeline is composed of a CNN + biLSTM + CTC. I know I must use HybridBlock and hybridize() but I can't seem to…