The end of the first tutorial, "MNIST for ML Beginners", has this paragraph:
What matters is that we learned from this model. Still, if you're feeling a bit down about these results, check out the next tutorial where we do a lot better, and learn how to build more sophisticated models using TensorFlow!
But the next tutorial is "Deep MNIST for Experts", and it does not explain things nearly as well as the first tutorial. For example, the second tutorial starts differing at the Build a Multilayer Convolutional Network
section, but the very first paragraph under Weight Initialization
does not explain the concepts it is using:
To create this model, we're going to need to create a lot of weights and biases. One should generally initialize weights with a small amount of noise for symmetry breaking, and to prevent 0 gradients. Since we're using ReLU neurons, it is also good practice to initialize them with a slightly positive initial bias to avoid "dead neurons." Instead of doing this repeatedly while we build the model, let's create two handy functions to do it for us.
What are "ReLU neurons"? Why are we using them? What _is_ a convolutional network even?
I would say, either
the next tutorial
but instead say a later tutorial on MNIST for Experts
.Thanks!
Seconded. I think an explanation of conv nets would be helpful in the context of MNIST digits.
The TensorFlow tutorials cannot replace a deep learning or machine learning textbook or course. We may add pointers to additional resources as those become available.
Not only this tutorial is complicated, all these tutorials are complicated.
For example, what I want to do is:
But Tensorflow gives me already at the beginning a 4-file MNIST tutorial, with complicated graphs and complicated mechanisms for loading files into the network
Most helpful comment
Not only this tutorial is complicated, all these tutorials are complicated.
For example, what I want to do is:
But Tensorflow gives me already at the beginning a 4-file MNIST tutorial, with complicated graphs and complicated mechanisms for loading files into the network