Learn Before
Reference

How to debug a neural network with gradient checking