ResearchHub | Open Science Community

P. Simard

Author with expertise in Learning with Noisy Labels in Machine Learning

Achievements

Cited Author

Key Stats

Upvotes received: 0

Publications: 3 (0% Open Access)

Cited by: 10,421

h-index: 28

i10-index: 37

Reputation

Biology: < 1%

Chemistry: < 1%

Economics: < 1%


Publications

Learning long-term dependencies with gradient descent is difficult

Yoshua Bengio et al., Mar 1, 1994

Recurrent neural networks can be used to map input sequences to output sequences, such as for recognition, production or prediction problems. However, practical difficulties have been reported in training recurrent neural networks to perform tasks in which the temporal contingencies present in the input/output sequences span long intervals. We show why gradient based learning algorithms face an increasingly difficult problem as the duration of the dependencies to be captured increases. These results expose a trade-off between efficient learning by gradient descent and latching on information for long periods. Based on an understanding of this problem, alternatives to standard gradient descent are considered.
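The trade-off described in this abstract can be illustrated with a toy case. The following is a minimal sketch, not the paper's analysis: it assumes a hypothetical one-unit recurrence h_t = tanh(w * h_{t-1}) with no input, and the function name and parameter values are illustrative only. With w = 2 the unit settles near a stable fixed point (it "latches"), and the sensitivity of the final state to the initial state, computed by the chain rule, shrinks at every step.

```python
import math


def long_term_sensitivity(w, h0, steps):
    """Return |d h_T / d h_0| for the toy recurrence h_t = tanh(w * h_{t-1})."""
    h = h0
    sensitivity = 1.0
    for _ in range(steps):
        h = math.tanh(w * h)
        # Chain rule: d tanh(w * h_prev) / d h_prev = w * (1 - tanh(w * h_prev)^2)
        sensitivity *= w * (1.0 - h * h)
    return abs(sensitivity)


if __name__ == "__main__":
    # Illustrative values only: w = 2.0 puts the unit in a latching regime.
    for steps in (1, 5, 10, 50):
        print(steps, long_term_sensitivity(w=2.0, h0=0.5, steps=steps))
```

In this toy setting, the same contraction that keeps the state latched also multiplies the gradient by a factor below one at each step, so the gradient signal from distant time steps becomes very small.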

Artificial Intelligence

Machine Learning

Best practices for convolutional neural networks applied to visual document analysis

P. Simard et al., Apr 12, 2005

Neural networks are a powerful technology for classification of visual inputs arising from documents. However, there is a confusing plethora of different neural network methods that are used in the literature and in industry. This paper describes a set of concrete best practices that document analysis researchers can use to get good results with neural networks. The most important practice is getting a training set as large as possible: we expand the training set by adding a new form of distorted data. The next most important practice is that convolutional neural networks are better suited for visual document tasks than fully connected networks. We propose that a simple do-it-yourself implementation of convolution with a flexible architecture is suitable for many visual document problems. This simple convolutional neural network does not require complex methods, such as momentum, weight decay, structure-dependent learning rates, averaging layers, tangent prop, or even finely-tuning the architecture. The end result is a very simple yet general architecture which can yield state-of-the-art performance for document analysis. We illustrate our claims on the MNIST set of English digit images.
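The idea of expanding a training set with distorted copies can be sketched as follows. This abstract does not say which distortion is used, so the example substitutes a deliberately simple stand-in: small random pixel translations. The helper names `shift_image` and `expand_training_set`, and the parameters `copies`, `max_shift`, and `seed`, are hypothetical and chosen only for illustration.

```python
import random


def shift_image(image, dx, dy, fill=0):
    """Translate a 2D list-of-lists image by (dx, dy) pixels, padding with `fill`."""
    height, width = len(image), len(image[0])
    shifted = [[fill] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            src_y, src_x = y - dy, x - dx
            if 0 <= src_y < height and 0 <= src_x < width:
                shifted[y][x] = image[src_y][src_x]
    return shifted


def expand_training_set(images, labels, copies=2, max_shift=1, seed=0):
    """Return the originals plus `copies` randomly shifted variants of each image.

    Assumption: a translation is only a stand-in for the distortion named in
    the abstract; labels are assumed unchanged by small shifts.
    """
    rng = random.Random(seed)
    out_images, out_labels = list(images), list(labels)
    for image, label in zip(images, labels):
        for _ in range(copies):
            dx = rng.randint(-max_shift, max_shift)
            dy = rng.randint(-max_shift, max_shift)
            out_images.append(shift_image(image, dx, dy))
            out_labels.append(label)
    return out_images, out_labels
```

The design point is that each distorted copy keeps its original label, so the classifier sees more labeled variation without any new annotation effort.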

Artificial Intelligence

Comparison of classifier methods: a case study in handwritten digit recognition

Léon Bottou et al., Dec 17, 2002

This paper compares the performance of several classifier algorithms on a standard database of handwritten digits. We consider not only raw accuracy, but also training time, recognition time, and memory requirements. When available, we report measurements of the fraction of patterns that must be rejected so that the remaining patterns have misclassification rates less than a given threshold.
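The rejection measurement mentioned at the end of this abstract can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the paper's procedure: it assumes each pattern comes with a classifier confidence score, that patterns are rejected in order of lowest confidence first, and that the input is non-empty. The function name `rejection_fraction` and its parameters are hypothetical.

```python
def rejection_fraction(confidences, correct, max_error):
    """Smallest fraction of patterns to reject (lowest confidence first) so that
    the error rate on the accepted patterns is below `max_error`.

    Assumptions: `confidences` and `correct` are non-empty and aligned, and
    ties in confidence are broken by input order.
    """
    order = sorted(range(len(confidences)),
                   key=lambda i: confidences[i], reverse=True)
    errors = 0
    best_accepted = 0
    for accepted, i in enumerate(order, start=1):
        if not correct[i]:
            errors += 1
        # Keep the largest accepted set whose error rate stays under the limit.
        if errors / accepted < max_error:
            best_accepted = accepted
    return 1.0 - best_accepted / len(order)


if __name__ == "__main__":
    conf = [0.9, 0.8, 0.7, 0.6, 0.5]
    ok = [True, True, True, False, True]
    print(rejection_fraction(conf, ok, max_error=0.1))
```

Reporting this fraction alongside raw accuracy shows how much coverage a classifier has to give up to reach a target error rate on the patterns it does accept.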

Artificial Intelligence

Computer Vision And Pattern Recognition
