1992
DOI: 10.1109/72.125861
Using random weights to train multilayer networks of hard-limiting units

Abstract: A gradient descent algorithm suitable for training multilayer feedforward networks of processing units with hard-limiting output functions is presented. The conventional backpropagation algorithm cannot be applied in this case because the required derivatives are not available. However, if the network weights are random variables with smooth distribution functions, the probability of a hard-limiting unit taking one of its two possible values is a continuously differentiable function. In the paper, this is used…
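
The idea summarized in the abstract can be illustrated briefly. The sketch below is not the paper's algorithm; it only shows, under the assumption of independent Gaussian weights, why the firing probability of a hard-limiting unit is a smooth function of the weight distribution's mean, which is what makes gradient descent applicable. The function and variable names (unit_fire_probability, mu, sigma) are illustrative assumptions, not taken from the paper.

```python
# Minimal sketch: a hard-limiting unit outputs sign(w . x), which is not
# differentiable in w. If instead w ~ N(mu, sigma^2 I), then w . x is Gaussian
# with mean mu . x and std sigma * ||x||, so the probability of firing is a
# Gaussian CDF and is smooth in mu.
import numpy as np
from scipy.stats import norm

def unit_fire_probability(mu, sigma, x):
    """P(sign(w . x) = +1) when the weights w are N(mu, sigma^2 I)."""
    return norm.cdf(mu @ x / (sigma * np.linalg.norm(x)))

def grad_wrt_mu(mu, sigma, x):
    """Gradient of the firing probability with respect to the mean weights mu."""
    scale = sigma * np.linalg.norm(x)
    z = mu @ x / scale
    return norm.pdf(z) * x / scale

# The probability and its gradient respond smoothly to changes in mu,
# even though the unit itself is a hard threshold.
mu = np.array([0.5, -0.2, 0.1])
x = np.array([1.0, 2.0, -1.0])
print(unit_fire_probability(mu, 0.3, x))
print(grad_wrt_mu(mu, 0.3, x))
```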

Cited by 44 publications (22 citation statements)
References 10 publications
“…Threshold activation functions can be realized with analog comparators. Very few hard-limiting function, multilayer network training algorithms have been developed [15,16,20,21,22]. A multilayer extension of the LMS, known as MR2, was chosen for ease of coding [16].…”
Section: Algorithm Selection (mentioning)
confidence: 99%
“…the means and standard deviations of the appropriate Gaussian distributions [33,34]. In the reversible generalisation, where each neuron is replaced by a permutation matrix, we find that the output is no longer a function of the inputs and continuous weights, but rather of the inputs and a discrete set of permutation matrices. However, in the generalisation to unitaries, for a gate with n inputs and outputs, there exist an infinite number of unitaries, in contrast with the discrete set of permutation matrices.…”
(mentioning)
confidence: 97%
“…Bartlett in [1] introduced another approach by defining the weights as random variables with smooth distribution functions and proposed an algorithm, similar to BP, that adjusts the parameters of the weights' distributions. In [4], Corwin suggested training NDAs with progressively steeper analog functions to facilitate training.…”
Section: Training Methods For Network With Discrete Activations (mentioning)
confidence: 99%
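
The "progressively steeper analog functions" attributed to Corwin in the excerpt above can be pictured as a sigmoid whose slope is annealed toward a step function over training. The snippet below is a minimal sketch of that idea, not the cited method; the steepness schedule and names are assumptions for illustration.

```python
# Sketch: an analog activation that approaches a hard-limiting threshold as
# the steepness parameter beta grows during training.
import numpy as np

def steepened_sigmoid(z, beta):
    """Sigmoid with slope beta; tends to a step function as beta -> infinity."""
    return 1.0 / (1.0 + np.exp(-beta * z))

# Simple geometric schedule: raise beta each epoch so late-stage training
# behaves almost like a network of hard-limiting units.
z = np.array([-0.5, 0.0, 0.5])
for epoch, beta in enumerate(np.geomspace(1.0, 50.0, num=5)):
    print(epoch, beta, steepened_sigmoid(z, beta))
```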
“…Various modifications of the gradient descent have been presented to train NDAs [1,3,4,6,14,17]. However, these methods require, to a degree that depends on the method, that the learning task be static.…”
Section: Introduction (mentioning)
confidence: 99%