An Optoelectronic Neural Network Packet Switch Scheduler K. J. Symington, A. J. Waddie, T. Yasue, M....
-
Upload
juniper-wright -
Category
Documents
-
view
216 -
download
1
Transcript of An Optoelectronic Neural Network Packet Switch Scheduler K. J. Symington, A. J. Waddie, T. Yasue, M....
An OptoelectronicNeural Network Packet
Switch Scheduler
K. J. Symington, A. J. Waddie, T. Yasue,M. R. Taghizadeh and J. F. Snowdon.
http://www.optical-computing.co.uk
Outline• Packet switch scheduler.
• Previous demonstrator has proven system feasibility.
• Current demonstrator enhances functionality and performance.
• Motivation.
• Implementation and scalability.
• Conclusions.
The Assignment Problem
Solution is computationally intensive.Neural networks are capable of solving the assignment problem.Their inherent parallelism allows them to outperform any other known method at higher orders.
Can be found in situations such as:• Network service management.• Distributed computer systems.• Work management systems.• General scheduling, control or resource
allocation.
Crossbar Switching
A size N crossbar switch has the same number of inputs as outputs: i.e. m=n=N.
Crossbar Switching
Crossbar Switching
• Packets stored in buffer until output free.
• Packets can request any output line.
• Buffer depth very important.
• Real traffic tends to be ‘bursty’.
Crossbar Switching
• Channel operation exclusive.
• Maximum capacity of N packets per switch cycle.
Crossbar Switching
• Packet can only pass when crosspoint set.
• N2 crosspoint switches required.
• Generic crossbar switch architecture.
Crossbar Switching• Neural network chooses optimal set of packets.
• One neuron required for every crosspoint.
Crossbar Switching
Banyan Switching
• Routing input 2 to output 2 allows only 1 packet to pass.Solution is sub-optimal.
Solution Optimality
24
2
• Routing input 2 to output 2 allows only 1 packet to pass.Solution is sub-optimal.
• Routing input 2 to output 4 and input 4 to output 2 allows 2 packets to pass. Solution is optimal.
The Neuron• Inputs taken from the outputs of
other neurons.
The Neuron• Inputs taken from the outputs of
other neurons.
• Synaptic weights multiply inputs.
The Neuron• Inputs taken from the outputs of
other neurons.
• Synaptic weights multiply inputs.
• Inputs are summed and bias added.
The Neuron• Inputs taken from the outputs of
other neurons.
• Synaptic weights multiply inputs.
• Inputs are summed and bias added.
• Transfer function f(x) performed before output.
Neural Algorithm
2
CyByAt1txtx
n
ikkj
n
jkikijij
ijxijij
e1
1xfy
xij: Summation of all the inputs to the neuron referenced by ij: including the bias.
yij: Output of neuron ij.
A, B and C: Optimisation parameters.
‘Iterations to Convergence’ is an important parameter.
Iterations related to, but not necessarily equal to, time.
: Controls gain of neuron.
Next state defined by:
Neural transfer function:
Neural Interconnect
Convergence Example
Start state – all requested neurons are on.
Convergence Example
1/3 Evolved: Neurons (2, 4) and (4, 2) are beginning to inhibiting neuron (2, 2).
Convergence Example
2/3 Evolved: Neuron (2, 2) is nearly off.
Convergence Example
Fully Evolved. Optimal solution reached.
• Neural network scalability limited in silicon.
• Optoelectronics allows scaleable networks.
• Free-space optics can be used to perform interconnection.
• Only transfer function f(x) need be performed in silicon.
• Input summation is done in an inherently analogue manner.
• Noise added naturally.
Why Optoelectronics?
The VCSEL Array• Optical output element.
• A laser that emits from the surface of the substrate.
• High optical output powers.
The VCSEL Array• Each neuron has one VCSEL for
optical output.
• Performance: Capable of >1GHz operation.
• Scalability: Currently N=16.
Detector Arrays• Optical input element.
• Available in a wide range off the shelf.
• Performance: >1GHz.
• Caveat: faster detectors require more power.
Diffractive Optic Elements (DOEs)
• Large fan-outpossible.
• Efficiency:~50-60%.
• Non-uniformity:<3%.
• Period Size:90µm.
These elements are used as array generators and interconnection elements.
Crossbar switch interconnect.Banyan switch interconnect.
Optical Interconnect
DOE interconnect is space invariant.
VCSEL Array
Lens Ø10mm f=100mm-150mm
DOE Ø15mm Period 90µm
Photodetector Array
Optical System
First Generation System
• Constructed using discrete components.
• Lacked ability to prioritise packets: can lead to channel saturation.
• Uses similar optical system (~330mm).
Current System
• System uses 4×40MHz Texas Instruments 320C5x DSPs.
• DSPs perform transfer function.
• Transfer function fully programmable.
• Reduction of hardware by digital thresholding.
System Scalability
Digital vs. Analogue
Analogue: Optimal ~97%. Digital: Optimal ~91%.
Crossbar Switch ResultsHistogram of packets routed successfully in a crossbar switch.
Banyan Switch ResultsHistogram of packets routed successfully in a banyan switch.
Mean Packet Delay
Mean Packet Delay
• ISLIP4 cannot be implemented larger than N=16.
Mean Packet Delay
• ISLIP4 cannot be implemented larger than N=16.
3 Major effects to consider:• Active effects: <1Hz thermal changes and component
creep.• Static effects: Tolerances in fabricated components
could lead to misalignment in final system.• Adaptive effects: Vibrational effects >1Hz - e.g.
10kHz.Solutions:• Measurement and correction of focusing and positional
error in real time (active optic alignment or adaptive optics).
• Commercially viable: e.g. personal CD player, ASDA £22:95.
• Pre-packaged, pre-aligned modules.
Engineering Issues
Encapsulated System
R. Stone, J. Kim and P. Guilfoyle,“High Performance Shock Hardened Optoelectronic Communications Module”, OC2001, Lake Tahoe,pp. 105-107.
Conclusions• Performance of 100MHz feasible, 1GHz foreseeable.• Scalability mainly limited by VCSEL array size (N=16).• Scalability independent of number of inputs/outputs (N).• A digital system running at 1GHz could supply 2.5 million
switch configurations per second.• Second generation builds on first in that it supports
prioritisation.• What good is a truck without a steering wheel?• Further work:
• Smart pixel implementation and packaging.• Examination of QoS provided by scheduler.• FPGA or custom ASIC implementation using optical interconnects.• Novel neural algorithms and learning.