Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms...

38
Fast Convergence Techniques Sumon Ahmed Sabir [email protected] MPLS Workshop BDNOG 7, Dhaka

Transcript of Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms...

Page 1: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

FastConvergenceTechniques

[email protected]

MPLSWorkshopBDNOG7,Dhaka

Page 2: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

NeedforFastConvergence

Itsnotonlybrowsing,mailandwatchingvideosanymore.InternetandNetworkscarryingVoice/Video calls.Carryingbusinessandmissioncriticaldata.

Nooptionforoutageorinterruption.

Page 3: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

NeedforFastConvergence

FewyearsbeforeinEthernetnetworkConvergencetimewasabout2minutes.

AtpresentittakesfewsecondswithoutanyfastconvergencetechniquesappliedinInterfaceandprotocolconfiguration.

Butmanycriticalservicesdemand<50msconvergencetimeinacarriergradenetwork.

Page 4: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

DesignConsideration

• NetworkTopology• IPPlanning• IGPFineTuning• ScalingBGP• TypeofServiceDelivery

Page 5: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

NetworkTopology:BadExample

Page 6: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

NetworkTopology:BetterExample

Page 7: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

BetterIPPlanBetterConvergence

• Domain/AreaBasedIPPlanmustbetakingplacetominimizetheprefixes

• PrefixSummeryorAreasummeryisveryeffectivetoaggregateindividualsmallprefixeswithintheArea

Page 8: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

IGPFastConvergence

• FailureDetection• EventPropagation• SPFRun• RIBFIBUpdate

• Timetodetectthenetworkfailure,e.g.interfacedowncondition.

• Timetopropagatetheevent,i.e.floodtheLSAacrossthetopology.

• TimetoperformSPFcalculationsonallroutersuponreceptionofthenewinformation.

• Timetoupdatetheforwardingtablesforallroutersinthearea.

Page 9: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

PurgingtheRIBonlinkfailure

• RoutingprotocolsaremoreefficientthanRIBprocessindetectinglinkfailuretodeletetheassociatenext-hoproutesofthefailedinterface.Enablingthisfeaturereducesconvergencetimesignificantlyspeciallyincaseofalargeroutingtable.

ip routing protocol purge interface

Page 10: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

LinkFailureDetectionProcess

Hereisfewmethodstodetectthelinkfailure1. IGPkeepalive times/fasthelloswiththedead/holdinterval

ofonesecondandsub-secondhellointervals.ItisCPUhungry

2. carrier-delaymsec 0,PhysicalLayer3. BFD,OpenStandardmorereliableratherthanIGP

Keepalive fasthello

Page 11: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

LinkFailureDetection

• SetCarrier-delay to0mstochangethelink stateinstantly.Ifyou areusing any other transportserviceslike SDHorDWDMsetthevalueaccording toyourtransportnetwork

int gi0/0/1carrier-delay msec0

Page 12: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

LinkFailureDetection

• Enable BFDtonotify routing protocols aboutthelinkfailure insub secondinterval.Without BFDit will takeat least1second

int gi0/0/1ip ospf bfdbfd interval 50 min_rx 50 multiplier 3

Page 13: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

LinkFailureDetection

• InEthernetinterface,ISIS/OSPFwillattempttoelectaDIS/DRwhenitformsanadjacency– Asitisrunningasapoint-to-pointlink,configuringISIS/OSPFtooperatein"point-to-pointmode”reduceslinkfailuredetectiontime

int gi0/0/1isis network point-to-point

int gi0/0/1ip ospf network point-to-point

Page 14: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

SPFCalculation

• TheuseofIncrementalSPF(iSPF)allowstofurtherminimizetheamountofcalculationsneededwhenpartialchangesoccurinthenetwork

• Needtoenableispf underospf/isis process

router ospf 10ispf

Page 15: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

SetOverloadbit

• WaituntiliBGP isrunningbeforeprovidingtransitpathrouter isis isp

set-overload-bit on-startup wait-for-bgp

router ospf 10max-metric router-lsa on-startup wait-

for-bgp• Avoidsblackholing trafficonrouterrestart• CausesOSPF/ISIStoannounceitsprefixeswithhighestpossiblemetricuntiliBGP isupandrunning

Page 16: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

NonStopForwarding

• CiscoNSFwithSSOorJuniperNonStopActiveRoutingforsystemswithdualrouteprocessorallowsarouterthathasexperiencedahardwareofsoftwarefailureofanactiverouteprocessortomaintaindatalinklayerconnectionsandtocontinueforwardingpacketsduringtheswitchovertothestandbyrouteprocessor

Page 17: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

EventPropagation

After LinkDownEvent Remarks Command

LSAgenerationdelay timersthrottle lsa initialholdmax_wait

timersthrottle lsa 0201000

LSAreceptiondelay ThisdelayisasumoftheingressqueuingdelayandLSAarrivaldelay

timerspacingretransmission100

ProcessingDelay timerspacingflood(ms)withthedefaultvalueof55ms

timerspacingflood15

PacketPropagationDelay 12usecfor1500bytespacketovera1Gbpslink

N/A

Page 18: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

RIB/FIBUpdate

Link/NodeDown

SPFCalculatio

n

RIBUpdate

FIBUpdate

Communication

LesserNumberofPrefixeslessertimetoconvergetheRIBandFIB

Page 19: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

RIB/FIBUpdate

• AftercompletingSPFcomputation,OSPF/ISISperformssequentialRIBupdatetoreflectthechangedtopology.TheRIBupdatesarefurtherpropagatedtotheFIBtable

• TheRIB/FIBupdateprocessmaycontributethemosttotheconvergencetimeinthetopologieswithlargeamountofprefixes,e.g.thousandsortensofthousands

• Platformwhatyouareusing,highercapacityCPUandRAMwillcaterbetterperformance.

Page 20: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

ConfigurationTemplate

router ospf 10max-metric router-lsa on-startup wait-for-bgptimers lsa arrival 50 timers throttle lsa all 10 100 1000 timers throttle spf 10 100 1000 timers pacing flood 5 timers pacing retransmission 60 ispfbfd all interfaces

Page 21: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

ConfigurationTemplate

router isis ISPset-overload-bit on-startup wait-for-bgpspf-interval 5 1 20lsp-gen-interval 5 1 20prc-interval 5 1 20fast-flood 10bfd all-interfacesispf level-1-2 60

Page 22: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

FinalCalculationEvent Time(ms) Remarks

FailureDetectionDelay:Carrier-delaymsec 0 0 about5-10msworstcasetodetect

In BFDCase 150 Multiplayer3 islastcount:50msinterval

MaximumSPFruntime 64 doubling forsafetymakesit64ms

MaximumRIBupdate 20 doubling forsafetymakesit20ms

OSPFinterfacefloodpacing timer 5 doesnotapply totheinitial LSAflooded

LSAGenerationInitialDelay 10 enough todetectmultiple linkfailuresresultingfromSRLGfailure

SPFInitialDelay 10 enough toholdSPFtoallowtwoconsecutiveLSAstobeflooded

Networkgeographicalsize/PhysicalMedia(Fiber) 0 signalpropagation isnegligible

FinalFIBUPDATETime:Maximum500ms.Itissub-secondconvergence

Page 23: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

BeyondSubsecondConvergence

Butifyouneed<50ms Convergencetime,Needtodomore…….

i. RSVPBasedlink/nodeprotectionrouteii. LDPBasedLFA-FRR

Page 24: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

50-msConvergence:Dowereallyneedthis?

• Mostoftheapplicationsandservicesweareusingtodayarefinewithsubsecond(500ms)convergence.

• Fewapplicationslikestocktrading,mobilephonerecharge,fewotherpoorlywrittenappspeopleusingasksfor50msconvergence.

• L2CircuitemulationoverIPsometimesbreaksover100ms

• http://www.ethernetacademy.net/Ethernet-Academy-Articles/putting-50-milliseconds-in-perspective

Page 25: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

LFA-FRR

• Providelocalsub-100msconvergencetimesandcomplementanyotherfastconvergencetuningtechniquesthathavebeenemployed

• LFA-FRRiseasilyconfiguredonarouterbyasinglecommand,calculateseverythingautomatically

• EasyandlessercomplexthanRSVPBasedTrafficEngineering.

Page 26: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

Prerequisite

• NeedMPLSLDPConfiguration

• NeedBFDConfigurationtotriggerFastReroute

• NeedsomeFastRerouteconfigurationunderOSPFProcess

• Needsomespecialconfigurationbasedonplatform

mpls ldp discovery targeted-hello accept

router ospf Yrouter-id xxxxxispfprefix-priority high route-map

TE_PREFIXfast-reroute per-prefix enable area y prefix-priority high

fast-reroute per-prefix remote-lfa tunnel mpls-ldp

ip prefix-list TE_PREFIX seq 5 permit a.b.c.d/32

!route-map TE_PREFIX permit 10match ip address prefix-list TE_PREFIX

Page 27: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

Howitworks1. Initiallybestpathfortheprefix172.16.1.0/24isB-A-B1-B32. OncethelinkfailsbetweenB-AthenpriorcomputedLFATunnelTriggeredbyBFD3. ImmediateTargetPrefix(es)arepassedthroughB-DLFATunnel4. PackdropdoesnotobservebecauseBrouterdoesnotwaitforIGPconvergence

Page 28: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

LFA-FRRDesignConsideration

• InaRingTopology• LesserPrefixmakequickerconvergence• SpecificPrefixwithhigherprioritywillshowbestperformancewithoutanyserviceinterruptionandpacketdrop.

ROBI39-DHKTL25#sh ip int briefLoopback1 10.253.51.91 YES NVRAM up upMPLS-Remote-Lfa124 10.10.202.69 YES unset up up

show ip cef 10.255.255.2910.255.255.29/32nexthop 10.10.202.65 Vlan10 label [166|1209]

repair: attached-nexthop 10.253.51.94 MPLS-Remote-Lfa124

Page 29: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

Before/AfterLFAFRRXshell:\> ping 10.252.51.111 –tReply from 10.252.51.111: bytes=32 time=2ms TTL=253Reply from 10.252.51.111: bytes=32 time=4ms TTL=253Reply from 10.252.51.111: bytes=32 time=2ms TTL=253Reply from 10.252.51.111: bytes=32 time=2ms TTL=253Request timed out.Reply from 10.252.51.111: bytes=32 time=61ms TTL=253Reply from 10.252.51.111: bytes=32 time=86ms TTL=253Reply from 10.252.51.111: bytes=32 time=70ms TTL=253Reply from 10.252.51.111: bytes=32 time=147ms TTL=253

Reply from 10.252.51.111: bytes=32 time=2ms TTL=253Reply from 10.252.51.111: bytes=32 time=2ms TTL=253Reply from 10.252.51.111: bytes=32 time=1ms TTL=253Reply from 10.252.51.111: bytes=32 time=1ms TTL=253Reply from 10.252.51.111: bytes=32 time=27ms TTL=253Reply from 10.252.51.111: bytes=32 time=32ms TTL=253Reply from 10.252.51.111: bytes=32 time=1ms TTL=253Reply from 10.252.51.111: bytes=32 time=2ms TTL=253Reply from 10.252.51.111: bytes=32 time=2ms TTL=253Reply from 10.252.51.111: bytes=32 time=1ms TTL=253

Page 30: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

BGPFastConvergence

LFA-FRRorRSVPcanimproveL2-VPNandIntra-ASConvergencebutcan’tdomuchforExternalprefixeslearnviaEBGP

Page 31: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

BGPFastConvergence

TheBGPPICEdgeforIPandMPLS-VPNfeatureimprovesBGPconvergenceonceanetworkfailure.

Page 32: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

Prerequisites

• BGPandtheIPorMultiprotocolLabelSwitching(MPLS)networkisupandrunningwiththecustomersiteconnectedtotheprovidersitebymorethanonepath(multihomed).

• Ensurethatthebackup/alternatepathhasauniquenexthopthatisnotthesameasthenexthopofthebestpath.

• EnabletheBidirectionalForwardingDetection(BFD)protocoltoquicklydetectlinkfailuresofdirectlyconnectedneighbors.

Page 33: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

HowToWork:PE-CELink/PEFailure

• eBGP sessionsexistbetweenthePEandCErouters.• TrafficfromCE1usesPE1toreachnetworkx.x.x.x/24towardstherouterCE2.CE1has

twopaths:• PE1astheprimarypathandPE2asthebackup/alternatepath.• CE1isconfiguredwiththeBGPPICfeature.BGPcomputesPE1asthebestpathandPE2

asthebackup/alternatepathandinstallsbothroutesintotheRIBandCEFplane.WhentheCE1-PE1link/PEgoesdown,CEFdetectsthelinkfailureandpointstheforwardingobjecttothebackup/alternatepath.TrafficisquicklyreroutedduetolocalfastconvergenceinCEF.

Page 34: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

HowtoWork:DualCE-PELine/NodeFailure

• eBGP sessionsexistbetweenthePEandCErouters.TrafficfromCE1usesPE1toreachnetworkx.x.x.x/24throughrouterCE3.

• CE1hastwopaths:PE1astheprimarypathandPE2asthebackup/alternatepath.• AniBGP sessionexistsbetweentheCE1andCE2routers.• IftheCE1-PE1linkorPE1goesdownandBGPPICisenabledonCE1,BGPrecomputes thebestpath,

removingthenexthopPE1fromRIBandreinstalling CE2asthenexthopintotheRIBandCiscoExpressForwarding.CE1automaticallygetsabackup/alternaterepairpathintoCiscoExpressForwardingandthetrafficlossduringforwardingisnowinsubseconds,therebyachieving fastconvergence.

Page 35: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

HowtoWork:IPMPLSPEDown

• ThePEroutersareVPNv4iBGP peerswithreflect routersintheMPLSnetwork.• TrafficfromCE1usesPE1toreachnetworkx.x.x.x/24towardsrouterCE3.CE3isdual-homedwith

PE3andPE4.PE1hastwopathstoreachCE3fromthereflect routers:PE4istheprimarypathwiththenexthopasaPE4address.

• PE3isthebackup/alternatepathwiththenexthopasaPE3address.• WhenPE4goesdown,PE1knowsabouttheremovalofthehostprefixbyIGPsinsubseconds,

recomputes thebestpath,selectsPE3asthebestpath,andinstallstheroutesintotheRIBandCiscoExpressForwardingplane.NormalBGPconvergencewillhappenwhileBGPPICisredirecting thetraffictowardsPE3,andpacketsarenotlost.

Page 36: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

ConfigurationTemplaterouter bgp 65000no synchronization

neighbor 10.0.0.10 remote-as 65000neighbor 10.0.0.10 update-source Loopback0

no auto-summary!address-family vpnv4bgp additional-paths installneighbor 10.0.0.10 activateneighbor 10.0.0.10 send-community both

exit-address-family!address-family ipv4 vrf abcimport path selection allneighbor 10.10.10.20 remote-as 65534neighbor 10.10.10.20 activateexit-address-family

Page 37: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

ConclusionIGPFinetuning

100%DynamicandsimplifiedcanreachsubsecondconvergencetimeLFA-FRR

LFATunnelPre-computed,pre-installedPrefix-independentSimple,deploymentfriendly,goodscalingCanreach<50ms convergencetimesuitableforIntra-ASandL2-VPNtrafficBut

TopologydependantIPFRRIGPcomputation isveryCPU-intensivetask

BGPPICCanachieve<50ms convergencetimeforInter-ASandL3-VPNtraffic

Page 38: Fast Convergence techniques-APNIC-42wiki.bdnog.org/lib/exe/fetch.php/bdnog7/fast... · 50-ms Convergence: Do we really need this? • Most of the applications and services we are

ThankYou