Register a SA Forums Account here!
JOINING THE SA FORUMS WILL REMOVE THIS BIG AD, THE ANNOYING UNDERLINED ADS, AND STUPID INTERSTITIAL ADS!!!

You can: log in, read the tech support FAQ, or request your lost password. This dumb message (and those ads) will appear on every screen until you register! Get rid of this crap by registering your own SA Forums Account and joining roughly 150,000 Goons, for the one-time price of $9.95! We charge money because it costs us money per month for bills, and since we don't believe in showing ads to our users, we try to make the money back through forum registrations.
 
  • Post
  • Reply
jwh
Jun 12, 2002

Be careful with layer seven health checks: it's not uncommon for them to drive more load to your platform than regular user activity.

Which is funny.

Adbot
ADBOT LOVES YOU

ruro
Apr 30, 2003

jwh posted:

Be careful with layer seven health checks: it's not uncommon for them to drive more load to your platform than regular user activity.

Which is funny.
It's funny if you run them as often as you'd run a ping probe :)

jwh
Jun 12, 2002

ruro posted:

It's funny if you run them as often as you'd run a ping probe :)

I've worked in several environments where management had wanted subsecond failover of real servers in a pool, based on failed transactions to the platform that included backend components.

I don't recommend it.

Also, this leads me to an important piece of experience that I've learned in my career, maybe the most important: never, ever, throw a jump ball to management.

Do not, for example, say, "how quickly do we need to failover?" Say, instead, "we can detect failover at one second, two seconds, or five seconds. What would you like to do?" It's the illusion of choice that satisfies their need to manage, and it gets you out of the problem of having to live with their bad (and open ended) decisions.

Docjowles
Apr 9, 2009

ragzilla posted:

Can you expand on your issues with the C servers? We're looking at adding some for local storage applications.

The biggie was some fuckery with the RAID controllers. I wasn't the main guy working on this but if you really want I can ask the guy who was and get you explicit details. But basically you'd provision the box in UCS manager with a RAID config and everything would boot up fine. Then sometimes the disk IO performance would go to poo poo despite the management software not showing any alerts or issues. Reboot box, suddenly the RAID config is gone along with everything on the disks and you get to start over :haw: This persisted across a bunch of hardware swaps.

The C series boxes are in production now so I assume that's been resolved (since they're acting as database servers, not some stateless throwaway box) but like I said I wasn't personally handling that issue.

ate shit on live tv
Feb 15, 2004

by Azathoth

jwh posted:

I've worked in several environments where management had wanted subsecond failover of real servers in a pool, based on failed transactions to the platform that included backend components.

I don't recommend it.

Also, this leads me to an important piece of experience that I've learned in my career, maybe the most important: never, ever, throw a jump ball to management.

Do not, for example, say, "how quickly do we need to failover?" Say, instead, "we can detect failover at one second, two seconds, or five seconds. What would you like to do?" It's the illusion of choice that satisfies their need to manage, and it gets you out of the problem of having to live with their bad (and open ended) decisions.

Also don't be afraid to say "No" to unreasonable, or unrealistic demands.

Confused_Donkey
Mar 16, 2003
...
Picked up a pair of 6509-E's equipped with dual WS-SUP720-3B Supervisors. Been trying to find a compatibility matrix for IOS versions (ie supported). I've seen one in the past but for the life of me cannot find it. If anyone could toss a link my direction I would greatly appreciate it!

ruro
Apr 30, 2003

jwh posted:

I've worked in several environments where management had wanted subsecond failover of real servers in a pool, based on failed transactions to the platform that included backend components.

I don't recommend it.
Hah. We have a few apps where it was required but they were happy enough with 30 second fail over as they understood the potential problems.

falz
Jan 29, 2005

01100110 01100001 01101100 01111010

Confused_Donkey posted:

Picked up a pair of 6509-E's equipped with dual WS-SUP720-3B Supervisors. Been trying to find a compatibility matrix for IOS versions (ie supported). I've seen one in the past but for the life of me cannot find it. If anyone could toss a link my direction I would greatly appreciate it!

Google Cisco feature navigator. When in it search by platform and choose that supervisor. It will show compatible IOS versions with that sup and you can compare features between different versions.

evol262
Nov 30, 2010
#!/usr/bin/perl

Docjowles posted:

Yeah it's not a networking product per se, though it has a networking component. It's basically compute + networking + storage in a box, designed to scale out massively but be centrally managed from one interface. My company's starting to roll them out in a limited fashion. The B-series are a blade chassis, and the C-series are a traditional rack mount form factor.

FWIW we have had no loving end of problems with the C series but the B's have been great.

Don't forget to tape across the front of the chassis for the Bs. Only half joking. They used to vibrate themselves out, and I don't know if it's still a problem, but better safe than sorry.

Filthy Lucre
Feb 27, 2006
Can someone help me figure out why the gently caress my route-map ROUTE_MAP_FROM_HURRICANE isn't working?
pre:
#show run | i neighbor 184.105.249.5
 neighbor 184.105.249.5 remote-as 6939
 neighbor 184.105.249.5 description Hurricane Electric eBGP Session
 neighbor 184.105.249.5 update-source GigabitEthernet0/0/7.1127
 neighbor 184.105.249.5 version 4
 neighbor 184.105.249.5 next-hop-self
 neighbor 184.105.249.5 route-map ROUTE_MAP_FROM_HURRICANE in
 neighbor 184.105.249.5 route-map ROUTE_MAP_TO_HURRICANE out
 
#show run | b route-map ROUTE_MAP_FROM_HURRICANE permit
route-map ROUTE_MAP_FROM_HURRICANE permit 10
 match ip address prefix-list PREFIX_LIST_AS6939
 set local-preference 150
 set weight 100

#show run | i ip prefix-list PREFIX_LIST_AS6939
ip prefix-list PREFIX_LIST_AS6939 seq 10 permit 1.0.20.0/23

#show ip bgp 1.0.20.0
BGP routing table entry for 1.0.20.0/23, version 2957089
Paths: (2 available, best #1, table default)
Multipath: eBGP iBGP
  Advertised to update-groups:
     4
  Refresh Epoch 1
  174 2914 2519, (received & used)
    38.104.102.10 (metric 2) from 24.36.32.66 (24.36.32.66)
      Origin IGP, metric 51061, localpref 100, valid, internal, best
  Refresh Epoch 1
  6939 2519, (received-only)
    184.105.249.5 from 184.105.249.5 (216.218.252.168)
      Origin IGP, localpref 100, valid, external
184.105.249.5 is my upstream connection to Hurricane Electric on this router.
38.104.102.10 is my upstream connection to Cogent from one of my other routers (24.36.32.66).

I've stripped down the route-map and the prefix-list trying to get it to work. I can't figure out why it's not setting the local pref for 1.0.20.0/23 to 150.

H.R. Paperstacks
May 1, 2006

This is America
My president is black
and my Lambo is blue

Filthy Lucre posted:

Can someone help me figure out why the gently caress my route-map ROUTE_MAP_FROM_HURRICANE isn't working?
pre:
#show run | i neighbor 184.105.249.5
 neighbor 184.105.249.5 remote-as 6939
 neighbor 184.105.249.5 description Hurricane Electric eBGP Session
 neighbor 184.105.249.5 update-source GigabitEthernet0/0/7.1127
 neighbor 184.105.249.5 version 4
 neighbor 184.105.249.5 next-hop-self
 neighbor 184.105.249.5 route-map ROUTE_MAP_FROM_HURRICANE in
 neighbor 184.105.249.5 route-map ROUTE_MAP_TO_HURRICANE out
 
#show run | b route-map ROUTE_MAP_FROM_HURRICANE permit
route-map ROUTE_MAP_FROM_HURRICANE permit 10
 match ip address prefix-list PREFIX_LIST_AS6939
 set local-preference 150
 set weight 100

#show run | i ip prefix-list PREFIX_LIST_AS6939
ip prefix-list PREFIX_LIST_AS6939 seq 10 permit 1.0.20.0/23

#show ip bgp 1.0.20.0
BGP routing table entry for 1.0.20.0/23, version 2957089
Paths: (2 available, best #1, table default)
Multipath: eBGP iBGP
  Advertised to update-groups:
     4
  Refresh Epoch 1
  174 2914 2519, (received & used)
    38.104.102.10 (metric 2) from 24.36.32.66 (24.36.32.66)
      Origin IGP, metric 51061, localpref 100, valid, internal, best
  Refresh Epoch 1
  6939 2519, (received-only)
    184.105.249.5 from 184.105.249.5 (216.218.252.168)
      Origin IGP, localpref 100, valid, external
184.105.249.5 is my upstream connection to Hurricane Electric on this router.
38.104.102.10 is my upstream connection to Cogent from one of my other routers (24.36.32.66).

I've stripped down the route-map and the prefix-list trying to get it to work. I can't figure out why it's not setting the local pref for 1.0.20.0/23 to 150.

nvm

H.R. Paperstacks fucked around with this message at 17:49 on Sep 13, 2013

Agrikk
Oct 17, 2003

Take care with that! We have not fully ascertained its function, and the ticking is accelerating.

jwh posted:

Be careful with layer seven health checks: it's not uncommon for them to drive more load to your platform than regular user activity.

Which is funny.

I'm with you on this. The URL landing page for each of the seven applications is basically a poke of the back end database (I think it just calls getdate() or something similarly lightweight) to indicate that the URL is working and the database is responsive.

The original call was for a join statement to test the availability of a few tables, but that was quickly shot down as being too heavy a transaction for a probe...

Docjowles
Apr 9, 2009

Agrikk posted:

The original call was for a join statement to test the availability of a few tables, but that was quickly shot down as being too heavy a transaction for a probe...

At one point we noticed that our monitoring software was running a non-trivial query against the production DB many thousands of times per day, despite being configured to run every 5 minutes or something. Turned out collectd had a bug where the setting to specify how often to poll just didn't work and defaulted to every 10 seconds :haw: That was a fun discovery.

tortilla_chip
Jun 13, 2007

k-partite

This won't have an immediate effect. Do a soft clear inbound on the HE peer.

Filthy Lucre
Feb 27, 2006
I've gone home for the day (been awake for 30+ hours at this point), but I've done several soft and a few hard clears. I'll grep the logs when I go back to the office for an exact count.

The issue is more of a personal challenge at this point. Traffic is coming in and going out, so people are getting their porn, it's just not necessarily the way I want it.

Confused_Donkey
Mar 16, 2003
...

falz posted:

Google Cisco feature navigator. When in it search by platform and choose that supervisor. It will show compatible IOS versions with that sup and you can compare features between different versions.

I cannot believe I didn't see that when I searched. That helped me narrow it down, thanks!

dotster
Aug 28, 2013

Docjowles posted:

The biggie was some fuckery with the RAID controllers. I wasn't the main guy working on this but if you really want I can ask the guy who was and get you explicit details. But basically you'd provision the box in UCS manager with a RAID config and everything would boot up fine. Then sometimes the disk IO performance would go to poo poo despite the management software not showing any alerts or issues. Reboot box, suddenly the RAID config is gone along with everything on the disks and you get to start over :haw: This persisted across a bunch of hardware swaps.

The C series boxes are in production now so I assume that's been resolved (since they're acting as database servers, not some stateless throwaway box) but like I said I wasn't personally handling that issue.

What RAID controller were they running? I have run the mez card or PCI controllers but those are just LSI.

abigserve
Sep 13, 2009

this is a better avatar than what I had before

Filthy Lucre posted:

I've gone home for the day (been awake for 30+ hours at this point), but I've done several soft and a few hard clears. I'll grep the logs when I go back to the office for an exact count.

The issue is more of a personal challenge at this point. Traffic is coming in and going out, so people are getting their porn, it's just not necessarily the way I want it.

You're filtering that route out from your eBGP neighbor somewhere, that's what "received-only" means, and this is probably why it's not modifying those attributes.

ruro
Apr 30, 2003

My kingdom, such as it is, for anyone who can tell me how to configure ingress queuing on an WS-X6816-10T-2T line card using c3pl on a 6500/Sup2T running 15.0(1)SY1 advanced enterprise... :(.

Docjowles
Apr 9, 2009

dotster posted:

What RAID controller were they running? I have run the mez card or PCI controllers but those are just LSI.

Looks like LSI MegaRaid 9265-8i.

The Third Man
Nov 5, 2005

I know how much you like ponies so I got you a ponies avatar bro
I hope this isn't to general a question for this thread, but if I wanted to learn more about the different types of VPNs where would be a good starting point? If it helps, I'm mostly interested in access VPNs and how they are implemented/their security.

Count Thrashula
Jun 1, 2003

Death is nothing compared to vindication.
Buglord

The Third Man posted:

I hope this isn't to general a question for this thread, but if I wanted to learn more about the different types of VPNs where would be a good starting point? If it helps, I'm mostly interested in access VPNs and how they are implemented/their security.

If you already have a networking background or a CCNA or something, I'd get some study books on the Cisco CCNA Security exam, they tackle those concepts.

But "VPN" is a pretty broad topic, that would obviously just deal with Cisco's implementations.

dotster
Aug 28, 2013

Docjowles posted:

Looks like LSI MegaRaid 9265-8i.

Huh, must have been some early BIOS and RAID card firmware bugs or something. I have had good luck with those and the 9266 cards, I have even done some testing with LSI cachecade and been pretty pleased with it on a big SATA array. Anyway, sounds like they got it sorted.

Filthy Lucre
Feb 27, 2006
Follow up on my BGP issues;

From my ssh logs, multiple bgp resets;
pre:
C:\PuttyLogs>grep -ic "clear ip bgp 184.105.249.5 soft" 24.36.32.66-20130913.log
10

C:\PuttyLogs>grep -ic "clear ip bgp 184.105.249.5$" 24.36.32.66-20130913.log
2
To make things weirder, no one has been in the router and the BGP session hasn't reset since I left on Friday, but the route-map is now applying properly.

I don't loving know.

Gap In The Tooth
Aug 16, 2004

Filthy Lucre posted:


To make things weirder, no one has been in the router and the BGP session hasn't reset since I left on Friday, but the route-map is now applying properly.

I don't loving know.

This happened to me the other day with an IPIP tunnel. Three days of it not working then suddenly sh ip int br saying up up on both ends with no config changes.

Count Thrashula
Jun 1, 2003

Death is nothing compared to vindication.
Buglord
Can someone help summarize how multipoint frame relay works?

In GNS3, I have a frame relay switch set up like so (just an example)

Port 1: DLCI 102 -> DLCI 201
Port 2: DLCI 103 -> DLCI 301
Port 3: DLCI 104 -> DLCI 401

R1: s0/0.102 to port 1 on switch, s0/1.103 to port 2, s0/2.104 to port 3
R2: s0/0.201 to port 1 on switch
R3: s0/0.301 to port 2 on switch
R4: s0/0.401 to port 3 on switch

What do I need to make sure I do on the routers?

So far I've tried:

int s0/0
no ip address
no sh
enc frame-relay
!
int s0/0.102 multipoint
ip address 10.102.1.1 255.255.255.0
frame-relay map ip 10.102.1.1 102 broadcast
frame-relay interface-dlci 102

And the accompanying configurations for other ports/DLCIs. I'm dumb when it comes to frame relay apparently, maybe I'm just not understanding it correctly.

Gap In The Tooth
Aug 16, 2004
What does GNS3 use for LMI type? Make sure that is set correctly.

Your frame relay map command should have a remote IP (the other end of your circuit) and the local DLCI.

Badgerpoo
Oct 12, 2010
Having a strange issue with a bunch of wism2s I've inherited:

I have two 6509s with sup720 in a VSS pair. Each has 3 wism2 modules

code:
nnr0#show wism statu

Service Vlan : 604, Service IP Subnet : 172.17.120.250/255.255.255.0
      WLAN
Slot  Controller  Service IP       Management IP    SW Version    Controller Type    Status
----+-----------+----------------+----------------+------------+------------------+---------------
17    1           172.17.120.21    172.17.107.101   7.4.110.0    WS-SVC-WISM-2-K9   Oper-Up
18    1           172.17.120.36    172.17.107.103   7.4.110.0    WS-SVC-WISM-2-K9   Oper-Up
20    1           172.17.120.37    172.17.107.105   7.4.110.0    WS-SVC-WISM-2-K9   Oper-Up
33    1           172.17.120.39    172.17.107.102   7.4.110.0    WS-SVC-WISM-2-K9   Oper-Up
34    1           172.17.120.22    172.17.107.104   7.4.110.0    WS-SVC-WISM-2-K9   Oper-Up
36    1           172.17.120.20    172.17.107.106   7.4.110.0    WS-SVC-WISM-2-K9   Oper-Up
When I try to console in via the 6k it doesn't work. on the first five in the list above I get:
code:
nnr0#session switch 2 slot 1 processor 1 
The default escape character is Ctrl-^, then x.
You can also type 'exit' at the remote prompt to end the session
Trying 172.17.120.39 ... 
% Connection refused by remote host
But on slot 36 I get:
code:
nnr0#session switch 2 slot 4 processor 1
The default escape character is Ctrl-^, then x.
You can also type 'exit' at the remote prompt to end the session
Trying 172.17.120.20 ... 
% Connection timed out; remote host not responding
The physical consoles on the wisms themselves work, and the IP management works.

Any ideas why the internal console connection fail?

Count Thrashula
Jun 1, 2003

Death is nothing compared to vindication.
Buglord

Gap In The Tooth posted:

Your frame relay map command should have a remote IP (the other end of your circuit) and the local DLCI.

Ohhhhh. Thanks, that's probably the issue.

H.R. Paperstacks
May 1, 2006

This is America
My president is black
and my Lambo is blue
Anyone successfully using dummynet, WANEM, or something similar (open-source) for WAN simulation? We have a small lab at our local site here and we use it from time to time to test things rather than capitalizing the big lab at HQ. I'm looking to just have it simulate latency, etc for a link between two routers. I am using a space RHEL6 server with two dedicated NICs.

I've never personally used dummynet as I've always done this testing at the big lab. My limited googling hasn't turned up anything good, but my thought process was:

code:

RTR1-gi0/0--->eth0(dummynet/wanem)eth1---->RTR2-gi-0/0

And using dummynet/wanem to inject the latency and such, but everything I've found online has both dummynet/wanem interfaces having IP involved instead of just an l2 bridge. I guess I could setup multi-hop BGP between the routers and allow dummynet/wanem to route it as well. Ideally I'd like to be able to use sub-interfaces on the simulator box so I can keep a lan management connection.

H.R. Paperstacks
May 1, 2006

This is America
My president is black
and my Lambo is blue

routenull0 posted:

open-source wan simulation stuff

Scratch this, I spent some time today getting dummynet working like a champ inside of RHEL6.

Count Thrashula
Jun 1, 2003

Death is nothing compared to vindication.
Buglord
I installed ACS 5.4 on an ESX server and can SSH to it just fine to set it up via CLI, but... I... have no "acs" commands?

I've looked through all the exec, show, and config commands, and there's not a single drat "acs" command in there anywhere. Is there something I need to do before I can start the web-server?

psydude
Apr 1, 2008

QPZIL posted:

I installed ACS 5.4 on an ESX server and can SSH to it just fine to set it up via CLI, but... I... have no "acs" commands?

I've looked through all the exec, show, and config commands, and there's not a single drat "acs" command in there anywhere. Is there something I need to do before I can start the web-server?

No. All administration is through the web GUI.

ruro
Apr 30, 2003

QPZIL posted:

I installed ACS 5.4 on an ESX server and can SSH to it just fine to set it up via CLI, but... I... have no "acs" commands?

I've looked through all the exec, show, and config commands, and there's not a single drat "acs" command in there anywhere. Is there something I need to do before I can start the web-server?
As long as you've configured the ip address, default gateway, ntp etc. Run application start acs and away you go via the web gui.

Count Thrashula
Jun 1, 2003

Death is nothing compared to vindication.
Buglord

ruro posted:

As long as you've configured the ip address, default gateway, ntp etc. Run application start acs and away you go via the web gui.

I tried that, but "show application" doesn't show any listings. Does the ACS have to have an active internet connection? My assigned NTP connection doesn't actually connect, but I didn't think much of it.

edit-- I'm thinking maybe the "installing acs..." phase of the installation failed, based on not seeing any output from "show application."

Count Thrashula fucked around with this message at 13:54 on Sep 20, 2013

Flash z0rdon
Aug 11, 2013

routenull0 posted:

Anyone successfully using dummynet, WANEM, or something similar (open-source) for WAN simulation? We have a small lab at our local site here and we use it from time to time to test things rather than capitalizing the big lab at HQ. I'm looking to just have it simulate latency, etc for a link between two routers. I am using a space RHEL6 server with two dedicated NICs.

I've never personally used dummynet as I've always done this testing at the big lab. My limited googling hasn't turned up anything good, but my thought process was:

code:

RTR1-gi0/0--->eth0(dummynet/wanem)eth1---->RTR2-gi-0/0


we used Netem on RHEL and while it was somewhat of a performance hit throughput-wise it gave us the delay we wanted.

Sepist
Dec 26, 2005

FUCK BITCHES, ROUTE PACKETS

Gravy Boat 2k
We used dummynet in the financial brokerage I used to work it, no one ever said it didn't do what it was supposed to. I don't have any first hand experience with it though.

H.R. Paperstacks
May 1, 2006

This is America
My president is black
and my Lambo is blue

Sepist posted:

We used dummynet in the financial brokerage I used to work it, no one ever said it didn't do what it was supposed to. I don't have any first hand experience with it though.

Yeah I got it all working like a champ last week. It's been great for the minimal stuff we'll do locally. Anything heavy we do in the mirror lab at HQ.

Herv
Mar 24, 2005

Soiled Meat
I have a question about the Authentication Proxy feature. It's old and I was wondering if there was a newer system.

The auth challenge is not working in Chrome with default settings, IE/Safari and the like do work with it still.

Has there been a replacement to this feature?

I have a new partner/tunnel set. The data they are accessing is classified/sensitive exam content (e.g. exams you have heard of) The remote partner has a /16, and two /24's on their end of the tunnel, with /24 on my end.

I like to use auth-proxy since I can create granular radius policies that can have specific users receive ACL entries based on their identity after a successful auth with their AD account on my end.

When a partner user auths, the radius server provides their ACL entries and they are applied to the ACL on the next hop interface once through the firewall. Then they have 8 hours till they have to re-auth. The auth-proxy is running on the vlan interface behind the firewall.

That way, while the tunnel is open to all their LANs they still have to auth to get through the deny all ACL (with their entries placed at the top to bypass it), and I have a record of who/what came through the tunnel for accounting purposes.

Anyone have a more recent suggestion? If not I have to make a custom HTML page that doesn't use whatever Chrome is blocking.

Thanks

Herv fucked around with this message at 04:54 on Sep 21, 2013

Adbot
ADBOT LOVES YOU

abigserve
Sep 13, 2009

this is a better avatar than what I had before
So those new 6800 "instant access" switches from cisco are basically like a window into the future of distribution -> access layer switching. Run 'em as fabric extenders of a 6880 distribution switch and boom, instead of managing 12 switches per building you manage...one. The technology isn't there yet (there are some hard limits about total port/switch counts) but give it 3-5 years and that'll be it.

  • 1
  • 2
  • 3
  • 4
  • 5
  • Post
  • Reply