
Re: help with BIND SRV

Juha-Matti Tapio wrote:
> On Thu, Oct 07, 2004 at 08:23:31PM -0600, Nate Duehr wrote:
>
>> Most people setting up round-robin DNS type setups for redundancy, with scripts to change things for failover, get bit by these things:
>>
>> - They don't understand that there might be multiple DNS servers between their top-level and the machine they're servicing (3X and 4X TTL)
>
> RFC 1035 specifies in section 6.1.3 that requests served from a cache should return a TTL which has been decremented by the number of seconds spent in the cache, i.e. the TTL "counts down" in the cache.
>
> Therefore I consider any caching nameserver that does not do this broken. Are there a significant number of such servers out there?
>
> I agree with most of the other points, though.

Ahh... it's a trap.  Think about this.

1 - Regular DNS server hosting "something.com"
2 - ISP's caching nameserver
3 - Your company's nameserver
4 - A caching nameserver on your desktop machine

Now add in the fact that, say, your company AND your ISP both intercept all port 53 traffic and proxy all DNS requests through their own servers. Not super-common -- but there ARE organizations and ISPs out there that do this for whatever convoluted security or other reasons.

Depending on how the proxying is set up, each server can faithfully implement the RFC you mention, and a change on server 1 to a record that's cached by the nameserver on your local desktop can still take up to 3X TTL to show up at your desktop! (It also means that if machine 1, or one of the other NSes for that zone, doesn't answer at all, it can take 3X TTL to clear the negative cache as well.)

This, of course, is NOT the norm -- but it's out there. (Think service providers and highly secured networks. I believe AOL's DNS implementation had this multiply-cascaded DNS proxying problem for many years, though they cleaned it up in the mid-1990s.)

People forget this type of setup exists when they set their TTLs for "quick" changes.
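To make the arithmetic concrete, here's a minimal sketch (hypothetical numbers, not from any real deployment) of the worst case when each intercepting layer caches a record on its own clock -- i.e. it re-caches with the record's full original TTL rather than the decremented remainder, which is exactly the kind of proxy setup described above:

```python
# Worst-case propagation delay through cascaded, independently-caching
# DNS layers. If each layer restarts the TTL countdown when it refetches
# (instead of honoring the decremented TTL from the layer above it),
# delays stack up to roughly one full TTL per caching layer.

def worst_case_propagation(ttl_seconds, caching_layers):
    """Upper bound, in seconds, on how long a record change can take
    to reach the client through `caching_layers` independent caches."""
    return ttl_seconds * caching_layers

ttl = 3600      # hypothetical 1-hour TTL on the record
layers = 3      # e.g. ISP cache -> company cache -> desktop cache

print(worst_case_propagation(ttl, layers))   # 10800 seconds -- 3X TTL
print(worst_case_propagation(ttl, 1))        # 3600 -- the "expected" case
```

With a single well-behaved cache in the path, the delay is bounded by one TTL; every additional independently-refetching layer can add another full TTL on top.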

Of course, more common is:

1 - Machine hosting "something.com"
2 - Caching nameserver on your desktop that's allowed to make direct connections out port 53 to the world

In that scenario, TTLs work as expected.

Another common setup:

1 - Machine hosting "something.com"
2 - Company nameserver
3 - Your desktop machine NOT running a caching nameserver, just a resolver.

Again, normal behaviour.

Once, during some re-IP'ing work while I was at a colocation/hosting company, we had an agreeable customer who wanted to make IP addressing changes and who was willing to work with me, the "DNS guy", through his transition. Because he was very paranoid about the move, we actually set him up with dual IPs on every box he had for the transition period, and then made the DNS changes.

We found from his server logs that broken DNS servers and resolvers out there were still hitting the old IP addresses 5 days later. In this particular case, his TTL was set to 1 day.

Hopefully that gives a better indication of how many layers, and how many truly broken DNS resolvers and server setups, are out there. With his TTLs at a very reasonable 1 day, it still took a business week to see all his traffic move over to the new IPs. A week after that we reclaimed the extra IPs, shut down the subinterfaces on his systems, and killed the routes -- at that point there was virtually no traffic hitting the old range, but the number still wasn't zero.

Of course, as long as there is no negative cache entry for the zone anywhere, the fool-proof method of changing things is to publish a completely new A record name that has never been used in your zone. That forces a lookup all the way back to your servers, and thus the freshest IP information.
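The reason the trick works can be sketched as a toy cache model (the names and addresses below are hypothetical, just for illustration): a resolver can only serve stale data for names it has already seen, so a brand-new label is a guaranteed cache miss at every layer and the query has to walk back to the authoritative servers.

```python
# Toy model of why a never-before-used name bypasses stale caches.
# A cached name may be answered with an old (stale) address until its
# TTL runs out; an unknown name forces an authoritative lookup.

def resolve(name, cache, authoritative):
    """Return (address, served_from_cache). Hypothetical data only."""
    if name in cache:                     # stale entry still within TTL
        return cache[name], True
    addr = authoritative[name]            # forced authoritative lookup
    cache[name] = addr                    # cache the fresh answer
    return addr, False

cache = {"www.something.com": "192.0.2.10"}           # old address, stale
authoritative = {"www.something.com": "198.51.100.20",
                 "new.something.com": "198.51.100.20"}  # the new address

print(resolve("www.something.com", cache, authoritative))
# -> ('192.0.2.10', True)   stale answer from cache
print(resolve("new.something.com", cache, authoritative))
# -> ('198.51.100.20', False)  fresh, straight from the authority
```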

One place I saw this used very effectively during a site move: the company immediately reissued the zone data with its "www.something.com" record changed and a new "new.something.com" A record, both pointing at the new address.

Then they set up a server at the old location that did nothing but have a redirect page from "www" to "new". They then took the big server across town and plugged it in at the IP "new" pointed to.

Client machines that had correct information quickly just went to "www" at the new address. Lagging machines and broken DNS architectures hit "www" at the old address and were redirected to "new".

Of course, this customer had to be careful not to use name-based virtual hosting on their webserver -- or to make sure the big machine at the new site would answer correctly for both "www" and "new". I forget which tactic they used.
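For the stub box left behind at the old address, the redirect could be as simple as this sketch (assuming an Apache webserver and the hypothetical names from above -- I don't know what they actually ran):

```
# Hypothetical Apache config for the throwaway server at the OLD address:
# every request for www.something.com gets bounced to new.something.com.
<VirtualHost *:80>
    ServerName www.something.com
    Redirect permanent / http://new.something.com/
</VirtualHost>
```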

If you really think about the queries going on in DNS, there are plenty of ways to move things around safely without downtime. I was always floored when some dot.bomb company would decide to move IPs, leave nothing on the old ones, and just "hope" -- even with the 7-day-or-higher TTL they'd set up long ago -- that customers would find them.

Of course, since we also sold them their bandwidth, it was usually painfully obvious that their traffic load had dropped dramatically during these hastily planned moves. The sad part is that this is usually when they would FINALLY call asking for help, and by then the only option available would be something like the new-A-record trick described above.

Customers painted themselves into corners with DNS and site moves all the time in the colocation/hosting biz.

Hopefully that helps visualize it...

