[wxqc] cwop.aprs.net

Gerry Creager gerry.creager at tamu.edu
Mon Jun 2 08:10:45 CDT 2008


We're working to get better monitoring going, but realize that all three 
dedicated servers already run in professional environments.

As for last evening, CWOP-2 suffered a power supply failure.  Although a 
redundant power supply system, it was deemed necessary to replace the 
failed unit.  The box was down for ~1 minute according to the system 
administrator, although I recorded 22 minutes of downtime on my network 
monitoring system.

What we appear to have is a set of ongoing problems that we've discussed 
here before:
1.  OS caching DNS lookups?  Don't know, always a possibility.
2.  Local routers caching DNS data and not refreshing as needed.
3.  ISPs caching DNS and not obeying TTL setting in Bind.
4.  Client software not responding to a failure appropriately and 
re-requesting DNS lookup.

The connection request should time out after 30 sec and another server 
should be queried at that point.  That'd be problemmatical if we had 
consistent 30 sec latencies but we shouldn't have that.

I've started investigating a proxy method in place of round-robin DNS 
lookups to direct all clients to a live server on a load-leveling basis. 
  I've not found one I like yet, as most such systems expect the servers 
to be co-located instead of geographically diverse, as we are.  As more 
develops in this area I'll report.

Gerry

Merton Campbell Crockett wrote:
> You might want to check the status of the system again.  I show it 
> dropping connections starting at 2121z and continuing until 0001z. 
> 
> I defined APRS.NET as a forward zone so its a little hard to tell if the 
> problem is continuing as its address tends to be at the end of the list.
> 
> Merton Campbell Crockett
> 
> 
> 
> On 01 Jun 2008, at 17:04:18, Ted Lum wrote:
> 
>> CWOP-2 is back @ Sun Jun 1 16:00:57 UTC 2008
>>
>> Ted Lum wrote:
>>> Yes, CWOP-2 @ Sun Jun 1 13:37:10 UTC 2008. Think its just the service on 
>>> the box.
>>>
>>> tim.mcmanus at mac.com wrote:
>>>   
>>>> I think one of these servers went down again.  Every time a server  
>>>> drops in that rotation, WeatherLink locks up sending data.  It locked  
>>>> up for 45 minutes this time.
>>>>
>>>> Is there a better way to do this?  I think this is the fifth or sixth  
>>>> time a dead server in the rotation locked up WeatherLink since we  
>>>> changed the URL to cwop.aprs.net.
>>>>
>>>>
>>>> --
>>>> Tim McManus
>>>> tim.mcmanus at mac.com
>>>>
>>>>
>>>>
>>>> _______________________________________________
>>>> wxqc mailing list
>>>> Post messages to wxqc at lists.gladstonefamily.net
>>>> To unsubcribe or change delivery options, please go to:
>>>> http://server.gladstonefamily.net/mailman/listinfo/wxqc
>>>> To search the archives: http://www.google.com/coop/cse?cx=008314629403309390388%3Aknlfnptih9u
>>>>
>>>> The contents of this message are the responsibility of the author.
>>>>
>>>>   
>>>>     
>>>
>>>   
>>
>>
>> -- 
>> This message has been scanned for viruses and
>> dangerous content by *MailScanner* <http://www.mailscanner.info/>, and is
>> believed to be clean. _______________________________________________
>> wxqc mailing list
>> Post messages to wxqc at lists.gladstonefamily.net 
>> <mailto:wxqc at lists.gladstonefamily.net>
>> To unsubcribe or change delivery options, please go to:
>> http://server.gladstonefamily.net/mailman/listinfo/wxqc
>> To search the archives: 
>> http://www.google.com/coop/cse?cx=008314629403309390388%3Aknlfnptih9u
>>
>> The contents of this message are the responsibility of the author.
> 
> Merton Campbell Crockett
> m.c.crockett at roadrunner.com <mailto:m.c.crockett at roadrunner.com>
> 
> 
> 
> 
> ------------------------------------------------------------------------
> 
> _______________________________________________
> wxqc mailing list
> Post messages to wxqc at lists.gladstonefamily.net
> To unsubcribe or change delivery options, please go to:
> http://server.gladstonefamily.net/mailman/listinfo/wxqc
> To search the archives: http://www.google.com/coop/cse?cx=008314629403309390388%3Aknlfnptih9u
> 
> The contents of this message are the responsibility of the author.

-- 
Gerry Creager -- gerry.creager at tamu.edu
Texas Mesonet -- AATLT, Texas A&M University	
Cell: 979.229.5301 Office: 979.458.4020 FAX: 979.862.3983
Office: 1700 Research Parkway Ste 160, TAMU, College Station, TX 77843


More information about the wxqc mailing list