World: r3wp

Join the discussions in the REBOL3 world...

[All] except covered in other channels

older newer	first last
Pekr 23-Feb-2005 [1267]	ah, dot trick worked, thanks ...
Tomc 24-Feb-2005 [1268x2]	R: I would add to our list of mail-list users, all altme users whenever safeworlds single point of failure archetecure rears it's head
Tomc 24-Feb-2005 [1268x2]	(such as the last half day)
Pekr 24-Feb-2005 [1270x2]	ah, finally back? What happened?
Pekr 24-Feb-2005 [1270x2]	uf, pressed Bold button mistakenly ....
Tomc 24-Feb-2005 [1272]	when all worlds are unavailable from everywhere it is because safeworlds is having network/hardware/software problems on the machine that looks the current address of a world
JaimeVargas 24-Feb-2005 [1273]	Yes the Safeworlds nameserver application was done. The application runs on the same server than their website.
Tomc 24-Feb-2005 [1274]	even if the world host address is working just fine in the next room
Pekr 24-Feb-2005 [1275]	no, not apparently, as their website worked here, while Atlme did not?
JaimeVargas 24-Feb-2005 [1276]	Because the nameserver app was done. The application runs on port 5500. Webserver in port 80 both hosted on the same ip-address 66.328.43.215
Tomc 24-Feb-2005 [1277]	website need not be on the same machine seerving world addresses
JaimeVargas 24-Feb-2005 [1278]	nameserver app failed
Pekr 24-Feb-2005 [1279]	I wonder, if Altme a) uses at least TWO machines to connect to for lookup purposes like DNS does b) can remember IP address so next time look-up server is not available, it can try direct IP connection .... if not, it is imo plain stupid as at least b) imo would not take more than few hours for programmer to add .... now Peter will come once again, and call it - "unreliable" :-)
JaimeVargas 24-Feb-2005 [1280]	From my observations of pkt trace. a) No it doesn't b) No caching either Pekr "unreliable" assestion correct.
Tomc 24-Feb-2005 [1281]	safeworlds did have a good long run with no downtime, but stuff happens eventually
Pekr 24-Feb-2005 [1282]	not "pekr unreliable" - it was joke. It was Peter Wood imo, who called it unreliable and once Rebol2 failed, someone joked that now Peter Wood will come and call it unreliable, so I just wait when he appears :-)
Tomc 24-Feb-2005 [1283x2]	Pekr- it is more a policy / bussiness pratice than praciical programming problem
Tomc 24-Feb-2005 [1283x2]	there are some advantages to safeworlds, theu can provide/ force upgrades to their software
Pekr 24-Feb-2005 [1285]	why business policy? Do you think SafeWorlds wants to keep it under control? Well, bad service then? I don't believe they don't have at least two machines. Alme client could contain list of alternative look-up server - it would cover most outages imo ...
Tomc 24-Feb-2005 [1286]	and it keeps them in the loop, not obsoleating yourself may make good bussiness sense (I am useless at bussiness so I am geussing here)
JaimeVargas 24-Feb-2005 [1287]	I think the should build some redundancy in the name resolution part of the AltMe service otherwise there will be service cut-offs.
Tomc 24-Feb-2005 [1288x2]	I think their feeling is that if they can provide reliable world-name service then it is the best choice.
Tomc 24-Feb-2005 [1288x2]	and they think they can
JaimeVargas 24-Feb-2005 [1290]	Tom, there other ways to address the "obsoleating" policy that doesn't require service cuts.
Tomc 24-Feb-2005 [1291x2]	however realithy does bite
Tomc 24-Feb-2005 [1291x2]	I am not the one that needs convincing ;)
Pekr 24-Feb-2005 [1293]	Tomc - if there is any business practice, I just don't buy it and SafeWorlds wants to stay in control. Then they should not promote product as private. IIRC, folks are reporting even their private running worlds don't run because of look-up server failure ...
Tomc 24-Feb-2005 [1294]	but they are private in that the data exchanged after the world-name-server lookup does not go anywhere near the world-name-server... it is just words
Terry 24-Feb-2005 [1295]	I'd be really embarrased right about now if I had promoted Altme as a corporate communications tool.
JaimeVargas 24-Feb-2005 [1296]	Well... Partially truth. The name server is consulted all the time. At least from my pkt trace. I am looking more deeply into it.
Tomc 24-Feb-2005 [1297]	If safeworlds were able to provide the service they are attempting to then we would not care for the most part
Pekr 24-Feb-2005 [1298]	Well, imagine running some cooperative group of ppl and lose ability to work because of AltME outage. What is even more strange is, that SafeWorlds did not bother to post any kind of message on their website. RT did ....
Tomc 24-Feb-2005 [1299]	effectivly true... if you have a connection and the world_name-server goes away you do not loose the connection you have.
Pekr 24-Feb-2005 [1300]	yes, that is probably true .....
Tomc 24-Feb-2005 [1301]	definatly
JaimeVargas 24-Feb-2005 [1302]	The msg are sent encrypted between server and client. But the name server is consulted all the time to know if there are msg waiting or something similar. I ma not quite sure why the name server keeps engaged after initial resolving.
Pekr 24-Feb-2005 [1303]	It does? Strange ....
JaimeVargas 24-Feb-2005 [1304]	If you had a pkt tracer check out the traffic going to 66.218.43.215 port 5500
Tomc 24-Feb-2005 [1305x2]	coule be keep-alive or statistics gathering
Tomc 24-Feb-2005 [1305x2]	quality control not needed for operation
Pekr 24-Feb-2005 [1307]	yes, I have Ethereal, never checked on AltME yet ...
JaimeVargas 24-Feb-2005 [1308x3]	;Interesting every second the client send a request that looks like cmd: live data: [world-name binary64] ;and the server replies with this cmd: lookup data: ok
	In the case above the "client" is my private AltMe world server. So the traffic is being sent from my server to the nameserver every second. I guess is a way to keep the ownership of the free AltMe world names.
	Enough hacking back to real work...
[unknown: 9] 24-Feb-2005 [1311x2]	.
[unknown: 9] 24-Feb-2005 [1311x2]	Tomc wrote "safeworlds did have a good long run with no downtime, but stuff happens eventually" Thank you, and yes we did. The error that happened today was fixed, and an additional check is now being made to prevent that from happening again in the future. One thing I can clearly say is that each problem has been unique, and each time one happens (a severe failure) we stop everything and jump on it, and in addition to correcting the current problem, write code to prevent the same problem from happening in the future. We have been down less than Live Journal, which has millions of people, and hundreds of servers for the same time period. So we rock!
BrianW 24-Feb-2005 [1313]	and you're modest too ;-)
[unknown: 9] 24-Feb-2005 [1314]	Modesty is a disease, and it wastes time, and truncates communication, which are both ethical crimes.
BrianW 24-Feb-2005 [1315]	heh.
[unknown: 9] 24-Feb-2005 [1316]	Anyone here know anything about Kylix?
older newer	first last