[OpenAFS] HELP! We've lost our sync site

ted creedon tcreedon@easystreet.com
Sat, 10 Jan 2004 11:27:54 -0800


If you build 1.2.10a on a Linux box, is there no problem, or does one
have to wait for 1.2.11?

-----Original Message-----
From: openafs-info-admin@openafs.org [mailto:openafs-info-admin@openafs.org]
On Behalf Of Douglas E. Engert
Sent: Saturday, January 10, 2004 11:16 AM
To: James M Mosley
Cc: openafs-info@openafs.org
Subject: Re: [OpenAFS] HELP! We've lost our sync site

Did you see the four notes on "Ubik time overflow at 0x40000000"?
That is the problem.
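
To make the 0x40000000 figure concrete, here is a minimal Python sketch.
It assumes the constant is a count of seconds since the Unix epoch, and
that the "Sync host 0.0.0.0 was set 1073761176 secs ago" line in the udebug
output quoted below reflects a never-set (zero) timestamp, so that the
figure is effectively the server's clock at the time of the query. The
variable names are illustrative only.

```python
from datetime import datetime, timezone

# Assumption: the overflow threshold is 2**30 seconds since the Unix epoch.
OVERFLOW = 0x40000000


def as_utc(epoch_seconds):
    """Convert seconds since the Unix epoch to a timezone-aware UTC datetime."""
    return datetime.fromtimestamp(epoch_seconds, tz=timezone.utc)


overflow_at = as_utc(OVERFLOW)

# Assumption: "set 1073761176 secs ago" means "set at time 0", so this
# number is the server's current time in epoch seconds.
udebug_now = 1073761176
seconds_past_overflow = udebug_now - OVERFLOW

print(overflow_at.isoformat())          # 2004-01-10T13:37:04+00:00
print(as_utc(udebug_now).isoformat())   # 2004-01-10T18:59:36+00:00
print(seconds_past_overflow)            # 19352
```

Under those assumptions the udebug snapshot was taken 19352 seconds
(roughly 5.4 hours) after the clock crossed 0x40000000, which lines up
with the "about 6 hours" in the report.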

What version of AFS are you running, and on what OS?
If you can't build it yourself, or can't wait for the OpenAFS people
to build a release, maybe someone else might have built it by now.
(I have had OpenAFS-1.2.10 running on sun4x_58 for the last 15 minutes.)

James M Mosley wrote:
>
> All,
>         We need immediate help!  We have been unable to establish a sync
> site for about 6 hours.  All 3 of our database servers are up and appear
> to be performing the election as expected.  However, the server that
> should become the sync site doesn't.  Here is some output from udebug
> on that server:
>
> as-sm1# udebug as-sm1 7002 -long
> Host's addresses are: 152.15.10.70
> Host's 152.15.10.70 time is Sat Jan 10 13:59:36 2004
> Local time is Sat Jan 10 13:59:39 2004 (time differential 3 secs)
> Last yes vote for 152.15.10.70 was 3 secs ago (not sync site);
> Last vote started 3 secs ago (at Sat Jan 10 13:59:36 2004)
> Local db version is 1073480540.254
> I am not sync site
> Lowest host 152.15.10.70 was set 3 secs ago
> Sync host 0.0.0.0 was set 1073761176 secs ago
> Sync site's db version is 1073480540.254
> 0 locked pages, 0 of them for write
>
> Server (152.15.13.7): (db 0.0)
>     last vote rcvd 5 secs ago (at Sat Jan 10 13:59:34 2004),
>     last beacon sent 3 secs ago (at Sat Jan 10 13:59:36 2004), last vote was yes
>     dbcurrent=0, up=1 beaconSince=1
>
> Server (152.15.30.27): (db 0.0)
>     last vote rcvd 4 secs ago (at Sat Jan 10 13:59:35 2004),
>     last beacon sent 3 secs ago (at Sat Jan 10 13:59:36 2004), last vote was yes
>     dbcurrent=0, up=1 beaconSince=1
> as-sm1#
>
> The only strange thing we have noticed is that when we attempted to
> stop/restart the database servers to see if the condition would clear itself
> up, we saw as-sm1 become the sync site (as it should), but it claimed it was
> a sync site for a negative number of seconds.  The amount of time seemed
> to refer back to about the time we started seeing the problem, as evidenced
> by the last time the local database files were updated.
>
> All three database servers are running Solaris 9 and OpenAFS 1.2.10.
>
> We need help soon.  Thanks.
>
> Mike
>
> -------------------------------------
> Mike Mosley                             Email: jmmosley@uncc.edu
> Systems Software Developer              Phone: (704) 687-3522
> College of Engineering, UNC-Charlotte   Fax: (704) 687-2352
> _______________________________________________
> OpenAFS-info mailing list
> OpenAFS-info@openafs.org
> https://lists.openafs.org/mailman/listinfo/openafs-info

-- 

 Douglas E. Engert  <DEEngert@anl.gov>
 Argonne National Laboratory
 9700 South Cass Avenue
 Argonne, Illinois  60439
 (630) 252-5444