Changes between Version 7 and Version 8 of Sun-Local Computing


Ignore:
Timestamp:
04/03/06 16:23:31 (20 years ago)
Author:
cdelarcuz
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • Sun-Local Computing

    v7 v8  
    327327To use SSL, you'll need to select that option in your email program.
    328328
    329 Refer to [http://www.astro.columbia.edu/~crussell/computing/notes/netscape_email_ssl.note
    330 SSL email setup details] for Netscape 6.2 .
    331 
    332 
     329Refer to [http://www.astro.columbia.edu/~crussell/computing/notes/netscape_email_ssl.note SSL-notes] for Netscape 6.2 setup details.
     330
     331= Pool & Backup =
     332
     333== Permanent Sun host Data Directory Space ==
     334
     335
     336        If you need disk space on the Sun hosts, send mail to crussell@astro .
     337        Note that ONLY /home directories are backedup on the Sun hosts.
     338
     339
     340
     341== Backup Info ==
     342
     343        4mm tape media is available in 1008 or in 1328 so that students can
     344        backup data directories.
     345
     346== Backing up Directories ==
     347
     348        Notes for backing up directories can be found [http://www.astro.columbia.edu/~crussell/computing/notes/backup.note here].
     349
     350= Rebooting Sun Hosts =
     351
     352== EMERGENCY SUN HOST REBOOT/POWER OFF ==
     353
     354
     355If no one with root password access (crussell, caleb, jules or arlin) is
     356available and you have determined (see below) that a reboot is
     357necessary, you can reboot as follows:
     358
     359
     3601. If host has a front panel 'button' (ie. all SunBlades and all Ultras
     361                                           except for Ultra-1's)
     362   then to reboot safely:
     363
     364   Push and let go quickly the lit button on the front panel.  The
     365   screen will go black.  About a minute or two later, the screen will
     366   light up, a message about syncing the disks will be displayed and
     367   then the cpu box will be powered off.
     368
     369   After a moment, check that the light in or near the front panel
     370   button is off.  Then push and let go quickly the button.  The host
     371   will boot back up.
     372       
     373        NOTE: Host lakme in Room 1325 has a hardware problem and this
     374              does not work on that host.
     375        NOTE: If the 'safe' reboot doesn't work, the last resort is
     376              to hold in the 'button' about 4 seconds or more until the
     377              cpu box powers off (light is out).  If that fails, then
     378              unplug the cpu box.
     379
     380   *See below for errors that occur when the 'safe' reboot isn't possible.*
     381
     382       
     3832. If host has no front panel 'button' (ie. Ultra-1     attila, isolde, lucia
     384                                            SPARCtation-10      odyssey
     385                                            SPARCstation-5      doncarlo )
     386   then to reboot safely:
     387
     388   Hold down 'Stop' key and the 'a' key.  This will display the okay prompt:
     389                OK>
     390   In order to sync the disks (this does NOT work on other hosts),
     391        type:
     392                OK> sync
     393   After many messages (ie: syncing disks, dumping, ...) the screen
     394   will go black.  When the screen turns white, the host has started
     395   to reboot.
     396
     397   If you need to power off the host, then instead, as soon as the
     398   screen turns white for the reboot, do a 2nd 'Stop-A'.  Then at the
     399   prompt, type
     400                OK> power-off
     401   to power off the cpu box.
     402
     403        NOTE: If the 'safe' reboot doesn't work, the last resort is
     404              use the switch by the plug to power off the cpu box.
     405
     406   *See below for errors that occur when the 'safe' reboot isn't possible.*
     407
     408                    ............................
     409
     410== IS A REBOOT NECESSARY? ==
     411
     412Most Sun hosts act as data disk servers, some also as home directory
     413servers and others as mail, print, web, and software servers.
     414
     415As the Sun systems are interdependent, they are ideally always up and
     416running with all disks mounted.  Rebooting should be a last resort.
     417
     418Below are guidelines on how to avoid a reboot/poweroff of a Sun host.
     419See guidelines for doing a 'safe' reboot/poweroff above.
     420
     421How to avoid a reboot:
     422
     4231. Keep a 'console' window open.  All system error messages are
     424   displayed in the console window.  (In CDE's menu, select HOSTS -->
     425   Terminal Console )
     426
     4272. Check the messages in the console window.  If they include:
     428        ... server SOME_HOST not responding ...
     429   then your host is likely just waiting on another host that is
     430   temporarily unavailable.  Wait for the remote host to be
     431   back up.  You should only reboot as a last resort and only if you
     432   can work without that remote host.
     433
     434   If the console messages include:
     435        ... server SOME_HOST_X not responding ...
     436        ... server SOME_HOST_Y not responding ...
     437        ... server SOME_HOST_Z not responding ...
     438   then there is likely a network problem and rebooting will not help.
     439   Get in touch with the system manager.
     440
     4413. If your console session is hung or frozen and there are no
     442        ... server SOME_HOST not responding ... 
     443    messages, then try to kill the session from another host:
     444                > ssh -l your_username  your_hostname
     445                > psallw | grep ksh
     446USER PID  %CPU PRI NI S %MEM  VSZ   SZ  ELAPSED    TIME TT      COMMAND   
     447YOU  9561  0.0  59 20 S  0.1 1952  244  1-02:32:54 0:00 ?  /bin/ksh \
     448                                                        /usr/dt/bin/Xsession
     449                > kill  9561            (PID# for your Xsession 'ksh' process)
     450
     451
     452                    .................................
     453
     454== LIKELY ERRORS AFTER 'UNSAFE' REBOOT/POWEROFF ==
     455
     456   These occur when the 'safe' reboot/poweroff described above was
     457   not possible.
     458
     459        Disk .../dev/dsk/c0t0d0s0... not OKAY
     460
     461                ...Run fsck...
     462
     463   Before the indicated disk(s) can be mounted, the root account must run
     464   'fsck' on that disk.  If no one is available to use the root
     465   account, the disk and, possibly the host, will remain off line.
     466
     467       
     468       
     469               
     470               
    333471
    334472