
SM report





Yesterday I left the SM components running. After 4 hours it stopped
transferring files, but it didn't crash. Alex fixed the bug that caused
yesterday's crash. The strange thing is that when I did a "ps" today
there were two cmanager processes running. This has happened before and
we can't explain it. Alex thinks that ftp times out and somehow (that's
the part we are clueless about) a new cmanager starts by itself. This is
the ps output:

rmds03 % ps -l -u bernardo
 F S   UID   PID  PPID  C PRI NI     ADDR     SZ    WCHAN TTY      TIME CMD
 8 S  3275 12382 12379  0  51 20 627834a8    261 627836a0 pts/18   0:00 tcsh
 8 S  3275 12358 12355  0  51 20 62785668    246 62785860 pts/12   0:01 tcsh
 8 S  3275  7381 12382  0  41 20 62d43b78    465 61cc1386 pts/18   0:00 sii.sol
 8 Z  3275 25129  7372  0   0                                      0:02 <defunct>
 8 S  3275 12400 12391  0  51 20 633a0320    263 633a0518 pts/19   0:01 tcsh
 8 S  3275  7372 12400  0  41 20 63bd50f8   2813 63bd5168 pts/19   0:57 cmanager
 8 S  3275  7380 12358  0  48 20 633a09e0    693 6255785e pts/12   0:01 qe
 8 S  3275 29655 29652  0  51 20 629e14b0    266 629e16a8 pts/37   0:04 tcsh
 8 S  3275  7379 12370  0  41 20 63bd21b8    780 e7f05e84 pts/17   0:17 qmonitor
 8 S  3275 12370 12367  0  51 20 615a3470    261 615a3668 pts/17   0:00 tcsh
 8 S  3275 25174  7372  0  41 20 627848e8   2813 ef619924 pts/19   0:00 cmanager


There are two cmanager processes on the same tty. Has anyone seen
anything like this before? There is also a <defunct> process, maybe an
old pftp. The second cmanager has PPID 7372, which is the PID of the
first cmanager... Any help will be appreciated.
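
In case it helps with reading the listing, here is a rough Python sketch
that rebuilds the parent/child links from the three relevant rows above.
The function names (parse_ps, children_of, same_name_children) are made
up for illustration and are not part of the SM code. It only shows that
both the <defunct> process and the second cmanager are children of PID
7372. That is consistent with the first cmanager having forked a child
that has not (yet) exec'd anything else, but it does not prove that is
what happened.

```python
# Three rows copied from the ps -l listing above, plus the header.
PS_OUTPUT = """\
 F S   UID   PID  PPID  C PRI NI     ADDR     SZ    WCHAN TTY      TIME CMD
 8 S  3275  7372 12400  0  41 20 63bd50f8   2813 63bd5168 pts/19   0:57 cmanager
 8 Z  3275 25129  7372  0   0                                      0:02 <defunct>
 8 S  3275 25174  7372  0  41 20 627848e8   2813 ef619924 pts/19   0:00 cmanager
"""


def parse_ps(text):
    """Return {pid: (state, ppid, cmd)} from `ps -l` style output.

    Only the leading columns (S, PID, PPID) and the last column (CMD) are
    used, because a <defunct> row leaves the middle columns blank.
    Assumes the command name contains no spaces.
    """
    procs = {}
    for line in text.splitlines()[1:]:  # skip the header row
        fields = line.split()
        if len(fields) < 6:  # blank or truncated line
            continue
        state, pid, ppid, cmd = fields[1], int(fields[3]), int(fields[4]), fields[-1]
        procs[pid] = (state, ppid, cmd)
    return procs


def children_of(procs, parent_pid):
    """Sorted PIDs whose PPID is parent_pid."""
    return sorted(pid for pid, (_, ppid, _) in procs.items() if ppid == parent_pid)


def same_name_children(procs):
    """Sorted (parent_pid, child_pid) pairs where both run the same command."""
    pairs = []
    for pid, (_, ppid, cmd) in procs.items():
        parent = procs.get(ppid)
        if parent is not None and parent[2] == cmd:
            pairs.append((ppid, pid))
    return sorted(pairs)


if __name__ == "__main__":
    procs = parse_ps(PS_OUTPUT)
    print("children of 7372:", children_of(procs, 7372))
    print("same-name parent/child pairs:", same_name_children(procs))
    print("zombies:", sorted(p for p, (s, _, _) in procs.items() if s == "Z"))
```

The same functions should work on the full listing if it is pasted into
PS_OUTPUT, since rows whose parent is not in the listing are ignored.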

Thanks,
Luis




--------------------------------------------------------------
Luis M. Bernardo  <lmbernardo@lbl.gov> <www.lbl.gov/~bernardo>  
Scientific Data Management Research & Development Group
Lawrence Berkeley National Laboratory         
Berkeley, CA 94720