[xcat-user] xCAT 1.3b2 and Torque 2.1.8 STDOUT andSTDERR problem

Egan Ford egan at sense.net
Fri Dec 14 13:29:57 MST 2007


You can use qsub -o {hostname}:path and -e {hostname}:path to pick any
directory for the output to be copied to after the job completes.

But for spooling to ~/.pbs_spool vs /var/spool/pbs/spool on the mom I think
it is a compile time option only.

> -----Original Message-----
> From: xcat-user-bounces at lists.xcat.org 
> [mailto:xcat-user-bounces at lists.xcat.org] On Behalf Of Joseph Hargitai
> Sent: Friday, December 14, 2007 12:42 PM
> To: xCAT Users Mailing list
> Subject: Re: RE: [xcat-user] xCAT 1.3b2 and Torque 2.1.8 
> STDOUT andSTDERR problem
> Importance: High
> 
> 
> 
> is there perhaps a runtime flag in to set/unset this? 
> 
> j
> 
> ----- Original Message -----
> From: Egan Ford <egan at sense.net>
> Date: Friday, December 14, 2007 2:06 pm
> Subject: RE: [xcat-user] xCAT 1.3b2 and Torque 2.1.8 STDOUT 
> and STDERR problem
> 
> > --disable-spool  if disabled, TORQUE will create output and 
> error files
> >                  directly in $HOME/.pbs_spool if it exists 
> or in $HOME
> >                  otherwise.  By default, TORQUE will spool files in
> >                  $TORQUEHOME/spool and copy them to the users home
> >                  directory when the job completes.
> > 
> > If you used the latest xCAT torquemaker, then 
> --disable-spool was used
> > when
> > building torque.  Without --disable-spool a user has to 
> wait until the 
> > job
> > completely finishes/fails to get any intermediate results 
> because the
> > temporary output files are on the mom node in 
> /var/spool/pbs/spool (I 
> > guess
> > they can ssh in to see them).  With --disable-spool a user 
> can tail -f
> > ~/.pbs_spool/file.[oe] while the job runs to get 
> intermediate results. 
> >  The
> > user may opt to kill the job sooner than later.
> > 
> > --disable-spool + your mom config change may be the best of both 
> > worlds. Can you create a .pbs_spool directory in your home 
> and verify 
> > that you can tail -f the output while it gets created and 
> then after 
> > job completion it
> > gets copied to $PBS_WORKDIR?
> > 
> > Thanks.
> > 
> > > -----Original Message-----
> > > From: xcat-user-bounces at lists.xcat.org
> > > [mailto:xcat-user-bounces at lists.xcat.org] On Behalf Of 
> Chris Beggio
> > > Sent: Thursday, December 13, 2007 3:55 PM
> > > To: xcat-user at lists.xcat.org
> > > Subject: [xcat-user] xCAT 1.3b2 and Torque 2.1.8 STDOUT and 
> > > STDERR problem
> > > Importance: High
> > > 
> > > 
> > > 
> > > xCAT users,
> > > 
> > > I have recently pounded a new flat spot on my head with a PBS
> > > problem on a recently installed cluster. Versions are as follows:
> > > 
> > > Red Hat Enterprise Linux WS release 4 (Nahant Update 5)
> > > 
> > > xCAT 1.3.0-beta2
> > > Fri Jul 13 00:18:24 MDT 2007
> > > 
> > > Torque 2.1.8
> > > 
> > > The problem is that out of the box, PBS was not copying the
> > > PBS prolog and epilog, along with STDERR (${PBS_JOBID}.ER) 
> > > and STDOUT (${PBS_JOBID}.OU) or the joined output and error 
> > > stream to the directory where the job was executed 
> > > (${PBS_O_WORKDIR}), and instead was copying them to the user 
> > > home directory (${PBS_O_HOME}). I compared two identical 
> > > machines and all the versions and configurations appear to be 
> > > the same, but while one was copying output to 
> > > ${PBS_O_WORKDIR}, the more recently installed was not. Then I 
> > > changed /var/spool/pbs/mom_priv/config on the compute nodes 
> > > to include:
> > > 
> > > $usecp *:/home  /home
> > > 
> > > This line does not exist on the other working cluster.
> > > Everything works now and the file 
> > > ${PBS_JOBNAME}.o${PBS_JOBID} is now deposited in 
> > > ${PBS_O_WORKDIR} as expected.  What am I missing?
> > > 
> > > Thanks and Happy Festivus.
> > > 
> > > Chris
> > > --
> > > __________________________________________________
> > > Chris Beggio
> > > 
> > > 1600 Computing Support Team
> > > 
> > > Commercial Data Systems
> > > Contracted by Sandia National Laboratories
> > > 
> > > Phone: 505-284-8001
> > > Fax: 505-284-6078
> > > Email: cabeggi at sandia.gov
> > > __________________________________________________
> > > 
> > > 
> > > _______________________________________________
> > > xcat-user mailing list
> > > xcat-user at lists.xcat.org
> > > http://www.xcat.org/mailman/listinfo/xcat-user
> > 
> > _______________________________________________
> > xcat-user mailing list
> > xcat-user at lists.xcat.org 
> > http://www.xcat.org/mailman/listinfo/xcat-user
> _______________________________________________
> xcat-user mailing list
> xcat-user at lists.xcat.org 
> http://www.xcat.org/mailman/listinfo/xcat-user



More information about the xcat-user mailing list