5.4 Pagasus - Test Montage DAX example

This page is still under construction and may contain incorrect information

Pegasus Tutorial

This is based on Tutorial note, but it is tailored to run Pegasus and montage.dax example on a local condor pool, with MyProxy.

Prerequisites for this node

  1. Condor (running condor_master and condor_schedd). It comes with Condor-G.
  2. condor_submit working (Refer to the Condor Submit Test in these notes).
  3. Linux user accounts on the local machine.
  4. Globus installed as the 2nd node and working as client.
    (Copied the /etc/grid-security/certificates etc.)
  5. Globus usercerts were made on the Globus MyProxy sever.
  6. The user authentication entries are made in /etc/grid-security/gridmapfile on local and central server.
    # vim /etc/grid-security/grid-mapfile
    --insert--
    "/O=Grid/OU=GlobusTest/OU=simpleCA-grid2.ramscommunity.org/OU=ramscommunity.org/CN=Yoichi Takayama" yoichi
    "/O=Grid/OU=GlobusTest/OU=simpleCA-grid2.ramscommunity.org/OU=ramscommunity.org/CN=Luke Foxton" lfoxton
  7. Pegasus Mapper was installed on this host.
  8. NFS is mounted.
  9. Pegasus Worker is installed on NFS.
  10. Montage is installed on NFS.

Pegasus Init

This has been defined in /etc/profile

unset CLASSPATH
...
...
export PEGASUS_HOME=/usr/local/pegasus/2.2.0
source $PEGASUS_HOME/setup.sh

$ env | grep PEGASUS
PEGASUS_HOME=/usr/local/pegasus/2.2.0

$ env | grep CLASSPATH
CLASSPATH=/usr/local/pegasus/2.2.0/lib/xmlParserAPIs.jar:/usr/local/pegasus/2.2.0/lib/cryptix-asn1.jar:/usr/local/pegasus/2.2.0/lib/mysql-connector-java-5.0.5-bin.jar:/usr/local/pegasus/2.2.0/lib/preservcsl.jar:/usr/local/pegasus/2.2.0/lib/java-getopt-1.0.9.jar:/usr/local/pegasus/2.2.0/lib/postgresql-8.1dev-400.jdbc3.jar:/usr/local/pegasus/2.2.0/lib/puretls.jar:/usr/local/pegasus/2.2.0/lib/cryptix32.jar:/usr/local/pegasus/2.2.0/lib/exist.jar:/usr/local/pegasus/2.2.0/lib/commons-logging.jar:/usr/local/pegasus/2.2.0/lib/xmldb.jar:/usr/local/pegasus/2.2.0/lib/jce-jdk13-125.jar:/usr/local/pegasus/2.2.0/lib/pegasus.jar:/usr/local/pegasus/2.2.0/lib/log4j-1.2.8.jar:/usr/local/pegasus/2.2.0/lib/xmlrpc.jar:/usr/local/pegasus/2.2.0/lib/accessors.jar:/usr/local/pegasus/2.2.0/lib/commons-pool.jar:/usr/local/pegasus/2.2.0/lib/cog-jglobus.jar:/usr/local/pegasus/2.2.0/lib/xercesImpl.jar:/usr/local/pegasus/2.2.0/lib/globus_rls_client.jar:/usr/local/pegasus/2.2.0/lib/jakarta-oro.jar:/usr/local/pegasus/2.2.0/lib/exist-optional.jar:/usr/local/pegasus/2.2.0/lib/junit.jar:/usr/local/pegasus/2.2.0/lib/cryptix.jar:/usr/local/pegasus/2.2.0/lib/resolver.jar:/usr/local/pegasus/2.2.0/lib/wings.jar

Get Pegasus tutorial package.

cd $HOME
$ wget http://pegasus.isi.edu/tutorial/issgc08/pegasus-wms-issgc08.tar.gz
$ tar -xzvf pegasus-wms-issgc08.tar.gz
$ cd pegasus-wms
$ ls -l
total 64
drwxr-xr-x 2 yoichi yoichi 4096 Sep 11 18:46 config
drwxr-xr-x 3 yoichi yoichi 4096 Aug 23 12:54 dags
drwxr-xr-x 2 yoichi yoichi 4096 Aug 23 12:37 dax
-rw-r--r-- 1 yoichi yoichi  113 May  6 10:49 partitiondax
-rw-r--r-- 1 yoichi yoichi  147 Jun  7 03:29 pegasus-plan-grid
-rw-r--r-- 1 yoichi yoichi  151 Jun  7 03:29 pegasus-plan-grid-defer
-rw-r--r-- 1 yoichi yoichi  145 May  1 10:01 pegasus-plan-local
-rw-r--r-- 1 yoichi yoichi   48 Jun  6 04:40 proxy-logon

Input data

Got the tutorial input data from ISI site and installed in /nfs/shared-scratch/tutorial/pegasus-wms/.

# mkdir /nfs/shared-scratch/tutorial/pegasus-wms/inputdata/montage
# mv inputdata/0.2degree /nfs/shared-scratch/tutorial/pegasus-wms/inputdata/montage
# chmod -R 1777 /nfs/shared-scratch
# ls -l /nfs/shared-scratch/tutorial/pegasus-wms/inputdata/montage/0.2degree
total 12536
-rwxrwxrwt 1 condor condor 2111040 Aug 28 23:54 2mass-atlas-990502s-j1420186.fits
-rwxrwxrwt 1 condor condor 2111040 Aug 28 23:54 2mass-atlas-990502s-j1420198.fits
-rwxrwxrwt 1 condor condor 2111040 Aug 28 23:54 2mass-atlas-990502s-j1430080.fits
-rwxrwxrwt 1 condor condor 2111040 Aug 28 23:54 2mass-atlas-990502s-j1430092.fits
-rwxrwxrwt 1 condor condor 2111040 Aug 28 23:54 2mass-atlas-990502s-j1440186.fits
-rwxrwxrwt 1 condor condor 2111040 Aug 28 23:54 2mass-atlas-990502s-j1440198.fits
-rwxrwxrwt 1 condor condor    1294 Aug 28 23:54 cimages.tbl
-rwxrwxrwt 1 condor condor   33426 Aug 28 23:54 dag.xml
-rwxrwxrwt 1 condor condor    1229 Aug 28 23:54 diffs.tbl
-rwxrwxrwt 1 condor condor    2750 Aug 28 23:54 images.tbl
-rwxrwxrwt 1 condor condor    1294 Aug 28 23:54 pimages.tbl
-rwxrwxrwt 1 condor condor     304 Aug 28 23:54 region.hdr
-rwxrwxrwt 1 condor condor    1376 Aug 28 23:54 rimages.tbl
-rwxrwxrwt 1 condor condor     431 Aug 28 23:54 statfile.tbl
-rwxrwxrwt 1 condor condor    1360 Aug 28 23:54 url.list

Properties and Catalog files

Pegasus needs 1 properties file and 3 catalog files to run. The properties file indicates which catalog files to use (and it contains other properties, too).

  • Replica catalog - files names and locations.
  • Sites catalog - Grid pool site def (host names, service names & locations).
  • Transformation catalog - Grid applications (names and locations).

Properties file

> cat config/properties
##########################
# PEGASUS USER PROPERTIES
##########################

## SELECT THE REPLICAT CATALOG MODE AND URL
pegasus.catalog.replica = SimpleFile
pegasus.catalog.replica.file = ${user.home}/pegasus-wms/config/rc.data
#pegasus.catalog.replica.url=rlsn://smarty.isi.edu

## SELECT THE SITE CATALOG MODE AND FILE
pegasus.catalog.site = XML
pegasus.catalog.site.file = ${user.home}/pegasus-wms/config/sites.xml


## SELECT THE TRANSFORMATION CATALOG MODE AND FILE
pegasus.catalog.transformation = File
pegasus.catalog.transformation.file = ${user.home}/pegasus-wms/config/tc.data

## SET UP THE WORK AND INVOCATION DATABASE
#pegasus.catalog.work =  Database
#pegasus.catalog.provenance = InvocationSchema

## Database related properties
#pegasus.catalog..db.driver = MySQL
#pegasus.catalog.*.db.url = jdbc:mysql://smarty.isi.edu/osg2008
#pegasus.catalog.*.db.user = osg2008user
#pegasus.catalog.*.db.password =  Osg2008

## USE DAGMAN RETRY FEATURE FOR FAILURES
pegasus.dagman.retry=2

## STAGE ALL OUR EXECUTABLES OR USE INSTALLED WHEREVER THEY EXIST
# pegasus.catalog.transformation.mapper = All
# Use All for genera use

# pegasus.catalog.transformation.mapper = Installed
# Use Installed for diamond.dax example

pegasus.catalog.transformation.mapper = Staged
# Staged means pegasus will deploy a worker package
# into the working directory of the workflow
# which must be on the nfs and shared.
# Otherwise Worker packages must be installed
# at the same locations on all
# Worker nodes as specified in the tc.data.

## CHECK JOB EXIT CODES FOR FAILURE
pegasus.exitcode.scope=all
pegasus.exitcode.arguments= -e
## OPTIMZE DATA & EXECUTABLE TRANSFERS
pegasus.transfer.refiner=Bundle

#STAGE DATA AND EXECUTABLES USING GRIDFTP 3rd PARTY MODE
pegasus.transfer.*.thirdparty.sites=*
pegasus.transfer.*.impl  GUC

## WORK AND STORAGE DIR
pegasus.dir.storage = ${user.name}/storage
pegasus.dir.exec = ${user.name}/exec

# JOB Priorities

pegasus.job.priority=10
pegasus.transfer.*.priority=100

#JOB CATEGORIES
pegasus.dagman.projection.maxjobs 2

Replica catalog

Edit the ./config/rc.in file to reflect the change of the file locations. This includes diamond.dax input/output files. Local input files are located in /scratch/tutorial/inputdata. The output files are located in local /scratch/storage. Network files are located in /nfs/shared-scratch/tutorial/pegasus-wms/inputdata.

$ vi config/rc.in
-- insert --
# file-based replica catalog: 2008-09-24T01:58:11.069-07:00
f.a gsiftp://grid1.ramscommunity.org/scratch/tutorial/inputdata/diamond/f.a pool="local"
2mass-atlas-990502s-j1420198.fits gsiftp://grid1.ramscommunity.org/nfs/shared-scratch/tutorial/pegasus-wms/inputdata/montage/0.2degree/2mass-atlas-990502s-j1420198.fits pool="local"
2mass-atlas-990502s-j1430092.fits gsiftp://grid1.ramscommunity.org/nfs/shared-scratch/tutorial/pegasus-wms/inputdata/montage/0.2degree/2mass-atlas-990502s-j1430092.fits pool="local"
images_20080505_143233_14944.tbl gsiftp://grid1.ramscommunity.org/nfs/shared-scratch/tutorial/pegasus-wms/inputdata/montage/0.2degree/images.tbl pool="local"
2mass-atlas-990502s-j1430080.fits gsiftp://grid1.ramscommunity.org/nfs/shared-scratch/tutorial/pegasus-wms/inputdata/montage/0.2degree/2mass-atlas-990502s-j1430080.fits pool="local"
2mass-atlas-990502s-j1440198.fits gsiftp://grid1.ramscommunity.org/nfs/shared-scratch/tutorial/pegasus-wms/inputdata/montage/0.2degree/2mass-atlas-990502s-j1440198.fits pool="local"
2mass-atlas-990502s-j1440186.fits gsiftp://grid1.ramscommunity.org/nfs/shared-scratch/tutorial/pegasus-wms/inputdata/montage/0.2degree/2mass-atlas-990502s-j1440186.fits pool="local"
2mass-atlas-990502s-j1420186.fits gsiftp://grid1.ramscommunity.org/nfs/shared-scratch/tutorial/pegasus-wms/inputdata/montage/0.2degree/2mass-atlas-990502s-j1420186.fits pool="local"
cimages_20080505_143233_14944.tbl gsiftp://grid1.ramscommunity.org/nfs/shared-scratch/tutorial/pegasus-wms/inputdata/montage/0.2degree/cimages.tbl pool="local"
dag_20080505_143233_14944.xml gsiftp://grid1.ramscommunity.org/nfs/shared-scratch/tutorial/pegasus-wms/inputdata/montage/0.2degree/dag.xml pool="local"
region_20080505_143233_14944.hdr gsiftp://grid1.ramscommunity.org/nfs/shared-scratch/tutorial/pegasus-wms/inputdata/montage/0.2degree/region.hdr pool="local"
pimages_20080505_143233_14944.tbl gsiftp://grid1.ramscommunity.org/nfs/shared-scratch/tutorial/pegasus-wms/inputdata/montage/0.2degree/pimages.tbl pool="local"
statfile_20080505_143233_14944.tbl gsiftp://grid1.ramscommunity.org/nfs/shared-scratch/tutorial/pegasus-wms/inputdata/montage/0.2degree/statfile.tbl pool="local"
f.c2 gsiftp://grid1.ramscommunity.org/scratch/yoichi/storage/f.c2 pool="local"
f.c1 gsiftp://grid1.ramscommunity.org/scratch/yoichi/storage/f.c1 pool="local"
f.d gsiftp://grid1.ramscommunity.org/scratch/yoichi/storage/f.d pool="local"
mosaic_20080505_143233_14944_area.fits gsiftp://grid1.ramscommunity.org/scratch/yoichi/storage/mosaic_20080505_143233_14944_area.fits pool="local"
mosaic_20080505_143233_14944.fits gsiftp://grid1.ramscommunity.org/scratch/yoichi/storage/mosaic_20080505_143233_14944.fits pool="local"
shrunken_20080505_143233_14944.fits gsiftp://grid1.ramscommunity.org/scratch/yoichi/storage/shrunken_20080505_143233_14944.fits pool="local"
montage.jpg gsiftp://grid1.ramscommunity.org/scratch/yoichi/storage/montage.jpg pool="local"

Clear the old catalog/rc.data (if it exists) and create a new rc.data from rc.in as follows:

$ rc-client -Dpegasus.user.properties=config/properties list local
$ rc-client -Dpegasus.user.properties=config/properties clear
$ rc-client -Dpegasus.user.properties=config/properties --insert config/rc.in
$ rc-client -Dpegasus.user.properties=config/properties list local

Sites catalog

Next, edit the ./config/sites.txt to reflect changes. Back up the old one and make a new sites.txt with: (Globus is installed on this host at /usr/local/globus. The Pegasus Mapper is installed at /usr/local/pegasus. The Pegasus Worker is accessible for Execute node from /nfs/software/pegasus. GridFTP is accessible for the rams Condor pool from grid2. GridFTP is also installed on grid1.)

site rams{
sysinfo "INTEL32::LINUX"
gridlaunch "/nfs/software/pegasus/2.0.2/bin/kickstart"
workdir "/nfs/shared-scratch/"
gridftp "gsiftp://grid2.ramscommunity.org/nfs/shared-scratch" "4.2.0"
universe transfer "grid2.ramscommunity.org/jobmanager-fork" "4.2.0"
universe vanilla "grid2.ramscommunity.org/jobmanager-condor" "4.2.0"
lrc "rlsn://smarty.isi.edu"
profile  env "GLOBUS_LOCATION" "/usr/local/globus"
profile  env "LD_LIBRARY_PATH" "/usr/local/globus/lib"
profile  env "PEGASUS_HOME" "/nfs/software/pegasus/2.0.2"
profile  pegasus "bundle.stagein" "1"
profile  pegasus "bundle" "1"
}

site local{
sysinfo "INTEL32::LINUX"
gridlaunch "/usr/local/pegasus/2.0.2/bin/kickstart"
workdir "/scratch"
gridftp "gsiftp://grid1.ramscommunity.org/scratch" "4.2.0"
universe transfer "grid1.ramscommunity.org/jobmanager-fork" "4.2.0"
universe vanilla "grid1.ramscommunity.org/jobmanager-condor" "4.2.0"
lrc "rlsn://smarty.isi.edu"
profile  env "GLOBUS_LOCATION" "/usr/local/globus"
profile  env "LD_LIBRARY_PATH" "/usr/local/globus/lib"
profile  env "PEGASUS_HOME" "/usr/local/pegasus/2.0.2"
}

NB: The local host is also a part of the rams Condor pool in this case.

Make sure that the user can read/write to /scratch and /nfs/shared-scratch.

lrc "rlsc://smarty.ramscommunity.org" is about a replica catalog service. It is commented out in the property file, since we are using the SimpleFile catalog. The manual seems to imply that any valid URL may do, but I am not certain. However, it has to be mentioned in sites.txt, or you get an error.

Parse this with sc-client to generate sites.xml.

$ sc-client -f config/sites.txt -o config/sites.xml
2008.11.18 12:37:48.323 EST: [INFO] event.pegasus.sc-client pegasus.version 2.2.0cvs  - STARTED 
2008.11.18 12:37:48.389 EST: [INFO]  Reading config/sites.txt 
2008.11.18 12:37:48.460 EST: [INFO]  Reading config/sites.txt -DONE 
2008.11.18 12:37:48.470 EST: [INFO]  Written out the converted file to config/sites.xml 
2008.11.18 12:37:48.471 EST: [INFO]  Time taken to execute is 0.116 seconds 
2008.11.18 12:37:48.471 EST: [INFO] event.pegasus.sc-client pegasus.version 2.2.0cvs  - FINISHED

Transformation catalog

(Note: These worked with 2.0.1 but they don't work with 2.2.0 we are using now)

Copy the tc.data to tc.data.old

Delete the old tc.data, by: (assuming that you are in pegasus-wms)

$ tc-client -Dpegasus.user.properties=config/properties -q -B
$ tc-client -Dpegasus.user.properties=config/properties -d -BPRELST
2008.09.11 19:18:33.931 EST: [INFO] Deleted the entire tc succesfullytc-client
$ tc-client -Dpegasus.user.properties=config/properties -q -B

Or, simply delete the tc.data file

Create a new config/tc.in with:

local   condor::dagman  /nfs/software/condor/7.0.4/bin/condor_dagman    INSTALLED       INTEL32::LINUX  NULL
local   diamond::analyze:2.0    /nfs/software/pegasus/2.0.2/bin/keg   INSTALLED       INTEL32::LINUX  NULL
local   diamond::findrange:2.0  /nfs/software/pegasus/2.0.2/bin/keg     INSTALLED       INTEL32::LINUX
local   diamond::preprocess:2.0 /nfs/software/pegasus/2.0.2/bin/keg     INSTALLED       INTEL32::LINUX
rams    bin/mDiff       gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mDiff STATIC_BINARY   INTEL32::LINUX  ENV::MONTAGE_HOME="."
rams    bin/mFitplane   gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mFitplane     STATIC_BINARY   INTEL32::LINUX  NULL
rams    mAdd:3.0        gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mAdd  STATIC_BINARY   INTEL32::LINUX  NULL
rams    mBackground:3.0 gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mBackground   STATIC_BINARY   INTEL32::LINUX  NULL
rams    mBgModel:3.0    gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mBgModel      STATIC_BINARY   INTEL32::LINUX  NULL
rams    mConcatFit:3.0  gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mConcatFit    STATIC_BINARY   INTEL32::LINUX  NULL
rams    mDiffFit:3.0    gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mDiffFit      STATIC_BINARY   INTEL32::LINUX  ENV::MONTAGE_HOME="."
rams    mImgtbl:3.0     gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mImgtbl       STATIC_BINARY   INTEL32::LINUX  NULL
rams    mJPEG:3.0       gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mJPEG STATIC_BINARY   INTEL32::LINUX  NULL
rams    mProject:3.0    gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mProjectPP    STATIC_BINARY   INTEL32::LINUX  CONDOR::priority="25"
rams    mProjectPP:3.0  gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mProjectPP    STATIC_BINARY   INTEL32::LINUX  CONDOR::priority="25"
rams    mShrink:3.0     gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mShrink       STATIC_BINARY   INTEL32::LINUX  NULL

Make a new catalog/tc.data file with:

$ tc-client -Dpegasus.user.properties=config/properties -a -B -f config/tc.in
(check it with)
$ tc-client -Dpegasus.user.properties=config/properties -q -B

MyProxy

(# su - yoichi)
($ cd pegasus-wms)

$ myproxy-logon -s grid2
Enter MyProxy pass phrase:
A credential has been received for user yoichi in /tmp/x509up_u500.

GRAM2 gatekeeper

Pegasus requires Globus gatekeeper, which is pre-ws GRAM2.

This has been set up with the GRAM2 section with Globus installation and test.

Submitting a job to rams Condor pool

$ pegasus-plan -Dpegasus.user.properties=`pwd`/config/properties --dir `pwd`/dags \
  --sites rams --output local --nocleanup --force --cluster horizontal \
  --dax `pwd`/dax/montage.dax

2008.11.18 13:08:56.574 EST: [INFO] event.pegasus.planner planner.version 2.2.0cvs  - STARTED 
2008.11.18 13:08:57.447 EST: [INFO] event.pegasus.parse.dax dax.id /home/yoichi/pegasus-wms/dax/montage.dax  - STARTED 
2008.11.18 13:08:57.600 EST: [INFO] event.pegasus.parse.dax dax.id /home/yoichi/pegasus-wms/dax/montage.dax  - FINISHED 
2008.11.18 13:08:57.670 EST: [INFO] event.pegasus.siteselection dax.id montage_0  - STARTED 
2008.11.18 13:08:57.731 EST: [INFO] event.pegasus.siteselection dax.id montage_0  - FINISHED 
2008.11.18 13:08:57.749 EST: [INFO] event.pegasus.cluster dax.id montage_0  - STARTED 
2008.11.18 13:08:57.845 EST: [INFO] event.pegasus.cluster dax.id montage_0  - FINISHED 
2008.11.18 13:08:57.845 EST: [INFO]  Grafting transfer nodes in the workflow 
2008.11.18 13:08:57.846 EST: [INFO] event.pegasus.generate.transfer-nodes dax.id montage_0  - STARTED 
2008.11.18 13:09:02.535 EST: [INFO] event.pegasus.generate.transfer-nodes dax.id montage_0  - FINISHED 
2008.11.18 13:09:02.542 EST: [INFO] event.pegasus.generate.workdir-nodes dax.id montage_0  - STARTED 
2008.11.18 13:09:02.549 EST: [INFO] event.pegasus.generate.workdir-nodes dax.id montage_0  - FINISHED 
2008.11.18 13:09:02.549 EST: [INFO] event.pegasus.generate.cleanup-wf dax.id montage_0  - STARTED 
2008.11.18 13:09:02.552 EST: [INFO] event.pegasus.generate.cleanup-wf dax.id montage_0  - FINISHED 
2008.11.18 13:09:02.565 EST: [INFO]  Generating codes for the concrete workflow 
2008.11.18 13:09:04.024 EST: [INFO]  Generating codes for the concrete workflow -DONE 
2008.11.18 13:09:04.026 EST: [INFO]  Generating code for the cleanup workflow 
2008.11.18 13:09:04.080 EST: [INFO]  Generating code for the cleanup workflow -DONE 
2008.11.18 13:09:04.085 EST: [INFO]  


I have concretized your abstract workflow. The workflow has been entered 
into the workflow database with a state of "planned". The next step is 
to start or execute your workflow. The invocation required is


pegasus-run -Dpegasus.user.properties=/home/yoichi/pegasus-wms/dags/yoichi/pegasus/montage/run0001/pegasus.9270.properties --nodatabase /home/yoichi/pegasus-wms/dags/yoichi/pegasus/montage/run0001

 
2008.11.18 13:09:04.086 EST: [INFO]  Time taken to execute is 7.491 seconds 
2008.11.18 13:09:04.086 EST: [INFO] event.pegasus.planner planner.version 2.2.0cvs  - FINISHED
$ pegasus-run -Dpegasus.user.properties=/home/yoichi/pegasus-wms/dags/yoichi/pegasus/montage/run0001/pegasus.9270.properties --nodatabase /home/yoichi/pegasus-wms/dags/yoichi/pegasus/montage/run0001
Rescued /tmp/montage-09271.log as /tmp/montage-09271.log.000

Checking all your submit files for log file names.
This might take a while... 
Done.
-----------------------------------------------------------------------
File for submitting this DAG to Condor           : montage-0.dag.condor.sub
Log of DAGMan debugging messages                 : montage-0.dag.dagman.out
Log of Condor library output                     : montage-0.dag.lib.out
Log of Condor library error messages             : montage-0.dag.lib.err
Log of the life of condor_dagman itself          : montage-0.dag.dagman.log

Condor Log file for all jobs of this DAG         : /tmp/montage-09271.log
-no_submit given, not submitting DAG to Condor.  You can do this with:
"condor_submit montage-0.dag.condor.sub"
-----------------------------------------------------------------------
Submitting job(s).
Logging submit event(s).
1 job(s) submitted to cluster 6.

I have started your workflow, committed it to DAGMan, and updated its
state in the work database. A separate daemon was started to collect
information about the progress of the workflow. The job state will soon
be visible. Your workflow runs in base directory. 

cd /home/yoichi/pegasus-wms/dags/yoichi/pegasus/montage/run0001

*** To monitor the workflow you can run ***

pegasus-status -w montage-0 -t 20081118T130856+1100 
or
pegasus-status /home/yoichi/pegasus-wms/dags/yoichi/pegasus/montage/run0001

*** To remove your workflow run ***

pegasus-remove -d 6.0
or
pegasus-remove /home/yoichi/pegasus-wms/dags/yoichi/pegasus/montage/run0001
$ pegasus-status /home/yoichi/pegasus-wms/dags/yoichi/pegasus/montage/run0001


-- Submitter: grid1.ramscommunity.org : <137.111.246.175:9650> : grid1.ramscommunity.org
 ID      OWNER/NODENAME   SUBMITTED     RUN_TIME ST PRI SIZE CMD               
   6.0   yoichi         11/18 13:09   0+00:01:04 R  0   4.9  condor_dagman -f -
   7.0    |-montage_0_r 11/18 13:10   0+00:00:07 R  10  0.1  kickstart -n pegas


$ condor_status

Name               OpSys      Arch   State     Activity LoadAv Mem   ActvtyTime

grid1.ramscommunit LINUX      INTEL  Owner     Idle     0.060   249  0+00:50:05
grid4.ramscommunit LINUX      INTEL  Unclaimed Idle     0.000   503  0+00:55:04

                     Total Owner Claimed Unclaimed Matched Preempting Backfill

         INTEL/LINUX     2     1       0         1       0          0        0

               Total     2     1       0         1       0          0        0




$ pegasus-status /home/yoichi/pegasus-wms/dags/yoichi/pegasus/montage/run0001


-- Submitter: grid1.ramscommunity.org : <137.111.246.175:9650> : grid1.ramscommunity.org
 ID      OWNER/NODENAME   SUBMITTED     RUN_TIME ST PRI SIZE CMD

CVS 2.2.0 nightly-build problem

The CVS 2.2.0 nightly-build had some problem.

$ pegasus-version -f
2.2.0cvs-x86_cent_4.2-20081016052128Z

$ tc-client -Dpegasus.user.properties=config/properties -q -B

Exception in thread "main" java.util.EmptyStackException
	at java.util.Stack.peek(Stack.java:85)
	at edu.isi.pegasus.common.logging.format.AbstractLogFormatter.add(AbstractLogFormatter.java:146)
	at edu.isi.pegasus.common.logging.format.Simple.add(Simple.java:52)
	at edu.isi.pegasus.common.logging.LogManager.log(LogManager.java:411)
	at org.griphyn.common.catalog.toolkit.TCClient.executeCommand(TCClient.java:247)
	at org.griphyn.common.catalog.toolkit.TCClient.main(TCClient.java:381)

This may be contributing to some problems.

Although this error was fixed, it is not yet available as a binary package apparently and I was advised to build from the SVN source.

SVN is not installed, so first it must be installed.

# yum install subversion.i386
...
...
Installed: subversion.i386 0:1.4.2-2.el5
Complete!

Checking out the latest trunk:

# svn checkout https://smarty.isi.edu/svn/repo1/pegasus/trunk pegasus
Error validating server certificate for 'https://smarty.isi.edu:443':
 - The certificate is not issued by a trusted authority. Use the
   fingerprint to validate the certificate manually!
Certificate information:
 - Hostname: smarty.isi.edu
 - Valid: from Sep  2 23:58:13 2008 GMT until Sep  2 23:58:13 2009 GMT
 - Issuer: Certificate Authorities, DOEGrids, org
 - Fingerprint: bc:2f:1c:49:44:13:36:71:02:e0:65:e5:79:17:5d:dd:c5:5c:ab:cd
(R)eject, accept (t)emporarily or accept (p)ermanently? p
...
...
Checked out revision 1059.

Building:

# unset CLASSPATH
# export PEGASUS_HOME=/root/pegasus
# source $PEGASUS_HOME/setup-devel.sh
# cd $PEGASUS_HOME
# ant dist-worker dist
# cd dist
# ls -l
total 30272
drwxr-xr-x 14 root root     4096 Nov 18 18:40 pegasus-2.2.0cvs
-rw-r--r--  1 root root 15466833 Nov 18 18:41 pegasus-binary-2.2.0cvs-x86_cent_5.2.tar.gz
-rw-r--r--  1 root root 15466848 Nov 18 18:44 pegasus-worker-2.2.0cvs-x86_cent_5.2.tar.gz

Replacing the Pagasus-Mapper:

# rm -rf pegasus-2.2.0cvs
# tar -xzvf pegasus-binary-2.2.0cvs-x86_cent_5.2.tar.gz
...
...
# rm -rf /usr/local/pegasus/2.2.0/*
# mv pegasus-2.2.0cvs/* /usr/local/pegasus/2.2.0
# ls -l /usr/local/pegasus/2.2.0
total 184
drwxr-xr-x  2 root root  4096 Nov 18 18:40 bin
drwxr-xr-x 10 root root  4096 Nov 18 18:40 contrib
drwxr-xr-x  5 root root  4096 Nov 18 18:40 doc
drwxr-xr-x  2 root root  4096 Nov 18 18:40 etc
drwxr-xr-x  2 root root  4096 Nov 18 18:40 examples
-rw-r--r--  1 root root  5096 Nov 18 18:31 GTPL
drwxr-xr-x  3 root root  4096 Nov 18 18:40 lib
drwxr-xr-x  2 root root  4096 Nov 18 18:40 libexec
-rw-r--r--  1 root root 11368 Nov 18 18:31 LICENSE
drwxr-xr-x  3 root root  4096 Nov 18 18:39 man
-rw-r--r--  1 root root  4106 Nov 18 18:31 MIGRATING_FROM_VDS
-rw-r--r--  1 root root  2714 Nov 18 18:31 README
-rw-r--r--  1 root root 12175 Nov 18 18:31 RELEASE_NOTES
-rwxr-xr-x  1 root root  2212 Nov 18 18:31 setup.csh
-rwxr-xr-x  1 root root  2217 Nov 18 18:31 setup.sh
drwxr-xr-x  2 root root  4096 Nov 18 18:40 share
drwxr-xr-x  2 root root  4096 Nov 18 18:40 sql
-rw-r--r--  1 root root    29 Nov 18 18:38 stamp
drwxr-xr-x  7 root root  4096 Nov 18 18:40 test
drwxr-xr-x  2 root root  4096 Nov 18 18:40 var

Replacing the Pegasus-Worker:

# tar -xzvf pegasus-worker-2.2.0cvs-x86_cent_5.2.tar.gz
pegasus-2.2.0cvs/
...
...
# rm -rf /nfs/software/pegasus/2.2.0/*
# ls -l /nfs/software/pegasus/2.2.0/
total 0

# mv pegasus-2.2.0cvs/* /nfs/software/pegasus/2.2.0/
# ls -l /nfs/software/pegasus/2.2.0/
total 184
drwxr-xr-x  2 root root  4096 Nov 18 18:43 bin
drwxr-xr-x 10 root root  4096 Nov 18 18:40 contrib
drwxr-xr-x  5 root root  4096 Nov 18 18:40 doc
drwxr-xr-x  2 root root  4096 Nov 18 18:40 etc
drwxr-xr-x  2 root root  4096 Nov 18 18:40 examples
-rw-r--r--  1 root root  5096 Nov 18 18:31 GTPL
drwxr-xr-x  3 root root  4096 Nov 18 18:40 lib
drwxr-xr-x  2 root root  4096 Nov 18 18:40 libexec
-rw-r--r--  1 root root 11368 Nov 18 18:31 LICENSE
drwxr-xr-x  3 root root  4096 Nov 18 18:39 man
-rw-r--r--  1 root root  4106 Nov 18 18:31 MIGRATING_FROM_VDS
-rw-r--r--  1 root root  2714 Nov 18 18:31 README
-rw-r--r--  1 root root 12175 Nov 18 18:31 RELEASE_NOTES
-rwxr-xr-x  1 root root  2212 Nov 18 18:31 setup.csh
-rwxr-xr-x  1 root root  2217 Nov 18 18:31 setup.sh
drwxr-xr-x  2 root root  4096 Nov 18 18:40 share
drwxr-xr-x  2 root root  4096 Nov 18 18:40 sql
-rw-r--r--  1 root root    29 Nov 18 18:43 stamp
drwxr-xr-x  7 root root  4096 Nov 18 18:40 test
drwxr-xr-x  2 root root  4096 Nov 18 18:40 var

Test

# su - yoichi
$ cd pegasus-wms
$ tc-client -Dpegasus.user.properties=config/properties -q -B

#RESID     LTX                        PTX                                                                         TYPE              SYSINFO             PROFILES

local    condor::dagman             /nfs/software/condor/7.0.4/bin/condor_dagman                                 INSTALLED        INTEL32::LINUX    NULL
local    diamond::analyze:2.0       /nfs/software/pegasus/2.2.0/bin/keg                                          INSTALLED        INTEL32::LINUX    NULL
local    diamond::findrange:2.0     /nfs/software/pegasus/2.2.0/bin/keg                                          INSTALLED        INTEL32::LINUX    NULL
local    diamond::preprocess:2.0    /nfs/software/pegasus/2.2.0/bin/keg                                          INSTALLED        INTEL32::LINUX    NULL
rams     bin/mDiff                  gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mDiff          STATIC_BINARY    INTEL32::LINUX    ENV::MONTAGE_HOME="."
rams     bin/mFitplane              gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mFitplane      STATIC_BINARY    INTEL32::LINUX    NULL
rams     mAdd:3.0                   gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mAdd           STATIC_BINARY    INTEL32::LINUX    NULL
rams     mBackground:3.0            gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mBackground    STATIC_BINARY    INTEL32::LINUX    NULL
rams     mBgModel:3.0               gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mBgModel       STATIC_BINARY    INTEL32::LINUX    NULL
rams     mConcatFit:3.0             gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mConcatFit     STATIC_BINARY    INTEL32::LINUX    NULL
rams     mDiffFit:3.0               gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mDiffFit       STATIC_BINARY    INTEL32::LINUX    ENV::MONTAGE_HOME="."
rams     mImgtbl:3.0                gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mImgtbl        STATIC_BINARY    INTEL32::LINUX    NULL
rams     mJPEG:3.0                  gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mJPEG          STATIC_BINARY    INTEL32::LINUX    NULL
rams     mProject:3.0               gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mProjectPP     STATIC_BINARY    INTEL32::LINUX    CONDOR::priority="25"
rams     mProjectPP:3.0             gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mProjectPP     STATIC_BINARY    INTEL32::LINUX    CONDOR::priority="25"
rams     mShrink:3.0                gsiftp://grid2.ramscommunity.org/nfs/software/montage/3.0/bin/mShrink        STATIC_BINARY    INTEL32::LINUX    NULL
Enter labels to add to this page:
Please wait 
Looking for a label? Just start typing.