Bugs fixed in SGE 5.3p7 since release 5.3p6 ------------------------------------------- Issue Sun BugId Description ------- ---------- --------------------------------------------------------------------------------- 942 6370485 stale resource_unknown_list 538 6370481 increase PATH size limit of 2048 characters 1958 6370003 long lines in accounting entries break qacct 6366691 utilbin/rsh can be used to gain root access 1840 6340741 scheduler dies when array jobs are submitted with -now y 1778 6318659 sge_ca -usercert fails when executed more than once 1625 6252525 qmon: complex attributes not removeable 1541 6250603 qmon crash (segmentation fault) on Solaris64 1515 6232120 Certificate renewal in CSP mode broken 1403 6218430 Problems with load values if execution daemons run in a solaris zone 1288 6198937 Core dump in `qmon`Xpm.c`CreateImage running qmon on E25K (using remote DISPLAY) 1302 6185208 qmon and equal job arguments 1535 5086193 load.sh fails on a machine when uptime displays time for less than an hour 1012 5040728 Job error state broken Bugs fixed in SGE 5.3p6 since release 5.3p5 ------------------------------------------- Issue Sun BugId Description ------- ---------- --------------------------------------------------------------------------------- 4753766 Various fixes in man pages 551 4969825 not supported array task dependencies are not rejected 5006354 sge_ca script will not create user certs when SGE_CELL (default) is not set 871 5018669 qrsh/qlogin: "Connection refused" due to race condition in shepherd 632 5018695 loadsensor doing output to stderr can block 599 5018726 qalter lacks -dl option! 600 5018733 Empty parameters crashes qstat and qhost 919 5018757 HPCT jobs may fail - add variable to job environment which point to SGE binaries 5018884 SSL vulnerabilities stated in Sun Alert 57524 835 5019497 modification of loadsensor causes crash of execd 820 5019584 SGEEE: scheduler may crash on Linux AMD64 in functional ticket calculation 709 5019595 Dateformat YYMMDDhhmm was interpreted wrong (qacct, qsub, qalter,...) 635 5019601 "vmem" in qstat -j keeps the max value 771 5019624 qselect/qstat -l selection wrongly considers load and utilization 648 5019635 schedd_job_info=true causes large delays with parallel job scheduling 646 5020131 renaming a user deletes the user 607 5020134 qhost output broken for global consumables 5020139 a stored job template in qmon sets -hold_jid to a wrong value 5020141 qsh and qlogin accepted the options -h and -hold_jid and ignored them later 5020143 qdel XXX.YY- will delete the first array task of job XXX 637 5020153 mail bomb upon abort with tightly integrated par jobs 314 5020278 a colon in a job name breaks qacct 5020282 some Linux distributions are not recognized by SGE 931 5020371 sge_shepherd creates world writable files 5021405 CSP reconnect problem of scheduler and execd 5022074 qmon displays wrong numerical values on AMD Linux in spinbox widgets Bugs fixed in SGE 5.3p5 since release 5.3p4 ------------------------------------------- Issue Sun BugId Description ------- ---------- --------------------------------------------------------------------------------- 628 4923403 When installing using -csp the sge_ssl* files are not up to date 627 4958080 suspend and resume with HPC Cluster tools 5 not properly working 626 4957760 Fix needed for CERT CA-2003-26 Multiple Vulnerabilities in SSL/TLS 619 4952767 qrsh -notify doesn't work 617 4952236 Broken mail option with SGE 5.3p4 qrsh 597 infotext ignores parameters for HP10, HP11 systems 592 4930789 An overwritten string attribut was ignored in the scheduler 589 4930786 global load values are ignored 4949917 qmon seg faults with a user hold job from qtcsh qtask file 4930793 minor issues with the sgeee ticket update interval Fixed problem reading in qmaster_param halflife_decay_list Bugs fixed in SGE 5.3p4 since release 5.3p3 ------------------------------------------- Issue Sun BugId Description ------- ---------- --------------------------------------------------------------------------------- 415 4775325 qrong qstat -j diagnosis message wrongly indicates not enough PE slots 469,489 4815795 qstat -alarm was broken 485 4813965 Reject tightly integrated array jobs (not supported in SGE 5.3) 492 4819479 qhost -q -l arch=xx crashes if exec host is down 499 4822742 deleted sharetree showed up again after qmaster restart 501 4822746 spooling integrity improvement was missing for sharetree, projects and users 502 4824104 SGEEE: allow qalter -p for running jobs 503 overriding queue limits with job request limits incomplete for NEC 504 4838650 Multiple job script writes for array jobs caused ETXTBSY error 506 4847819 sge->sgeee upgrade failed 508 4833346 qsh/qrsh/qlogin might core with segementation violation 509 4841414 Unable to delete task array job with negative increment 510 4835832 NOTIFY_SUSP signal only sent for first suspension of job 511 4838595 maxujobs does not count suspended or held jobs 513 qrsh (rshd) does not set limits after setting the osjobid (affects e.g. SUPER-UX/SX) 514 4838549 maxujobs scheduler config functionality is broken 515 4838636 sharetree can't be modified 521 4869772 resource limits not correctly set for Linux x86 522 4844838 sge_shepherd does not exit on SIGTERM (e.g. on "rcsge stop") 523 4842844 commd performance, delay for first job start on machines with many job slots 524 4842878 qdel -u does not delete all jobs of the user 526 4845505 cannot qalter/qhold/qrls several tasks of same job 531 4847802 scheduler crashes on Solaris(x86) due to compiler bug 532 4847814 SGEEE: scheduler dispatches wrong jobs when jobs were in error state 540 enabled output file names to have a ":" inside for qsub 541 qlogin/qlogind whitespace checking removed to allow ssh/sshd with optional arguments 544 4886017 qstat crashed, when used with the options -r -s z 545 4859658 SGE: scheduler crashed when job priorities where changed with qalter 547 4860391 qmake dumped core when starting recursive make calls 549 4851939 unexpected output for qstat -j and qmon "Why?" 555 4886025 queue name was truncated to 10 chars 557 4866711 SGE_O_* variables for a task within a tightly integrated job set incorrect 560 4869784 qmon "Qmon" resource file contains syntax errors 568 Allow "-" sign as first character of load_formula 569 Impossible to disable qmaster profiling 572 4883714 SGEEE: qmaster crashed on qdel of tightly integrated PE job with usage_scaling activated 575 4881949 Slave tasks of tightly integrated PE jobs not killed when h_rt limit was exceeded 576 4886026 max_u_jobs could count rejected jobs as submitted 577 4885719 disabled infotext error output if SGE_ROOT is not set 578 4885906 NSLOTS and NHOSTS incorrectly set in environment of tightly integrated tasks 579 4885930 failure of master task of a tightly integrated PE job did not delete the job 580 *_RESERVED_USAGE gives wrong values for PE jobs 174 Checkpointed job was migrated on bad exit status 4749151 sge_ca script -usercert needs RANDFILE env var 4822799 cannot install on Solaris 10 4876169 An incomplete resource request (-l =1) caused qmaster to crash 4893432 upgrade to openssl 0.9.7a Bugs fixed in SGE 5.3p3 since release 5.3p2 ------------------------------------------- Issue Sun BugId Description ------- ---------- --------------------------------------------------------------------------------- 126 4780316 race condition if signals are to be delivered in job's startup phase 264 array job parameters 314 4713013 qacct may display incorrect accounting information 380 4818741 startup failure of qrsh job is reported as regular job exit 391 4756556 .cshrc error causes [pro|epi]log,pe-[start|stop] failure 392 4756557 non-resolvable hosts in host_aliases file cause wrong hostname resolving 393 4760981 Empty sge_request file causes submission error 398 4816541 no newline character at end of sge_aliases file may crash qsub 402 4778762 Array jobs which contain only one task (id=1) will be handled as single job 403 4769608 qalter shows wrong priority number when using negative priorities with -p option 416 4776016 execd does res. consuming process tracking even if no job is to be controlled 417 4776821 qtcsh can't be used as normal tcsh 418 4776754 complex values for user defined complexes are rejected with global host 423 4778757 stepsize 0 in array job specification results in qmaster exception 424 4778758 memcpy leak in execd 430 4795475 qstat -f output broken for pe jobs on same queue 432 4787598 schedd_job_info messages shown by qstat -j even if it is set to false 435 4790540 sge_schedd process consumes more memory than needed if schedd_job_info=true 436 4790547 Job notification signals won't be delivered if user redefines suspend_method ... 437 4790592 conflicting policies can cause job being started and immediately suspended 438 4791238 SGE may create duplicate accounting entries for parallel jobs 439 4791908 job logging file exists but is empty in certain configurations 440 4792036 job arguments larger than 10k crash qmaster 441 4787623 failover to shadow master leaves sge_schedd on the original qmaster host 446 4794242 wrong usage reported by qstat -j 449 4802831 cannot set -C to null string as described in man qsub 450 4805423 STRING complex attribute handling with RELOP "!=" is broken 455 4802171 qacct -l selection broken 472 4807677 qrsh crash when command line arguments are longer than 4K 478 4813188 qstat -r shows wrong dependencies 486 scripts/util/arch: backed out hp11-64 and aix51 detection since no port available yet 487 4815774 Uninitialized pointer cause segmentation fault in qsh/qrsh on submit only hosts 490 4816529 qmon crash when pressing Why for a list of selected jobs 493 4818737 SGEEE: huge scheduling times if maxujobs is set 4811230 qconf -Muser and qconf -Auser report no success messages 496 Functional policy pending ticket calculations are not intuitive Bugs fixed in SGE 5.3p2 since release 5.3p1 ------------------------------------------- Issue Sun BugId Description --------------------------------------------------------------------------------------------------- 214 4658716 protocol doing termin. on failure for tightly integr. par. job could be leaner 325 4716824 qlogin and qrsh accept unsupported options 328 4718880 qsub/qalter -l ... might select wrong resource. 4719218 "Job Submission" GUI: blank text in pop out window 331 4719755 wrong port output in qstat error message when qmaster not reachable 4721129 A misoperation in host configuration on qmon leads to qmaster daemon's death 4721134 qmon gets shutdown with the message "Segmentation Fault" in terminal 4721181 Wrong path for qmaster messages file in exe host installation 4722060 CLI: invalid option "-jid jid" for qconf in qconf -help 4723543 Too small panes and cells to display some item names 346 4727457 Proper $SGE_CELL setting fails with non default cell in settings.sh 345 4727515 maxujobs prevents dispatching even if job limit is not reached 4728293 qmon gets shutdown with a word "global" in Cluster Configuration 355 4731288 qmon cluster config dialog does not show gid_range in SGE product mode 356 4731347 can configure fshare/otickets in acls of type DEPT 4731348 access_list(5) man page does not describe file format for SGEEE ACL's 370 4732031 OK without hostname in host_configuration kill sge_qmaster 359 4732545 gethostbyaddr in ~/solaris64 doesn't work. 4733043 qmon dumps core when mouse over an interactive job in Job Control window 4733089 qmon dies after checking 'transfer' in 'queue control' window 4733092 requirement for installation process of exec host with non root user 360 4733859 Userset "defaultdepartment" accepts users in CLI 4735258 CLI: Wrong info for usage 363 4735972 scheduler crashes if all subnodes of a node have 0 shares in sharetree 371 4739596 rlim_fd_max > 1024 can cause 32 bit daemons to crash at startup phase 350 4740335 qmon dies with changes in Edit Tickets on Solaris64. 313 4740350 problems with destin_id_list syntax 333 4740407 SGEEE: Insufficient checking for PTF_{MIN,MAX}_PRIORITY in execd_params 330 4740578 load formula of the scheduler 354 4741230 qmod help output is incomplete 350 4742082 Calculation failure in Functional policy wit hvery large numbers 219 4742133 rcsge startup script has hardwired execd_spool_dir for "stop" procedure 374 4742189 schedd_job_info = true causes immense daemons memory growth in large systems 342 4744523 no error message for interactive job start failure due to wrong DISPLAY settings 325 4745387 qsh, qrsh and qlogin silently ignore options -ac, -dc, -sc and -w 335 4745399 qmake without any information about parallel execution fails 327 4745404 qmake does an incorrect resource request if ARCH is an empty string 379 4747829 accounting record about qrsh termination incomplete 302 4753668 prevent deletion of still referenced objects 308 4753669 qconf gets commd timeout 4753766 Various fixes for 5.3p2 in man pages 4754435 OpenSSL 0.9.6c security vulnerability 389 4755931 possible file access problems on 64-bit file system with 32-bit binaries 383 Fixed restart_command for checkpoint jobs linux clients use port 65535 by error if COMMD_PORT/sge_commd service not set Exiting sgeee jobs can be shown in qmon dialog dialog for pending jobs Documentation update for loose Sun HPC cluster tools integration Bugs fixed in SGE 5.3p1 since release 5.3 ----------------------------------------- Issue Sun BugId Description --------------------------------------------------------------------------------------------------- 182 allow arguments for *_command and *_daemon parameters in cluster config 215 4673738 disallow "none" load formula and do additional tests for the correctness 223 4700286 complex default value not considered for load/suspend thresholds 226 4665780 qmaster error message during startup: global configuration doesn't exist - creat 229 4668148 solaris psets won't be used by SGE 231 4697491 signal notification can prevent delivery of actual suspend/termination signal 232 4708239 Allow specification of arguments to [rsh|rlogin|qlogin]_[daemon|command] 236 4670664 parallel jobs (e.g. qmake) fail 237 4670669 error message "can't set additional group id for job" for interactive jobs 239 4675410 queue suspend threshold alarm nsuspend>1 does not susp. multiple jobs 240 4675416 use of suspend thresholds can break user sort policy and cause logging 243 4676340 memory leak in sge_schedd 247 4696768 SGE(EE) allows to submit binary job scripts 250 4682966 qsh(1) ignores -S in sge_request(5) files 251 4683852 qalter on running jobs can confuse consumable mgmt 259 4686157 qhost -j is broken 271 4692957 non-privileged users can submit jobs with priority higher than 0 272 4708235 SGE should allow to start qrsh jobs when /etc/nologin exits 275 4706929 qmon does not display job predecessors in job control 284 4699665 qstat resource based job selection broken 287 4701640 problems launching jobs from qtcsh with "&" 299 4677087 execd could crash when executing tightly integrated parallel jobs 311 4712023 global load values can prevent dispatching of jobs 314 4713013 qacct may display incorrect accounting information