climateprediction.net home page
Task 11324940

Task 11324940

Name hadsm3dhet2_kcv0_006626318_6
Workunit 6829691
Created 15 Mar 2010, 12:41:25 UTC
Sent 30 Mar 2010, 13:00:52 UTC
Report deadline 12 Mar 2011, 18:20:52 UTC
Received 13 Apr 2010, 17:55:19 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 938547
Run time
CPU time 9 days 15 hours 5 min 32 sec
Validate state Invalid
Credit 4,366.71
Device peak FLOPS 0.00 GFLOPS
Application version ---
Stderr
<core_client_version>6.4.5</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5632, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5492, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2136, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1608, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1608, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
MainError:	06:25:14 PM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6160, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3924, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1772, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1772, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1772, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1772, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1772, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1772, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
13 Apr 2010 00:17:53 938547 11324940 hadsm3dhet2_kcv0_006626318_6 216,040 818,934 1.7230
12 Apr 2010 18:04:40 938547 11324940 hadsm3dhet2_kcv0_006626318_6 205,238 800,355 1.7231
12 Apr 2010 12:00:03 938547 11324940 hadsm3dhet2_kcv0_006626318_6 194,436 781,603 1.7228
12 Apr 2010 05:55:33 938547 11324940 hadsm3dhet2_kcv0_006626318_6 183,634 762,991 1.7228
11 Apr 2010 23:52:15 938547 11324940 hadsm3dhet2_kcv0_006626318_6 172,832 744,409 1.7228
11 Apr 2010 17:36:45 938547 11324940 hadsm3dhet2_kcv0_006626318_6 162,030 725,628 1.7224
11 Apr 2010 11:17:58 938547 11324940 hadsm3dhet2_kcv0_006626318_6 151,228 706,939 1.7222
11 Apr 2010 04:59:37 938547 11324940 hadsm3dhet2_kcv0_006626318_6 140,426 688,178 1.7218
10 Apr 2010 22:42:58 938547 11324940 hadsm3dhet2_kcv0_006626318_6 129,624 669,374 1.7213
10 Apr 2010 16:16:22 938547 11324940 hadsm3dhet2_kcv0_006626318_6 118,822 650,613 1.7209
10 Apr 2010 09:59:23 938547 11324940 hadsm3dhet2_kcv0_006626318_6 108,020 631,793 1.7203
10 Apr 2010 03:37:49 938547 11324940 hadsm3dhet2_kcv0_006626318_6 97,218 613,020 1.7197
09 Apr 2010 21:25:27 938547 11324940 hadsm3dhet2_kcv0_006626318_6 86,416 594,356 1.7195
09 Apr 2010 15:20:48 938547 11324940 hadsm3dhet2_kcv0_006626318_6 75,614 575,632 1.7190
09 Apr 2010 09:10:36 938547 11324940 hadsm3dhet2_kcv0_006626318_6 64,812 556,951 1.7187
09 Apr 2010 02:14:35 938547 11324940 hadsm3dhet2_kcv0_006626318_6 54,010 538,414 1.7188
08 Apr 2010 20:02:44 938547 11324940 hadsm3dhet2_kcv0_006626318_6 43,208 519,799 1.7186
08 Apr 2010 13:47:11 938547 11324940 hadsm3dhet2_kcv0_006626318_6 32,406 501,095 1.7181
08 Apr 2010 07:34:58 938547 11324940 hadsm3dhet2_kcv0_006626318_6 21,604 482,243 1.7171
08 Apr 2010 01:10:00 938547 11324940 hadsm3dhet2_kcv0_006626318_6 10,802 463,532 1.7165


©2024 climateprediction.net