climateprediction.net home page
Task 11014242

Task 11014242

Name hadsm3dhet2_jow1_006595251_4
Workunit 6798624
Created 15 Mar 2010, 12:01:03 UTC
Sent 4 Oct 2010, 15:38:12 UTC
Report deadline 16 Sep 2011, 20:58:12 UTC
Received 27 Oct 2010, 12:14:24 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 938547
Run time
CPU time 2 days 21 hours 15 min 22 sec
Validate state Invalid
Credit 1,389.41
Device peak FLOPS 1.90 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.4.5</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=796, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3508, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5632, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5560, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5200, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2752, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2752, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2752, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2752, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2752, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2752, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
27 Oct 2010 12:16:59 938547 11014242 hadsm3dhet2_jow1_006595251_4 151,228 249,390 1.6491
17 Oct 2010 21:05:23 938547 11014242 hadsm3dhet2_jow1_006595251_4 140,426 231,007 1.6450
17 Oct 2010 14:22:29 938547 11014242 hadsm3dhet2_jow1_006595251_4 129,624 212,807 1.6417
17 Oct 2010 07:39:07 938547 11014242 hadsm3dhet2_jow1_006595251_4 118,822 194,673 1.6384
17 Oct 2010 02:53:48 938547 11014242 hadsm3dhet2_jow1_006595251_4 108,020 176,387 1.6329
16 Oct 2010 17:20:03 938547 11014242 hadsm3dhet2_jow1_006595251_4 97,218 158,279 1.6281
16 Oct 2010 10:28:51 938547 11014242 hadsm3dhet2_jow1_006595251_4 86,416 140,235 1.6228
15 Oct 2010 23:24:35 938547 11014242 hadsm3dhet2_jow1_006595251_4 75,614 122,306 1.6175
15 Oct 2010 17:12:34 938547 11014242 hadsm3dhet2_jow1_006595251_4 64,812 104,288 1.6091
15 Oct 2010 11:10:43 938547 11014242 hadsm3dhet2_jow1_006595251_4 54,010 86,286 1.5976
12 Oct 2010 23:57:30 938547 11014242 hadsm3dhet2_jow1_006595251_4 43,208 68,852 1.5935
12 Oct 2010 13:24:52 938547 11014242 hadsm3dhet2_jow1_006595251_4 32,406 52,074 1.6069
06 Oct 2010 03:39:11 938547 11014242 hadsm3dhet2_jow1_006595251_4 21,604 34,870 1.6141
05 Oct 2010 04:10:51 938547 11014242 hadsm3dhet2_jow1_006595251_4 10,802 17,886 1.6558


©2024 climateprediction.net