Task 15589491

Name hadcm3n_4k8a_1940_40_008303818_0
Workunit 8454953
Created 7 Feb 2013, 0:15:10 UTC
Sent 7 Feb 2013, 0:15:19 UTC
Report deadline 9 May 2013, 7:42:30 UTC
Received 20 Feb 2013, 9:22:28 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1105487
Run time 4 days 15 hours 35 min
CPU time 4 days 10 hours 43 min 36 sec
Validate state Invalid
Credit 2,488.32
Device peak FLOPS 3.05 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
06:57:30 (3936): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:32:03 (5556): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:32:04 (5556): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
17:27:14 (3900): No heartbeat from core client for 30 sec - exiting
17:27:15 (3900): No heartbeat from core client for 30 sec - exiting
17:27:16 (3900): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
04:07:16 (4744): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:07:17 (4744): No heartbeat from core client for 30 sec - exiting
04:07:18 (4744): No heartbeat from core client for 30 sec - exiting
04:07:19 (4744): No heartbeat from core client for 30 sec - exiting
04:07:20 (4744): No heartbeat from core client for 30 sec - exiting
04:07:21 (4744): No heartbeat from core client for 30 sec - exiting
04:07:22 (4744): No heartbeat from core client for 30 sec - exiting
04:07:23 (4744): No heartbeat from core client for 30 sec - exiting
04:07:24 (4744): No heartbeat from core client for 30 sec - exiting
04:07:25 (4744): No heartbeat from core client for 30 sec - exiting
04:07:26 (4744): No heartbeat from core client for 30 sec - exiting
07:46:28 (3704): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5800, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5800, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5800, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5800, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5800, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5800, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
20 Feb 2013 01:46:41  1105487  15589491   hadcm3n_4k8a_1940_40_008303818_0  207,360   362,674         1.7490
19 Feb 2013 08:38:14  1105487  15589491   hadcm3n_4k8a_1940_40_008303818_0  181,440   318,664         1.7563
18 Feb 2013 19:42:14  1105487  15589491   hadcm3n_4k8a_1940_40_008303818_0  155,520   272,564         1.7526
18 Feb 2013 04:42:25  1105487  15589491   hadcm3n_4k8a_1940_40_008303818_0  129,600   227,067         1.7521
17 Feb 2013 15:10:33  1105487  15589491   hadcm3n_4k8a_1940_40_008303818_0  103,680   180,634         1.7422
17 Feb 2013 02:20:43  1105487  15589491   hadcm3n_4k8a_1940_40_008303818_0  77,760    135,462         1.7421
16 Feb 2013 07:33:41  1105487  15589491   hadcm3n_4k8a_1940_40_008303818_0  51,840    90,353          1.7429
15 Feb 2013 16:00:46  1105487  15589491   hadcm3n_4k8a_1940_40_008303818_0  25,920    45,469          1.7542
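
The Average (sec/TS) values in the trickle rows above appear to be the cumulative CPU Time (sec) divided by the Timestep count, rounded to four decimal places. This is inferred from the listed figures, not taken from project documentation. A minimal Python sketch of that arithmetic follows; the function name `average_sec_per_timestep` is hypothetical and the input pairs are copied from two of the rows.

```python
def average_sec_per_timestep(cpu_seconds: int, timesteps: int) -> float:
    """Hypothetical helper: CPU seconds spent per model timestep, rounded to 4 places.

    Assumes the page's Average column is cumulative CPU time / cumulative timesteps.
    """
    return round(cpu_seconds / timesteps, 4)


# (timesteps, CPU time in seconds) pairs copied from the trickle rows
trickles = [
    (207_360, 362_674),  # 20 Feb 2013 01:46:41
    (25_920, 45_469),    # 15 Feb 2013 16:00:46
]

for timesteps, cpu_seconds in trickles:
    print(timesteps, average_sec_per_timestep(cpu_seconds, timesteps))
```

Python prints floats without trailing zeros, so a value shown on the page with four decimals may print with fewer digits.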

