climateprediction.net home page
Task 14649417

Task 14649417

Name hadcm3n_203y_1940_40_007957625_1
Workunit 8112737
Created 9 May 2012, 14:29:42 UTC
Sent 10 May 2012, 21:46:33 UTC
Report deadline 10 Aug 2012, 5:13:44 UTC
Received 30 May 2012, 21:17:48 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1056144
Run time 5 days 10 hours 17 min 12 sec
CPU time 4 days 16 hours 34 min 39 sec
Validate state Invalid
Credit 2,799.36
Device peak FLOPS 3.01 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2604, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
17:16:33 (3748): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:16:35 (3748): No heartbeat from core client for 30 sec - exiting
17:16:36 (3748): No heartbeat from core client for 30 sec - exiting
17:16:37 (3748): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:38:25 (5056): No heartbeat from core client for 30 sec - exiting
08:38:26 (5056): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:38:27 (5056): No heartbeat from core client for 30 sec - exiting
08:38:29 (5056): No heartbeat from core client for 30 sec - exiting
08:38:30 (5056): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:03:19 (4280): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5520, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5520, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5520, iMonCtr=1
Model crash detected, will try to restart...
15:35:31 (4256): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4528, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4528, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4528, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
30 May 2012 01:08:00 1056144 14649417 hadcm3n_203y_1940_40_007957625_1 233,280 389,936 1.6715
28 May 2012 20:02:47 1056144 14649417 hadcm3n_203y_1940_40_007957625_1 207,360 346,799 1.6724
27 May 2012 19:12:16 1056144 14649417 hadcm3n_203y_1940_40_007957625_1 181,440 304,114 1.6761
26 May 2012 15:09:52 1056144 14649417 hadcm3n_203y_1940_40_007957625_1 155,520 259,173 1.6665
24 May 2012 22:49:46 1056144 14649417 hadcm3n_203y_1940_40_007957625_1 129,600 213,782 1.6496
21 May 2012 23:40:58 1056144 14649417 hadcm3n_203y_1940_40_007957625_1 103,680 170,022 1.6399
20 May 2012 19:12:57 1056144 14649417 hadcm3n_203y_1940_40_007957625_1 77,760 127,876 1.6445
16 May 2012 23:11:46 1056144 14649417 hadcm3n_203y_1940_40_007957625_1 51,840 85,149 1.6425
14 May 2012 21:08:40 1056144 14649417 hadcm3n_203y_1940_40_007957625_1 25,920 42,500 1.6397


©2024 climateprediction.net