Task 17565723

Name hadcm3n_xait_1940_40_009149931_1
Workunit 9280267
Created 10 Dec 2014, 9:21:31 UTC
Sent 10 Dec 2014, 9:28:26 UTC
Report deadline 11 Mar 2015, 16:55:37 UTC
Received 22 Dec 2014, 17:56:50 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1244719
Run time 6 days 21 hours 10 min 23 sec
CPU time 6 days 10 hours 13 min 41 sec
Validate state Invalid
Credit 5,287.68
Device peak FLOPS 3.30 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.4.27</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
09:03:57 (6948): No heartbeat from core client for 30 sec - exiting
09:03:58 (6948): No heartbeat from core client for 30 sec - exiting
09:03:59 (6948): No heartbeat from core client for 30 sec - exiting
09:04:00 (6948): No heartbeat from core client for 30 sec - exiting
09:04:01 (6948): No heartbeat from core client for 30 sec - exiting
09:04:02 (6948): No heartbeat from core client for 30 sec - exiting
09:04:03 (6948): No heartbeat from core client for 30 sec - exiting
09:04:04 (6948): No heartbeat from core client for 30 sec - exiting
09:04:06 (6948): No heartbeat from core client for 30 sec - exiting
09:04:07 (6948): No heartbeat from core client for 30 sec - exiting
09:04:08 (6948): No heartbeat from core client for 30 sec - exiting
09:04:09 (6948): No heartbeat from core client for 30 sec - exiting
09:04:10 (6948): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:01:32 (6628): No heartbeat from core client for 30 sec - exiting
15:01:33 (6628): No heartbeat from core client for 30 sec - exiting
15:01:34 (6628): No heartbeat from core client for 30 sec - exiting
15:01:36 (6628): No heartbeat from core client for 30 sec - exiting
15:01:37 (6628): No heartbeat from core client for 30 sec - exiting
15:01:38 (6628): No heartbeat from core client for 30 sec - exiting
15:01:39 (6628): No heartbeat from core client for 30 sec - exiting
15:01:40 (6628): No heartbeat from core client for 30 sec - exiting
15:01:41 (6628): No heartbeat from core client for 30 sec - exiting
15:01:42 (6628): No heartbeat from core client for 30 sec - exiting
15:01:43 (6628): No heartbeat from core client for 30 sec - exiting
15:01:44 (6628): No heartbeat from core client for 30 sec - exiting
15:01:45 (6628): No heartbeat from core client for 30 sec - exiting
15:01:46 (6628): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:01:48 (6628): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:13:19 (6320): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
17:47:50 (9048): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:46:31 (5552): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:00:25 (1372): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:01:33 (6712): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:03:03 (6376): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6672, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6672, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6672, iMonCtr=1
Model crash detected, will try to restart...
09:52:16 (6672): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4568, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4568, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4568, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
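The stderr output above repeats a small set of messages many times. As a rough aid for reading such a log, the following sketch tallies each distinct message after stripping the leading `HH:MM:SS (pid): ` prefix. The function name `summarize` and the `SAMPLE` text are illustrative assumptions, not part of BOINC or the CPDN client; the sample lines are copied from the log above.

```python
import re
from collections import Counter

# Illustrative sample: a handful of lines copied from the stderr output above.
SAMPLE = """\
09:03:57 (6948): No heartbeat from core client for 30 sec - exiting
15:01:32 (6628): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
"""

# Matches the "HH:MM:SS (pid): " prefix seen on the heartbeat lines.
TIMESTAMP_PID = re.compile(r"^\d{2}:\d{2}:\d{2} \(\d+\): ")


def summarize(stderr_text):
    """Count each distinct message, ignoring timestamp and PID prefixes."""
    counts = Counter()
    for line in stderr_text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        counts[TIMESTAMP_PID.sub("", line)] += 1
    return counts


if __name__ == "__main__":
    for message, n in summarize(SAMPLE).most_common():
        print(f"{n:4d}  {message}")
```

Run against the full stderr text, a tally like this shows at a glance how often the client lost the heartbeat versus how often the model itself crashed.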
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
22 Dec 2014 13:22:41  1244719  17565723   hadcm3n_xait_1940_40_009149931_1  440,640   540,715         1.2271
21 Dec 2014 19:28:10  1244719  17565723   hadcm3n_xait_1940_40_009149931_1  414,720   510,414         1.2307
21 Dec 2014 10:26:37  1244719  17565723   hadcm3n_xait_1940_40_009149931_1  388,800   478,558         1.2309
21 Dec 2014 00:00:59  1244719  17565723   hadcm3n_xait_1940_40_009149931_1  362,880   446,531         1.2305
20 Dec 2014 14:34:30  1244719  17565723   hadcm3n_xait_1940_40_009149931_1  336,960   414,595         1.2304
20 Dec 2014 05:22:58  1244719  17565723   hadcm3n_xait_1940_40_009149931_1  311,040   383,055         1.2315
19 Dec 2014 19:47:46  1244719  17565723   hadcm3n_xait_1940_40_009149931_1  285,120   351,288         1.2321
19 Dec 2014 10:10:58  1244719  17565723   hadcm3n_xait_1940_40_009149931_1  259,200   319,043         1.2309
19 Dec 2014 00:37:18  1244719  17565723   hadcm3n_xait_1940_40_009149931_1  233,280   286,874         1.2297
18 Dec 2014 14:40:42  1244719  17565723   hadcm3n_xait_1940_40_009149931_1  207,360   254,243         1.2261
17 Dec 2014 07:31:43  1244719  17565723   hadcm3n_xait_1940_40_009149931_1  181,440   224,014         1.2346
16 Dec 2014 22:23:09  1244719  17565723   hadcm3n_xait_1940_40_009149931_1  155,520   194,062         1.2478
16 Dec 2014 13:26:07  1244719  17565723   hadcm3n_xait_1940_40_009149931_1  129,600   163,866         1.2644
16 Dec 2014 05:18:01  1244719  17565723   hadcm3n_xait_1940_40_009149931_1  103,680   133,411         1.2868
15 Dec 2014 19:36:13  1244719  17565723   hadcm3n_xait_1940_40_009149931_1  77,760    102,997         1.3245
13 Dec 2014 10:28:20  1244719  17565723   hadcm3n_xait_1940_40_009149931_1  51,840    70,921          1.3681
10 Dec 2014 22:31:52  1244719  17565723   hadcm3n_xait_1940_40_009149931_1  25,920    36,427          1.4054


©2024 climateprediction.net