Task 16163619

Name hadcm3n_oehw_1900_40_008473911_1
Workunit 8624750
Created 28 Dec 2013, 21:04:48 UTC
Sent 28 Dec 2013, 21:04:55 UTC
Report deadline 30 Mar 2014, 4:32:06 UTC
Received 1 Jan 2014, 17:16:40 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1244729
Run time 3 days 14 hours 4 min 54 sec
CPU time 3 days 10 hours 8 min 47 sec
Validate state Invalid
Credit 3,421.44
Device peak FLOPS 3.34 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>7.2.33</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
08:38:16 (21560): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:38:17 (21560): No heartbeat from core client for 30 sec - exiting
08:38:18 (21560): No heartbeat from core client for 30 sec - exiting
08:38:19 (21560): No heartbeat from core client for 30 sec - exiting
08:38:20 (21560): No heartbeat from core client for 30 sec - exiting
08:38:21 (21560): No heartbeat from core client for 30 sec - exiting
08:38:22 (21560): No heartbeat from core client for 30 sec - exiting
08:38:23 (21560): No heartbeat from core client for 30 sec - exiting
08:38:25 (21560): No heartbeat from core client for 30 sec - exiting
08:38:26 (21560): No heartbeat from core client for 30 sec - exiting
08:38:27 (21560): No heartbeat from core client for 30 sec - exiting
08:38:28 (21560): No heartbeat from core client for 30 sec - exiting
08:38:29 (21560): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:14:23 (22128): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:14:24 (22128): No heartbeat from core client for 30 sec - exiting
20:14:25 (22128): No heartbeat from core client for 30 sec - exiting
20:14:26 (22128): No heartbeat from core client for 30 sec - exiting
20:14:27 (22128): No heartbeat from core client for 30 sec - exiting
20:14:29 (22128): No heartbeat from core client for 30 sec - exiting
20:14:30 (22128): No heartbeat from core client for 30 sec - exiting
20:14:31 (22128): No heartbeat from core client for 30 sec - exiting
20:14:32 (22128): No heartbeat from core client for 30 sec - exiting
20:14:33 (22128): No heartbeat from core client for 30 sec - exiting
20:14:34 (22128): No heartbeat from core client for 30 sec - exiting
03:04:25 (9452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:04:26 (9452): No heartbeat from core client for 30 sec - exiting
03:04:27 (9452): No heartbeat from core client for 30 sec - exiting
03:04:29 (9452): No heartbeat from core client for 30 sec - exiting
03:04:30 (9452): No heartbeat from core client for 30 sec - exiting
03:04:31 (9452): No heartbeat from core client for 30 sec - exiting
03:04:32 (9452): No heartbeat from core client for 30 sec - exiting
03:04:33 (9452): No heartbeat from core client for 30 sec - exiting
03:04:34 (9452): No heartbeat from core client for 30 sec - exiting
03:04:35 (9452): No heartbeat from core client for 30 sec - exiting
03:04:36 (9452): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
03:01:36 (20332): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:01:37 (20332): No heartbeat from core client for 30 sec - exiting
05:50:32 (12808): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:50:34 (12808): No heartbeat from core client for 30 sec - exiting
05:50:35 (12808): No heartbeat from core client for 30 sec - exiting
05:50:36 (12808): No heartbeat from core client for 30 sec - exiting
05:50:37 (12808): No heartbeat from core client for 30 sec - exiting
05:50:38 (12808): No heartbeat from core client for 30 sec - exiting
05:50:40 (12808): No heartbeat from core client for 30 sec - exiting
05:50:41 (12808): No heartbeat from core client for 30 sec - exiting
05:50:42 (12808): No heartbeat from core client for 30 sec - exiting
05:50:43 (12808): No heartbeat from core client for 30 sec - exiting
05:50:44 (12808): No heartbeat from core client for 30 sec - exiting
09:54:17 (19364): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:08:54 (24760): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:08:55 (24760): No heartbeat from core client for 30 sec - exiting
15:08:56 (24760): No heartbeat from core client for 30 sec - exiting
15:08:57 (24760): No heartbeat from core client for 30 sec - exiting
15:08:58 (24760): No heartbeat from core client for 30 sec - exiting
15:08:59 (24760): No heartbeat from core client for 30 sec - exiting
18:01:17 (22288): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:12:25 (23736): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:12:26 (23736): No heartbeat from core client for 30 sec - exiting
03:12:27 (23736): No heartbeat from core client for 30 sec - exiting
03:12:28 (23736): No heartbeat from core client for 30 sec - exiting
03:12:29 (23736): No heartbeat from core client for 30 sec - exiting
03:12:30 (23736): No heartbeat from core client for 30 sec - exiting
03:12:31 (23736): No heartbeat from core client for 30 sec - exiting
03:12:32 (23736): No heartbeat from core client for 30 sec - exiting
03:12:33 (23736): No heartbeat from core client for 30 sec - exiting
03:12:34 (23736): No heartbeat from core client for 30 sec - exiting
03:12:35 (23736): No heartbeat from core client for 30 sec - exiting
03:30:52 (24484): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:30:59 (24484): No heartbeat from core client for 30 sec - exiting
03:31:00 (24484): No heartbeat from core client for 30 sec - exiting
03:31:01 (24484): No heartbeat from core client for 30 sec - exiting
03:31:02 (24484): No heartbeat from core client for 30 sec - exiting
08:35:22 (19604): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:35:24 (19604): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=22104, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=22104, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=22104, iMonCtr=1
Model crash detected, will try to restart...
09:00:54 (2988): No heartbeat from core client for 30 sec - exiting
09:01:19 (2988): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8832, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8832, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8832, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
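The stderr output above repeats the same heartbeat message many times per process, so a per-process count is easier to read than the raw log. The following is a minimal sketch, assuming each timestamped line has the form `HH:MM:SS (number): message` and that the parenthesized number is a process ID; the function name `count_heartbeat_losses` is hypothetical, and the sample lines are copied from the log above.

```python
import re
from collections import Counter

# Matches lines such as:
#   08:35:22 (19604): No heartbeat from core client for 30 sec - exiting
# Assumption: the number in parentheses is the process ID of the reporting process.
HEARTBEAT_RE = re.compile(
    r"^(\d{2}:\d{2}:\d{2}) \((\d+)\): No heartbeat from core client"
)

def count_heartbeat_losses(lines):
    """Return a Counter mapping process ID -> number of 'No heartbeat' lines."""
    counts = Counter()
    for line in lines:
        match = HEARTBEAT_RE.match(line.strip())
        if match:
            counts[int(match.group(2))] += 1
    return counts

sample = [
    "08:35:22 (19604): No heartbeat from core client for 30 sec - exiting",
    "CPDN Monitor - No 'heartbeat' from BOINC...",
    "08:35:24 (19604): No heartbeat from core client for 30 sec - exiting",
    "Signal 22 received, exiting...",
    "09:00:54 (2988): No heartbeat from core client for 30 sec - exiting",
    "09:01:19 (2988): No heartbeat from core client for 30 sec - exiting",
]

counts = count_heartbeat_losses(sample)
for pid, n in sorted(counts.items()):
    print(f"PID {pid}: {n} heartbeat-loss line(s)")
```

Lines without a timestamp and parenthesized number, such as the CPDN Monitor and Signal 22 messages, are skipped by the pattern rather than counted.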
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
01 Jan 2014 09:09:24 | 1244729 | 16163619 | hadcm3n_oehw_1900_40_008473911_1 | 285,120 | 284,759 | 0.9987
01 Jan 2014 00:02:14 | 1244729 | 16163619 | hadcm3n_oehw_1900_40_008473911_1 | 259,200 | 255,333 | 0.9851
31 Dec 2013 14:51:33 | 1244729 | 16163619 | hadcm3n_oehw_1900_40_008473911_1 | 233,280 | 226,410 | 0.9706
31 Dec 2013 05:44:13 | 1244729 | 16163619 | hadcm3n_oehw_1900_40_008473911_1 | 207,360 | 197,877 | 0.9543
30 Dec 2013 22:42:38 | 1244729 | 16163619 | hadcm3n_oehw_1900_40_008473911_1 | 181,440 | 171,526 | 0.9454
30 Dec 2013 15:33:57 | 1244729 | 16163619 | hadcm3n_oehw_1900_40_008473911_1 | 155,520 | 145,151 | 0.9333
30 Dec 2013 07:31:39 | 1244729 | 16163619 | hadcm3n_oehw_1900_40_008473911_1 | 129,600 | 119,190 | 0.9197
30 Dec 2013 00:24:50 | 1244729 | 16163619 | hadcm3n_oehw_1900_40_008473911_1 | 103,680 | 93,279 | 0.8997
29 Dec 2013 17:22:43 | 1244729 | 16163619 | hadcm3n_oehw_1900_40_008473911_1 | 77,760 | 67,699 | 0.8706
29 Dec 2013 10:17:33 | 1244729 | 16163619 | hadcm3n_oehw_1900_40_008473911_1 | 51,840 | 42,983 | 0.8291
29 Dec 2013 03:11:20 | 1244729 | 16163619 | hadcm3n_oehw_1900_40_008473911_1 | 25,920 | 21,130 | 0.8152
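
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. This is inferred from the numbers in the table, not from any project documentation. A minimal sketch that checks this for a few rows, with hypothetical helper names `parse_int` and `seconds_per_timestep`:

```python
def parse_int(field: str) -> int:
    """Parse an integer written with thousands separators, e.g. '285,120'."""
    return int(field.replace(",", ""))

def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)

# (Timestep, CPU Time (sec), Average (sec/TS)) copied from the trickle table.
rows = [
    ("285,120", "284,759", 0.9987),
    ("259,200", "255,333", 0.9851),
    ("25,920", "21,130", 0.8152),
]

for timestep, cpu_time, reported in rows:
    computed = seconds_per_timestep(parse_int(cpu_time), parse_int(timestep))
    print(f"timestep={timestep} computed={computed} reported={reported}")
```

Under that reading, the rising average across the trickles means later timesteps cost more CPU time each than earlier ones on this host.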

