climateprediction.net home page
Task 13288145

Task 13288145

Name hadcm3n_p2x6_1940_40_007420198_0
Workunit 7617833
Created 24 Aug 2011, 21:40:31 UTC
Sent 24 Aug 2011, 21:42:01 UTC
Report deadline 24 Nov 2011, 5:09:12 UTC
Received 3 Sep 2011, 10:32:39 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1074891
Run time 9 days 10 hours 14 min 17 sec
CPU time 8 days 23 hours 58 min
Validate state Invalid
Credit 4,665.60
Device peak FLOPS 3.15 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
10:30:39 (5724): No heartbeat from core client for 30 sec - exiting
10:30:40 (5724): No heartbeat from core client for 30 sec - exiting
10:30:41 (5724): No heartbeat from core client for 30 sec - exiting
10:30:42 (5724): No heartbeat from core client for 30 sec - exiting
10:30:43 (5724): No heartbeat from core client for 30 sec - exiting
10:30:44 (5724): No heartbeat from core client for 30 sec - exiting
10:30:45 (5724): No heartbeat from core client for 30 sec - exiting
10:30:46 (5724): No heartbeat from core client for 30 sec - exiting
10:30:47 (5724): No heartbeat from core client for 30 sec - exiting
10:30:48 (5724): No heartbeat from core client for 30 sec - exiting
10:30:49 (5724): No heartbeat from core client for 30 sec - exiting
10:30:50 (5724): No heartbeat from core client for 30 sec - exiting
10:30:51 (5724): No heartbeat from core client for 30 sec - exiting
10:30:52 (5724): No heartbeat from core client for 30 sec - exiting
10:30:53 (5724): No heartbeat from core client for 30 sec - exiting
10:30:54 (5724): No heartbeat from core client for 30 sec - exiting
10:30:55 (5724): No heartbeat from core client for 30 sec - exiting
10:30:56 (5724): No heartbeat from core client for 30 sec - exiting
10:30:57 (5724): No heartbeat from core client for 30 sec - exiting
10:30:58 (5724): No heartbeat from core client for 30 sec - exiting
10:30:59 (5724): No heartbeat from core client for 30 sec - exiting
10:31:00 (5724): No heartbeat from core client for 30 sec - exiting
10:31:01 (5724): No heartbeat from core client for 30 sec - exiting
10:31:02 (5724): No heartbeat from core client for 30 sec - exiting
10:31:03 (5724): No heartbeat from core client for 30 sec - exiting
10:31:04 (5724): No heartbeat from core client for 30 sec - exiting
10:31:05 (5724): No heartbeat from core client for 30 sec - exiting
10:31:06 (5724): No heartbeat from core client for 30 sec - exiting
10:31:07 (5724): No heartbeat from core client for 30 sec - exiting
10:31:08 (5724): No heartbeat from core client for 30 sec - exiting
10:31:09 (5724): No heartbeat from core client for 30 sec - exiting
10:31:10 (5724): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3220, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3220, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3220, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3220, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3220, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3220, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
03 Sep 2011 07:48:07 1074891 13288145 hadcm3n_p2x6_1940_40_007420198_0 388,800 777,088 1.9987
02 Sep 2011 17:17:45 1074891 13288145 hadcm3n_p2x6_1940_40_007420198_0 362,880 725,786 2.0001
02 Sep 2011 01:33:08 1074891 13288145 hadcm3n_p2x6_1940_40_007420198_0 336,960 673,594 1.9990
01 Sep 2011 10:08:29 1074891 13288145 hadcm3n_p2x6_1940_40_007420198_0 311,040 621,454 1.9980
31 Aug 2011 18:58:18 1074891 13288145 hadcm3n_p2x6_1940_40_007420198_0 285,120 569,227 1.9964
31 Aug 2011 02:47:55 1074891 13288145 hadcm3n_p2x6_1940_40_007420198_0 259,200 516,826 1.9939
30 Aug 2011 11:41:16 1074891 13288145 hadcm3n_p2x6_1940_40_007420198_0 233,280 464,405 1.9908
29 Aug 2011 21:16:46 1074891 13288145 hadcm3n_p2x6_1940_40_007420198_0 207,360 412,731 1.9904
29 Aug 2011 10:55:06 1074891 13288145 hadcm3n_p2x6_1940_40_007420198_0 181,440 361,465 1.9922
28 Aug 2011 15:06:58 1074891 13288145 hadcm3n_p2x6_1940_40_007420198_0 155,520 309,576 1.9906
28 Aug 2011 00:30:24 1074891 13288145 hadcm3n_p2x6_1940_40_007420198_0 129,600 257,319 1.9855
27 Aug 2011 08:43:02 1074891 13288145 hadcm3n_p2x6_1940_40_007420198_0 103,680 205,108 1.9783
26 Aug 2011 17:54:52 1074891 13288145 hadcm3n_p2x6_1940_40_007420198_0 77,760 153,275 1.9711
26 Aug 2011 04:01:37 1074891 13288145 hadcm3n_p2x6_1940_40_007420198_0 51,840 101,524 1.9584
25 Aug 2011 12:12:51 1074891 13288145 hadcm3n_p2x6_1940_40_007420198_0 25,920 50,395 1.9443


©2024 climateprediction.net