Task 15786074

Name hadcm3n_3m1n_1980_40_008367034_1
Workunit 8517893
Created 16 May 2013, 1:47:27 UTC
Sent 16 May 2013, 1:47:38 UTC
Report deadline 15 Aug 2013, 9:14:49 UTC
Received 19 Jun 2013, 21:42:03 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1227663
Run time 13 days 22 hours 44 min 58 sec
CPU time 13 days 7 hours 47 min 21 sec
Validate state Invalid
Credit 8,087.04
Device peak FLOPS 2.63 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1376, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1376, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1376, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1376, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1376, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1376, iMonCtr=1
Model crash detected, will try to restart...
15:28:04 (6696): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:42:17 (1768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3208, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5092, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6464, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3832, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3832, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3832, iMonCtr=1
Model crash detected, will try to restart...
C12:01:02 (904): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:37:21 (4328): No heartbeat from core client for 30 sec - exiting
16:37:22 (4328): No heartbeat from core client for 30 sec - exiting
16:37:23 (4328): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1452, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1452, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1452, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1452, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1452, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1452, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
18 Jun 2013 23:57:08  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  673,920   1,153,135       1.7111
14 Jun 2013 21:28:36  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  648,000   1,104,778       1.7049
09 Jun 2013 21:42:01  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  622,080   1,061,641       1.7066
09 Jun 2013 10:00:07  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  596,160   1,019,770       1.7106
08 Jun 2013 22:20:26  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  570,240   977,978         1.7150
08 Jun 2013 10:39:26  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  544,320   936,264         1.7201
07 Jun 2013 22:28:14  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  518,400   892,908         1.7224
07 Jun 2013 09:25:48  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  492,480   847,347         1.7206
06 Jun 2013 21:29:48  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  466,560   802,487         1.7200
05 Jun 2013 17:04:23  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  440,640   755,113         1.7137
05 Jun 2013 02:27:39  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  414,720   704,597         1.6990
04 Jun 2013 14:17:18  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  388,800   655,560         1.6861
03 Jun 2013 22:43:46  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  362,880   605,183         1.6677
29 May 2013 01:25:22  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  336,960   555,119         1.6474
28 May 2013 13:30:29  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  311,040   512,788         1.6486
28 May 2013 02:11:14  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  285,120   470,147         1.6489
24 May 2013 20:21:38  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  259,200   428,167         1.6519
24 May 2013 08:37:51  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  233,280   386,166         1.6554
23 May 2013 21:02:56  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  207,360   344,599         1.6618
23 May 2013 12:44:28  1227663  15786074   hadcm3n_3m1n_1980_40_008367034_1  181,440   304,395         1.6777


©2024 climateprediction.net