climateprediction.net home page
Task 13317844

Name hadcm3n_o2c8_1940_40_007432721_0
Workunit 7630224
Created 31 Aug 2011, 20:09:30 UTC
Sent 31 Aug 2011, 20:14:54 UTC
Report deadline 1 Dec 2011, 3:42:05 UTC
Received 14 Sep 2011, 11:37:00 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1077037
Run time 12 days 12 hours 13 min 6 sec
CPU time 12 days 4 hours 13 min 9 sec
Validate state Invalid
Credit 4,976.64
Device peak FLOPS 3.21 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
08:26:26 (4764): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:49:00 (1172): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:49:01 (1172): No heartbeat from core client for 30 sec - exiting
08:51:06 (7252): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:53:30 (6944): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:55:16 (3652): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:56:58 (2680): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:58:45 (7608): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:00:27 (6356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
13:11:38 (4772): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:12:56 (11832): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
04:43:17 (2044): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:59:47 (8880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:02:28 (3796): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:04:55 (7000): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:07:14 (5152): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:09:26 (4052): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:14:53 (5044): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:16:22 (1564): No heartbeat from core client for 30 sec - exiting
17:16:23 (1564): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:11:30 (1700): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:09:50 (7292): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:17:13 (5992): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7856, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7856, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7856, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7856, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7856, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7856, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
14 Sep 2011 00:33:43 | 1077037 | 13317844 | hadcm3n_o2c8_1940_40_007432721_0 | 414,720 | 1,025,066 | 2.4717
13 Sep 2011 11:41:14 | 1077037 | 13317844 | hadcm3n_o2c8_1940_40_007432721_0 | 388,800 | 963,586 | 2.4784
12 Sep 2011 12:00:56 | 1077037 | 13317844 | hadcm3n_o2c8_1940_40_007432721_0 | 362,880 | 894,364 | 2.4646
11 Sep 2011 02:26:47 | 1077037 | 13317844 | hadcm3n_o2c8_1940_40_007432721_0 | 336,960 | 823,685 | 2.4445
10 Sep 2011 14:29:37 | 1077037 | 13317844 | hadcm3n_o2c8_1940_40_007432721_0 | 311,040 | 753,794 | 2.4235
09 Sep 2011 13:26:03 | 1077037 | 13317844 | hadcm3n_o2c8_1940_40_007432721_0 | 285,120 | 691,475 | 2.4252
08 Sep 2011 15:11:31 | 1077037 | 13317844 | hadcm3n_o2c8_1940_40_007432721_0 | 259,200 | 635,970 | 2.4536
08 Sep 2011 00:41:22 | 1077037 | 13317844 | hadcm3n_o2c8_1940_40_007432721_0 | 233,280 | 583,852 | 2.5028
07 Sep 2011 11:35:52 | 1077037 | 13317844 | hadcm3n_o2c8_1940_40_007432721_0 | 207,360 | 524,842 | 2.5311
06 Sep 2011 13:18:08 | 1077037 | 13317844 | hadcm3n_o2c8_1940_40_007432721_0 | 181,440 | 462,499 | 2.5490
05 Sep 2011 17:38:26 | 1077037 | 13317844 | hadcm3n_o2c8_1940_40_007432721_0 | 155,520 | 391,651 | 2.5183
04 Sep 2011 16:34:29 | 1077037 | 13317844 | hadcm3n_o2c8_1940_40_007432721_0 | 129,600 | 320,982 | 2.4767
03 Sep 2011 22:03:14 | 1077037 | 13317844 | hadcm3n_o2c8_1940_40_007432721_0 | 103,680 | 253,908 | 2.4490
03 Sep 2011 02:21:39 | 1077037 | 13317844 | hadcm3n_o2c8_1940_40_007432721_0 | 77,760 | 185,254 | 2.3824
02 Sep 2011 07:02:00 | 1077037 | 13317844 | hadcm3n_o2c8_1940_40_007432721_0 | 51,840 | 115,281 | 2.2238
01 Sep 2011 11:48:12 | 1077037 | 13317844 | hadcm3n_o2c8_1940_40_007432721_0 | 25,920 | 46,788 | 1.8051
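
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. That reading is inferred from the numbers in the trickle table, not stated by the page. The sketch below recomputes it for three rows copied from the table; the function name `average_sec_per_timestep` is hypothetical and used only for illustration.

```python
# Recompute "Average (sec/TS)" as cumulative CPU seconds / cumulative timesteps.
# Assumption: this is how the page derives the column; the page does not say so.
# Each row is copied from the trickle table: (timestep, cpu_time_sec, reported_average).
trickles = [
    (414_720, 1_025_066, 2.4717),
    (207_360, 524_842, 2.5311),
    (25_920, 46_788, 1.8051),
]


def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return CPU seconds per model timestep, rounded to 4 decimals like the table."""
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_sec, reported in trickles:
    computed = average_sec_per_timestep(cpu_sec, timestep)
    print(f"{timestep:>7,}  {cpu_sec:>9,}  computed={computed:.4f}  reported={reported:.4f}")
```

For the rows shown, the recomputed values agree with the reported column to four decimal places.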

