climateprediction.net home page
Task 13350579

Task 13350579

Name hadcm3n_t4q5_1940_40_007443636_3
Workunit 7641139
Created 9 Sep 2011, 1:29:55 UTC
Sent 9 Sep 2011, 1:54:51 UTC
Report deadline 9 Dec 2011, 9:22:02 UTC
Received 24 Sep 2011, 7:13:10 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1077037
Run time 14 days 2 hours 35 min 50 sec
CPU time 13 days 19 hours 32 min 6 sec
Validate state Invalid
Credit 5,287.68
Device peak FLOPS 3.12 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
04:43:17 (4304): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:59:47 (9080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:02:28 (6124): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:04:56 (7924): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:07:15 (8112): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:09:27 (6208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=232, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5132, iMonCtr=1
Model crash detected, will try to restart...
17:16:22 (2304): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:11:30 (3136): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:09:50 (6920): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:17:13 (6080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
00:13:29 (5300): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:58:58 (5320): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:58:59 (5320): No heartbeat from core client for 30 sec - exiting
09:39:24 (5544): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:42:30 (3068): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:48:25 (9104): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:51:17 (8944): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
10:59:37 (11420): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4972, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4972, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4972, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4972, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4972, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4972, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 Sep 2011 17:22:45 1077037 13350579 hadcm3n_t4q5_1940_40_007443636_3 440,640 1,149,591 2.6089
22 Sep 2011 21:30:53 1077037 13350579 hadcm3n_t4q5_1940_40_007443636_3 414,720 1,079,001 2.6018
22 Sep 2011 01:33:04 1077037 13350579 hadcm3n_t4q5_1940_40_007443636_3 388,800 1,008,592 2.5941
21 Sep 2011 05:14:20 1077037 13350579 hadcm3n_t4q5_1940_40_007443636_3 362,880 938,367 2.5859
20 Sep 2011 09:23:51 1077037 13350579 hadcm3n_t4q5_1940_40_007443636_3 336,960 869,413 2.5802
19 Sep 2011 13:41:28 1077037 13350579 hadcm3n_t4q5_1940_40_007443636_3 311,040 800,585 2.5739
18 Sep 2011 17:33:17 1077037 13350579 hadcm3n_t4q5_1940_40_007443636_3 285,120 729,462 2.5584
17 Sep 2011 21:17:35 1077037 13350579 hadcm3n_t4q5_1940_40_007443636_3 259,200 658,080 2.5389
17 Sep 2011 01:06:22 1077037 13350579 hadcm3n_t4q5_1940_40_007443636_3 233,280 587,953 2.5204
16 Sep 2011 05:37:37 1077037 13350579 hadcm3n_t4q5_1940_40_007443636_3 207,360 519,139 2.5036
15 Sep 2011 02:34:09 1077037 13350579 hadcm3n_t4q5_1940_40_007443636_3 181,440 454,714 2.5061
14 Sep 2011 11:38:07 1077037 13350579 hadcm3n_t4q5_1940_40_007443636_3 155,520 390,831 2.5131
13 Sep 2011 14:59:33 1077037 13350579 hadcm3n_t4q5_1940_40_007443636_3 129,600 329,043 2.5389
12 Sep 2011 17:22:20 1077037 13350579 hadcm3n_t4q5_1940_40_007443636_3 103,680 267,238 2.5775
11 Sep 2011 14:29:40 1077037 13350579 hadcm3n_t4q5_1940_40_007443636_3 77,760 196,143 2.5224
10 Sep 2011 17:30:18 1077037 13350579 hadcm3n_t4q5_1940_40_007443636_3 51,840 125,978 2.4301
09 Sep 2011 22:09:18 1077037 13350579 hadcm3n_t4q5_1940_40_007443636_3 25,920 59,984 2.3142


©2024 climateprediction.net