climateprediction.net home page
Task 13023062

Task 13023062

Name hadcm3n_t3c3_1940_40_007315100_0
Workunit 7512530
Created 28 Jun 2011, 19:37:21 UTC
Sent 28 Jun 2011, 19:49:19 UTC
Report deadline 28 Sep 2011, 3:16:30 UTC
Received 24 Aug 2011, 22:43:16 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1126569
Run time 11 days 1 hours 48 min 13 sec
CPU time 11 days 0 hours 1 min 54 sec
Validate state Invalid
Credit 7,776.00
Device peak FLOPS 2.56 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1972, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:42:07 (2936): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:46:42 (2944): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:46:43 (2944): No heartbeat from core client for 30 sec - exiting
11:48:18 (2680): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:48:20 (2680): No heartbeat from core client for 30 sec - exiting
12:05:58 (3436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:06:00 (3436): No heartbeat from core client for 30 sec - exiting
16:10:28 (3004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
04:51:46 (444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:17:55 (2740): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:17:56 (2740): No heartbeat from core client for 30 sec - exiting
11:38:45 (2976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:38:47 (2976): No heartbeat from core client for 30 sec - exiting
00:14:15 (3000): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:04:09 (824): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:04:10 (824): No heartbeat from core client for 30 sec - exiting
16:28:57 (2952): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:28:59 (2952): No heartbeat from core client for 30 sec - exiting
16:31:29 (3508): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:58:08 (3884): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:51:15 (3788): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:52:59 (3688): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:35:07 (3392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:35:09 (3392): No heartbeat from core client for 30 sec - exiting
01:04:17 (3288): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:05:37 (3288): No heartbeat from core client for 30 sec - exiting
03:12:19 (2176): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:12:20 (2176): No heartbeat from core client for 30 sec - exiting
03:12:21 (2176): No heartbeat from core client for 30 sec - exiting
06:10:56 (3304): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:10:58 (3304): No heartbeat from core client for 30 sec - exiting
09:26:27 (1652): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:32:29 (3284): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:32:31 (3284): No heartbeat from core client for 30 sec - exiting
16:45:24 (3448): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:45:25 (3448): No heartbeat from core client for 30 sec - exiting
17:52:39 (2396): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:52:42 (2396): No heartbeat from core client for 30 sec - exiting
17:52:43 (2396): No heartbeat from core client for 30 sec - exiting
17:52:45 (2396): No heartbeat from core client for 30 sec - exiting
17:52:46 (2396): No heartbeat from core client for 30 sec - exiting
14:30:55 (748): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:30:56 (748): No heartbeat from core client for 30 sec - exiting
20:52:54 (3720): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:52:56 (3720): No heartbeat from core client for 30 sec - exiting
20:52:57 (3720): No heartbeat from core client for 30 sec - exiting
20:52:58 (3720): No heartbeat from core client for 30 sec - exiting
10:38:02 (452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:38:04 (452): No heartbeat from core client for 30 sec - exiting
10:38:05 (452): No heartbeat from core client for 30 sec - exiting
11:50:17 (1572): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:50:18 (1572): No heartbeat from core client for 30 sec - exiting
16:08:53 (3376): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:08:55 (3376): No heartbeat from core client for 30 sec - exiting
16:08:56 (3376): No heartbeat from core client for 30 sec - exiting
16:08:57 (3376): No heartbeat from core client for 30 sec - exiting
14:31:51 (444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:31:52 (444): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
15:26:10 (3828): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:26:12 (3828): No heartbeat from core client for 30 sec - exiting
15:26:13 (3828): No heartbeat from core client for 30 sec - exiting
15:26:14 (3828): No heartbeat from core client for 30 sec - exiting
15:26:15 (3828): No heartbeat from core client for 30 sec - exiting
06:43:04 (3532): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:43:06 (3532): No heartbeat from core client for 30 sec - exiting
06:43:07 (3532): No heartbeat from core client for 30 sec - exiting
06:43:08 (3532): No heartbeat from core client for 30 sec - exiting
06:43:09 (3532): No heartbeat from core client for 30 sec - exiting
06:43:10 (3532): No heartbeat from core client for 30 sec - exiting
07:44:48 (3536): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:44:49 (3536): No heartbeat from core client for 30 sec - exiting
20:02:41 (1228): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:02:43 (1228): No heartbeat from core client for 30 sec - exiting
20:02:44 (1228): No heartbeat from core client for 30 sec - exiting
20:02:45 (1228): No heartbeat from core client for 30 sec - exiting
20:02:46 (1228): No heartbeat from core client for 30 sec - exiting
23:00:49 (2972): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:00:50 (2972): No heartbeat from core client for 30 sec - exiting
07:41:15 (624): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:48:05 (3024): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:40:48 (2620): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:40:50 (2620): No heartbeat from core client for 30 sec - exiting
13:40:51 (2620): No heartbeat from core client for 30 sec - exiting
13:40:52 (2620): No heartbeat from core client for 30 sec - exiting
14:17:03 (2472): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:17:04 (2472): No heartbeat from core client for 30 sec - exiting
15:01:55 (312): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:45:17 (3136): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:45:19 (3136): No heartbeat from core client for 30 sec - exiting
20:20:26 (3376): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:20:27 (3376): No heartbeat from core client for 30 sec - exiting
20:51:15 (2440): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:51:17 (2440): No heartbeat from core client for 30 sec - exiting
20:51:18 (2440): No heartbeat from core client for 30 sec - exiting
20:51:19 (2440): No heartbeat from core client for 30 sec - exiting
21:46:05 (3676): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:12:50 (3980): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:12:51 (3980): No heartbeat from core client for 30 sec - exiting
22:12:52 (3980): No heartbeat from core client for 30 sec - exiting
12:22:47 (912): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:08:30 (3780): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:08:31 (3780): No heartbeat from core client for 30 sec - exiting
15:08:32 (3780): No heartbeat from core client for 30 sec - exiting
15:08:33 (3780): No heartbeat from core client for 30 sec - exiting
15:08:34 (3780): No heartbeat from core client for 30 sec - exiting
15:08:35 (3780): No heartbeat from core client for 30 sec - exiting
15:08:36 (3780): No heartbeat from core client for 30 sec - exiting
23:33:11 (4052): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:33:12 (4052): No heartbeat from core client for 30 sec - exiting
02:06:22 (1420): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
06:27:04 (4020): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:27:06 (4020): No heartbeat from core client for 30 sec - exiting
06:27:07 (4020): No heartbeat from core client for 30 sec - exiting
11:53:57 (1508): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2744, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2744, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2744, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2744, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2744, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2744, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
09 Jul 2011 19:26:14 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 648,000 924,069 1.4260
09 Jul 2011 06:19:34 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 622,080 886,296 1.4247
08 Jul 2011 23:10:45 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 596,160 848,317 1.4230
08 Jul 2011 08:48:53 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 570,240 810,431 1.4212
07 Jul 2011 23:15:24 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 544,320 772,637 1.4195
07 Jul 2011 23:15:24 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 518,400 736,038 1.4198
07 Jul 2011 23:15:24 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 492,480 699,448 1.4203
07 Jul 2011 23:15:24 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 466,560 662,885 1.4208
07 Jul 2011 23:15:24 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 440,640 626,329 1.4214
05 Jul 2011 23:57:45 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 414,720 589,720 1.4220
05 Jul 2011 23:57:45 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 388,800 553,139 1.4227
04 Jul 2011 23:23:42 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 362,880 516,596 1.4236
04 Jul 2011 12:46:35 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 336,960 480,031 1.4246
04 Jul 2011 02:53:52 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 311,040 443,398 1.4255
03 Jul 2011 17:05:49 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 285,120 406,766 1.4266
03 Jul 2011 07:13:44 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 259,200 370,171 1.4281
02 Jul 2011 23:04:44 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 233,280 333,506 1.4296
02 Jul 2011 23:04:44 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 207,360 297,025 1.4324
02 Jul 2011 00:30:24 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 181,440 260,548 1.4360
01 Jul 2011 23:29:44 1126569 13023062 hadcm3n_t3c3_1940_40_007315100_0 155,520 224,124 1.4411


©2024 climateprediction.net