climateprediction.net home page
Task 15733344

Name hadcm3n_u065_2020_40_008336645_1
Workunit 8487506
Created 18 Apr 2013, 10:59:59 UTC
Sent 18 Apr 2013, 11:00:11 UTC
Report deadline 18 Jul 2013, 18:27:22 UTC
Received 13 Jun 2013, 21:19:53 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1284994
Run time 14 days 20 hours 22 min 47 sec
CPU time 13 days 23 hours 44 min 24 sec
Validate state Invalid
Credit 12,441.60
Device peak FLOPS 3.32 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4792, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:16:02 (2932): No heartbeat from core client for 30 sec - exiting
07:16:04 (2932): No heartbeat from core client for 30 sec - exiting
07:16:05 (2932): No heartbeat from core client for 30 sec - exiting
07:16:06 (2932): No heartbeat from core client for 30 sec - exiting
07:16:07 (2932): No heartbeat from core client for 30 sec - exiting
07:16:08 (2932): No heartbeat from core client for 30 sec - exiting
07:16:09 (2932): No heartbeat from core client for 30 sec - exiting
07:16:10 (2932): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2892, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3840, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6020, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3128, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5092, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3564, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4584, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:08:16 (2848): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:08:17 (2848): No heartbeat from core client for 30 sec - exiting
19:08:18 (2848): No heartbeat from core client for 30 sec - exiting
19:08:19 (2848): No heartbeat from core client for 30 sec - exiting
19:08:21 (2848): No heartbeat from core client for 30 sec - exiting
19:08:22 (2848): No heartbeat from core client for 30 sec - exiting
19:08:23 (2848): No heartbeat from core client for 30 sec - exiting
19:08:24 (2848): No heartbeat from core client for 30 sec - exiting
19:08:25 (2848): No heartbeat from core client for 30 sec - exiting
19:08:26 (2848): No heartbeat from core client for 30 sec - exiting
19:08:27 (2848): No heartbeat from core client for 30 sec - exiting
07:05:34 (4920): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:18:34 (5888): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:53:58 (4624): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
18:51:19 (4312): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:51:24 (4312): No heartbeat from core client for 30 sec - exiting
18:51:25 (4312): No heartbeat from core client for 30 sec - exiting
18:51:26 (4312): No heartbeat from core client for 30 sec - exiting
18:51:27 (4312): No heartbeat from core client for 30 sec - exiting
18:51:28 (4312): No heartbeat from core client for 30 sec - exiting
18:51:29 (4312): No heartbeat from core client for 30 sec - exiting
18:51:31 (4312): No heartbeat from core client for 30 sec - exiting
18:51:32 (4312): No heartbeat from core client for 30 sec - exiting
18:51:33 (4312): No heartbeat from core client for 30 sec - exiting
18:51:34 (4312): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2904, iMonCtr=1
Model crash detected, will try to restart...
06:08:28 (616): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:35:31 (280): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:35:32 (280): No heartbeat from core client for 30 sec - exiting
06:35:33 (280): No heartbeat from core client for 30 sec - exiting
06:35:34 (280): No heartbeat from core client for 30 sec - exiting
06:35:35 (280): No heartbeat from core client for 30 sec - exiting
06:35:37 (280): No heartbeat from core client for 30 sec - exiting
06:35:38 (280): No heartbeat from core client for 30 sec - exiting
06:35:39 (280): No heartbeat from core client for 30 sec - exiting
06:35:40 (280): No heartbeat from core client for 30 sec - exiting
06:35:41 (280): No heartbeat from core client for 30 sec - exiting
06:35:42 (280): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4436, iMonCtr=1
Model crash detected, will try to restart...
20:00:50 (1340): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:00:56 (1340): No heartbeat from core client for 30 sec - exiting
20:00:57 (1340): No heartbeat from core client for 30 sec - exiting
20:00:59 (1340): No heartbeat from core client for 30 sec - exiting
20:01:00 (1340): No heartbeat from core client for 30 sec - exiting
20:01:01 (1340): No heartbeat from core client for 30 sec - exiting
20:01:02 (1340): No heartbeat from core client for 30 sec - exiting
20:01:03 (1340): No heartbeat from core client for 30 sec - exiting
20:01:04 (1340): No heartbeat from core client for 30 sec - exiting
20:01:05 (1340): No heartbeat from core client for 30 sec - exiting
20:01:06 (1340): No heartbeat from core client for 30 sec - exiting
20:03:29 (2244): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:09:53 (3900): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:13:50 (2000): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:28:30 (3832): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:31:02 (5720): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:31:11 (5720): No heartbeat from core client for 30 sec - exiting
21:31:12 (5720): No heartbeat from core client for 30 sec - exiting
21:31:14 (5720): No heartbeat from core client for 30 sec - exiting
21:31:15 (5720): No heartbeat from core client for 30 sec - exiting
21:31:16 (5720): No heartbeat from core client for 30 sec - exiting
21:31:17 (5720): No heartbeat from core client for 30 sec - exiting
21:31:18 (5720): No heartbeat from core client for 30 sec - exiting
21:31:19 (5720): No heartbeat from core client for 30 sec - exiting
21:31:20 (5720): No heartbeat from core client for 30 sec - exiting
21:31:21 (5720): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5760, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5760, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5760, iMonCtr=1
Model crash detected, will try to restart...
21:32:37 (5760): No heartbeat from core client for 30 sec - exiting
21:32:38 (5760): No heartbeat from core client for 30 sec - exiting
21:32:39 (5760): No heartbeat from core client for 30 sec - exiting
21:32:40 (5760): No heartbeat from core client for 30 sec - exiting
21:32:41 (5760): No heartbeat from core client for 30 sec - exiting
21:32:42 (5760): No heartbeat from core client for 30 sec - exiting
21:32:43 (5760): No heartbeat from core client for 30 sec - exiting
21:32:44 (5760): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:32:45 (5760): No heartbeat from core client for 30 sec - exiting
21:32:47 (5760): No heartbeat from core client for 30 sec - exiting
21:32:48 (5760): No heartbeat from core client for 30 sec - exiting
21:32:49 (5760): No heartbeat from core client for 30 sec - exiting
21:32:50 (5760): No heartbeat from core client for 30 sec - exiting
21:32:51 (5760): No heartbeat from core client for 30 sec - exiting
21:32:52 (5760): No heartbeat from core client for 30 sec - exiting
21:32:53 (5760): No heartbeat from core client for 30 sec - exiting
21:32:54 (5760): No heartbeat from core client for 30 sec - exiting
21:32:55 (5760): No heartbeat from core client for 30 sec - exiting
06:06:05 (1180): No heartbeat from core client for 30 sec - exiting
06:06:07 (1180): No heartbeat from core client for 30 sec - exiting
06:06:08 (1180): No heartbeat from core client for 30 sec - exiting
06:06:09 (1180): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:08:44 (2816): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:08:45 (2816): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5004, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5004, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5004, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
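The stderr output above is long and repetitive, so a tally by message type can make it easier to read. The following is a minimal sketch, not part of BOINC or the CPDN application: the category names and the `tally_stderr` helper are hypothetical, and the patterns are simply copied from message text that appears in the log.

```python
import re
from collections import Counter

# Hypothetical category names; each pattern is taken from message text in the stderr above.
PATTERNS = [
    ("quit_request", re.compile(r"Quit request from BOINC")),
    ("suspend_request", re.compile(r"Suspend request from BOINC")),
    ("no_heartbeat", re.compile(r"No heartbeat from core client")),
    ("model_crash", re.compile(r"Model crash detected")),
    ("signal_exit", re.compile(r"Signal \d+ received")),
]

def tally_stderr(text):
    """Count stderr lines per category; lines matching no pattern are ignored."""
    counts = Counter()
    for line in text.splitlines():
        for name, pattern in PATTERNS:
            if pattern.search(line):
                counts[name] += 1
                break  # count each line at most once
    return counts

# A few lines copied from the log above, used as sample input.
sample = """CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:16:02 (2932): No heartbeat from core client for 30 sec - exiting
Model crash detected, will try to restart...
Signal 22 received, exiting...
"""

for name, count in sorted(tally_stderr(sample).items()):
    print(name, count)
```

Running the same function over the full stderr text would give one count per message type, which shows at a glance how often the model was restarted before the task gave up.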
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
30 Jul 2013 10:05:00 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 1,036,800 | 1,538,333 | 1.4837
30 Jul 2013 10:05:00 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 1,010,880 | 1,495,603 | 1.4795
26 Jul 2013 12:37:21 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 984,960 | 1,452,747 | 1.4749
25 Jul 2013 23:33:45 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 959,040 | 1,409,913 | 1.4701
23 Jul 2013 22:08:24 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 933,120 | 1,367,768 | 1.4658
23 Jul 2013 20:56:13 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 907,200 | 1,331,468 | 1.4677
23 Jul 2013 20:26:26 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 881,280 | 1,294,932 | 1.4694
23 Jul 2013 20:10:06 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 855,360 | 1,258,373 | 1.4712
11 Jun 2013 13:05:04 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 829,440 | 1,197,466 | 1.4437
09 Jun 2013 01:51:57 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 803,520 | 1,155,904 | 1.4386
08 Jun 2013 04:11:46 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 777,600 | 1,119,154 | 1.4392
07 Jun 2013 07:50:05 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 751,680 | 1,081,165 | 1.4383
05 Jun 2013 21:30:15 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 725,760 | 1,040,114 | 1.4331
02 Jun 2013 22:09:25 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 699,840 | 999,413 | 1.4281
01 Jun 2013 23:10:03 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 673,920 | 959,046 | 1.4231
01 Jun 2013 03:24:48 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 648,000 | 922,486 | 1.4236
30 May 2013 10:34:49 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 622,080 | 886,944 | 1.4258
26 May 2013 10:55:47 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 596,160 | 848,655 | 1.4235
25 May 2013 22:26:56 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 570,240 | 806,375 | 1.4141
23 May 2013 10:43:45 | 1130983 | 15733344 | hadcm3n_u065_2020_40_008336645_1 | 544,320 | 764,005 | 1.4036
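
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. This is an inference from the figures in the table, not a documented definition. A small sketch that checks it against the first and last rows, using a hypothetical helper `sec_per_timestep`:

```python
def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds divided by completed timesteps."""
    return round(cpu_time_sec / timestep, 4)

# (timestep, CPU time in seconds, reported average) copied from the first and last table rows.
rows = [
    (1_036_800, 1_538_333, 1.4837),
    (544_320, 764_005, 1.4036),
]

for timestep, cpu_time_sec, reported in rows:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    print(timestep, computed, reported)
```

Both computed values agree with the reported averages, which is consistent with the assumed definition.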