Task 16606351

Name hadcm3n_8c6n_1980_40_008725210_3
Workunit 8871188
Created 1 May 2014, 15:20:01 UTC
Sent 1 May 2014, 15:30:33 UTC
Report deadline 31 Jul 2014, 22:57:44 UTC
Received 22 Sep 2014, 14:36:39 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 851413
Run time 34 days 19 hours 7 min 28 sec
CPU time 20 days 18 hours 21 min 27 sec
Validate state Invalid
Credit 8,087.04
Device peak FLOPS 2.17 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
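The run time and CPU time listed above can be compared once both are converted to seconds. The following is a minimal sketch, assuming the "D days H hours M min S sec" format shown on this page; the helper name `duration_to_seconds` is illustrative and not part of BOINC.

```python
import re

# Seconds per unit, keyed by the unit stems used on this page.
UNIT_SECONDS = {"day": 86400, "hour": 3600, "min": 60, "sec": 1}

def duration_to_seconds(text):
    """Convert a string like '34 days 19 hours 7 min 28 sec' to seconds."""
    total = 0
    for value, unit in re.findall(r"(\d+)\s*(day|hour|min|sec)", text):
        total += int(value) * UNIT_SECONDS[unit]
    return total

run_time = duration_to_seconds("34 days 19 hours 7 min 28 sec")
cpu_time = duration_to_seconds("20 days 18 hours 21 min 27 sec")

# Fraction of the elapsed run time that was spent on the CPU.
cpu_fraction = cpu_time / run_time
print(run_time, cpu_time, round(cpu_fraction, 3))
```

With the values from this task, the CPU time comes to roughly 60% of the run time.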
Stderr
<core_client_version>7.3.11</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
17:32:30 (5768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4880, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2352, iMonCtr=1
Model crash detected, will try to restart...
16:58:13 (5060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:01:16 (3600): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:01:17 (3600): No heartbeat from core client for 30 sec - exiting
17:10:25 (5740): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:16:28 (1116): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:29:04 (1140): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:32:06 (3928): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:14:24 (1432): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:26:27 (5796): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2884, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3692, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2208, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2168, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4728, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4872, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4492, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4216, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4136, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3956, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3408, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3604, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4852, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3108, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3440, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4624, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2564, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4576, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3384, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4772, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...

zip error: Could not create output file (was replacing the original zip file)
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2876, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4688, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3448, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3020, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2064, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4332, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4332, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4332, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4332, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4332, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4332, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
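A log like the one above is easier to read once its recurring messages are tallied. The following is a minimal sketch that counts a few message types by substring; the function name `count_events` and the category labels are illustrative assumptions, and the sample text is a short excerpt of the messages that appear in this task's stderr.

```python
from collections import Counter

# Substrings taken from the stderr messages; the keys are illustrative labels.
PATTERNS = {
    "no_heartbeat": "No heartbeat from core client",
    "model_crash": "Model crash detected",
    "suspend_request": "Suspend request from BOINC",
    "quit_request": "Quit request from BOINC",
    "signal_22": "Signal 22 received",
}

def count_events(stderr_text):
    """Count how many lines contain each known message substring."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for name, needle in PATTERNS.items():
            if needle in line:
                counts[name] += 1
    return counts

sample = """17:32:30 (5768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4880, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
"""

counts = count_events(sample)
for name in PATTERNS:
    print(name, counts[name])
```

Substring matching is used rather than whole-line comparison, so prefixes such as timestamps and process IDs do not affect the counts.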
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
21 Sep 2014 11:09:18  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  673,920   1,778,205       2.6386
17 Sep 2014 13:18:38  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  648,000   1,711,321       2.6409
14 Sep 2014 19:11:43  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  622,080   1,644,943       2.6443
10 Sep 2014 19:55:44  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  596,160   1,577,650       2.6464
07 Sep 2014 12:35:51  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  570,240   1,510,686       2.6492
03 Sep 2014 13:40:47  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  544,320   1,443,769       2.6524
31 Aug 2014 19:09:16  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  518,400   1,377,291       2.6568
20 Aug 2014 15:55:53  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  492,480   1,309,749       2.6595
14 Aug 2014 15:49:09  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  466,560   1,242,748       2.6636
31 Jul 2014 19:23:18  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  440,640   1,173,395       2.6629
23 Jul 2014 16:44:52  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  414,720   1,105,850       2.6665
12 Jul 2014 10:44:02  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  388,800   1,037,934       2.6696
08 Jul 2014 20:41:05  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  362,880   970,215         2.6737
03 Jul 2014 13:51:57  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  336,960   902,639         2.6788
29 Jun 2014 18:33:05  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  311,040   835,118         2.6849
26 Jun 2014 18:52:52  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  285,120   767,037         2.6902
21 Jun 2014 14:16:24  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  259,200   698,656         2.6954
16 Jun 2014 15:49:00  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  233,280   629,615         2.6990
12 Jun 2014 15:20:22  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  207,360   560,720         2.7041
07 Jun 2014 07:47:16  851413   16606351   hadcm3n_8c6n_1980_40_008725210_3  181,440   491,878         2.7110
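
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count; the listed values are consistent with that reading, though the page itself does not state the formula. A minimal sketch that recomputes it for the three most recent trickles (the function name `average_sec_per_timestep` is illustrative):

```python
def average_sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds divided by the number of timesteps completed."""
    return cpu_time_sec / timestep

# (timestep, CPU time in seconds) for the three most recent trickles.
trickles = [
    (673920, 1778205),
    (648000, 1711321),
    (622080, 1644943),
]

averages = [f"{average_sec_per_timestep(cpu, ts):.4f}" for ts, cpu in trickles]
for (ts, cpu), avg in zip(trickles, averages):
    print(ts, cpu, avg)
```

Consecutive trickles in the table are 25,920 timesteps apart, so each row extends the cumulative figures by one such interval.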


©2024 climateprediction.net