climateprediction.net home page
Task 15729054

Task 15729054

Name hadcm3n_zc6k_1960_40_008350439_0
Workunit 8501300
Created 17 Apr 2013, 19:00:26 UTC
Sent 17 Apr 2013, 19:03:19 UTC
Report deadline 18 Jul 2013, 2:30:30 UTC
Received 18 May 2013, 9:49:30 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1169024
Run time 10 days 21 hours 34 min 54 sec
CPU time 10 days 10 hours 6 min 26 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 2.32 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6804, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6804, iMonCtr=1
Model crash detected, will try to restart...
13:36:30 (6608): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:31:27 (6312): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6248, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4320, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4996, iMonCtr=1
Model crash detected, will try to restart...
10:36:06 (6232): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1116, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1116, iMonCtr=1
Model crash detected, will try to restart...
07:36:06 (5576): No heartbeat from core client for 30 sec - exiting
07:36:07 (5576): No heartbeat from core client for 30 sec - exiting
07:36:08 (5576): No heartbeat from core client for 30 sec - exiting
07:36:09 (5576): No heartbeat from core client for 30 sec - exiting
07:36:11 (5576): No heartbeat from core client for 30 sec - exiting
07:36:12 (5576): No heartbeat from core client for 30 sec - exiting
07:36:13 (5576): No heartbeat from core client for 30 sec - exiting
07:36:14 (5576): No heartbeat from core client for 30 sec - exiting
07:36:15 (5576): No heartbeat from core client for 30 sec - exiting
07:36:16 (5576): No heartbeat from core client for 30 sec - exiting
07:36:17 (5576): No heartbeat from core client for 30 sec - exiting
07:36:18 (5576): No heartbeat from core client for 30 sec - exiting
07:36:19 (5576): No heartbeat from core client for 30 sec - exiting
07:36:20 (5576): No heartbeat from core client for 30 sec - exiting
07:36:21 (5576): No heartbeat from core client for 30 sec - exiting
07:36:23 (5576): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:48:23 (448): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:22:32 (4232): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:43:25 (3916): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:49:28 (6044): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3264, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3264, iMonCtr=1
Model crash detected, will try to restart...
18:58:07 (3376): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6336, iMonCtr=1
Model crash deController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1252, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1252, iMonCtr=1
Model crash detected, will try to restart...
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/zc6kko.pji0c10
Error converting file to netcdf: dataout/zc6kko.pii0c10
Error converting file to netcdf: dataout/zc6kko.pfi0c10
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
18 May 2013 06:57:56 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 518,400 900,300 1.7367
16 May 2013 18:40:37 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 492,480 854,520 1.7351
15 May 2013 20:36:33 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 466,560 810,919 1.7381
13 May 2013 15:32:50 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 440,640 766,336 1.7391
09 May 2013 08:36:02 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 414,720 720,611 1.7376
08 May 2013 13:10:14 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 388,800 678,465 1.7450
04 May 2013 16:36:09 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 362,880 634,597 1.7488
03 May 2013 17:33:40 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 336,960 590,179 1.7515
02 May 2013 18:15:06 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 311,040 546,251 1.7562
01 May 2013 20:36:14 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 285,120 502,463 1.7623
01 May 2013 08:28:47 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 259,200 458,501 1.7689
30 Apr 2013 10:36:05 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 233,280 414,423 1.7765
29 Apr 2013 12:03:45 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 207,360 368,874 1.7789
28 Apr 2013 10:01:33 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 181,440 321,922 1.7743
27 Apr 2013 10:43:33 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 155,520 274,794 1.7669
26 Apr 2013 12:19:02 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 129,600 229,783 1.7730
25 Apr 2013 11:38:03 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 103,680 184,021 1.7749
23 Apr 2013 22:11:16 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 77,760 137,169 1.7640
23 Apr 2013 08:34:00 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 51,840 91,110 1.7575
22 Apr 2013 08:03:35 1169024 15729054 hadcm3n_zc6k_1960_40_008350439_0 25,920 44,228 1.7063


©2024 climateprediction.net