climateprediction.net home page
Task 15462140

Task 15462140

Name hadcm3n_z96l_1880_40_008253845_1
Workunit 8408969
Created 26 Nov 2012, 13:04:08 UTC
Sent 26 Nov 2012, 13:38:14 UTC
Report deadline 25 Feb 2013, 21:05:25 UTC
Received 28 Jan 2013, 20:37:23 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1254939
Run time 24 days 17 hours 19 min 8 sec
CPU time 23 days 3 hours 29 min 31 sec
Validate state Invalid
Credit 12,130.56
Device peak FLOPS 2.30 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4108, iMonCtr=1
Model crash detected, will try to restart...
12:27:59 (3900): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5116, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6872, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4016, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1504, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:39:19 (6560): No heartbeat from core client for 30 sec - exiting
15:39:20 (6560): No heartbeat from core client for 30 sec - exiting
15:39:21 (6560): No heartbeat from core client for 30 sec - exiting
15:39:22 (6560): No heartbeat from core client for 30 sec - exiting
15:39:23 (6560): No heartbeat from core client for 30 sec - exiting
15:39:24 (6560): No heartbeat from core client for 30 sec - exiting
15:39:25 (6560): No heartbeat from core client for 30 sec - exiting
15:39:26 (6560): No heartbeat from core client for 30 sec - exiting
15:39:27 (6560): No heartbeat from core client for 30 sec - exiting
15:39:28 (6560): No heartbeat from core client for 30 sec - exiting
15:39:29 (6560): No heartbeat from core client for 30 sec - exiting
15:39:30 (6560): No heartbeat from core client for 30 sec - exiting
15:39:31 (6560): No heartbeat from core client for 30 sec - exiting
15:39:32 (6560): No heartbeat from core client for 30 sec - exiting
15:39:33 (6560): No heartbeat from core client for 30 sec - exiting
15:39:34 (6560): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
16:51:00 (4384): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:59:59 (4648): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4888, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4076, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:02:30 (4628): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:41:44 (5740): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:50:48 (4940): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
10:10:27 (3232): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:32:49 (6784): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
12:10:47 (1512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:20:42 (2948): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6812, iMonCtr=1
Model crash detected, will try to restart...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
28 Jan 2013 08:54:32 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 1,010,880 1,959,872 1.9388
27 Jan 2013 10:41:25 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 984,960 1,911,422 1.9406
26 Jan 2013 12:40:08 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 959,040 1,862,373 1.9419
25 Jan 2013 09:45:29 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 933,120 1,814,476 1.9445
24 Jan 2013 11:01:29 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 907,200 1,768,117 1.9490
23 Jan 2013 11:23:40 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 881,280 1,721,097 1.9530
22 Jan 2013 08:55:31 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 855,360 1,675,008 1.9582
20 Jan 2013 17:11:19 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 829,440 1,628,438 1.9633
19 Jan 2013 16:30:52 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 803,520 1,579,996 1.9663
18 Jan 2013 14:36:15 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 777,600 1,532,436 1.9707
17 Jan 2013 15:01:28 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 751,680 1,485,294 1.9760
09 Jan 2013 11:55:52 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 725,760 1,437,941 1.9813
07 Jan 2013 19:07:38 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 699,840 1,389,443 1.9854
06 Jan 2013 17:43:17 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 673,920 1,342,086 1.9915
05 Jan 2013 14:10:36 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 648,000 1,293,029 1.9954
01 Jan 2013 19:07:08 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 622,080 1,242,965 1.9981
31 Dec 2012 17:46:41 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 596,160 1,191,989 1.9994
30 Dec 2012 16:31:11 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 570,240 1,141,128 2.0011
29 Dec 2012 15:52:16 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 544,320 1,090,040 2.0026
28 Dec 2012 14:30:47 1254939 15462140 hadcm3n_z96l_1880_40_008253845_1 518,400 1,038,865 2.0040


©2024 climateprediction.net