Task 15887396

Name hadcm3n_48sj_1980_40_008400210_0
Workunit 8551066
Created 8 Jul 2013, 23:51:22 UTC
Sent 10 Jul 2013, 16:35:22 UTC
Report deadline 10 Oct 2013, 0:02:33 UTC
Received 20 Sep 2013, 20:11:26 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1031487
Run time 23 days 10 hours 32 min 22 sec
CPU time 11 days 12 hours 49 min 2 sec
Validate state Invalid
Credit 5,287.68
Device peak FLOPS 2.06 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
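
The exit status field gives the same value twice: -226 in decimal and 0xFFFFFF1E in hexadecimal. The two agree if the hexadecimal form is read as a 32-bit two's-complement integer. The sketch below checks that arithmetic; the helper name `to_signed32` is hypothetical and not part of BOINC or CPDN.

```python
def to_signed32(value: int) -> int:
    """Interpret an unsigned 32-bit value as a two's-complement signed integer.

    Hypothetical helper for illustration only; not a BOINC function.
    """
    value &= 0xFFFFFFFF  # keep only the low 32 bits
    # If the sign bit is set, subtract 2**32 to get the negative value.
    return value - 0x100000000 if value & 0x80000000 else value


print(to_signed32(0xFFFFFF1E))  # -226
```

Masking to 32 bits first keeps the helper safe for inputs wider than 32 bits.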
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1388, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2776, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=428, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4420, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4988, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2784, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=420, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5008, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5360, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4608, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
10:28:31 (6540): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4336, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4976, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5156, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4872, iMonCtr=1
Model crash detected, will try to restart...
09:41:37 (5736): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:26:18 (5244): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12400, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4932, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2188, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4400, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5288, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4560, iMonCtr=1
Model crash detected, will try to restart...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
15 Sep 2013 07:28:30  1031487  15887396   hadcm3n_48sj_1980_40_008400210_0   440,640         976,724            2.2166
02 Sep 2013 15:22:43  1031487  15887396   hadcm3n_48sj_1980_40_008400210_0   414,720         919,702            2.2176
30 Aug 2013 07:07:01  1031487  15887396   hadcm3n_48sj_1980_40_008400210_0   388,800         861,701            2.2163
28 Aug 2013 23:42:18  1031487  15887396   hadcm3n_48sj_1980_40_008400210_0   362,880         804,529            2.2171
18 Aug 2013 16:16:18  1031487  15887396   hadcm3n_48sj_1980_40_008400210_0   336,960         742,733            2.2042
18 Aug 2013 16:16:18  1031487  15887396   hadcm3n_48sj_1980_40_008400210_0   311,040         681,735            2.1918
18 Aug 2013 16:16:18  1031487  15887396   hadcm3n_48sj_1980_40_008400210_0   285,120         620,643            2.1768
18 Aug 2013 16:16:18  1031487  15887396   hadcm3n_48sj_1980_40_008400210_0   259,200         560,649            2.1630
29 Jul 2013 12:54:49  1031487  15887396   hadcm3n_48sj_1980_40_008400210_0   233,280         500,031            2.1435
24 Jul 2013 20:12:35  1031487  15887396   hadcm3n_48sj_1980_40_008400210_0   207,360         440,456            2.1241
23 Jul 2013 21:49:30  1031487  15887396   hadcm3n_48sj_1980_40_008400210_0   181,440         383,515            2.1137
23 Jul 2013 20:33:19  1031487  15887396   hadcm3n_48sj_1980_40_008400210_0   155,520         329,182            2.1167
23 Jul 2013 17:17:49  1031487  15887396   hadcm3n_48sj_1980_40_008400210_0   129,600         273,930            2.1137
23 Jul 2013 17:17:49  1031487  15887396   hadcm3n_48sj_1980_40_008400210_0   103,680         219,865            2.1206
23 Jul 2013 17:17:49  1031487  15887396   hadcm3n_48sj_1980_40_008400210_0    77,760         161,446            2.0762
23 Jul 2013 17:17:49  1031487  15887396   hadcm3n_48sj_1980_40_008400210_0    51,840         109,614            2.1145
11 Jul 2013 18:46:58  1031487  15887396   hadcm3n_48sj_1980_40_008400210_0    25,920          57,330            2.2118
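
In the trickle rows above, the Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. This is an inference from the numbers, not something the page states. The sketch below reproduces the first and last rows under that assumption; the function name `seconds_per_timestep` is hypothetical.

```python
def seconds_per_timestep(cpu_seconds: float, timesteps: int) -> float:
    """Average CPU seconds per model timestep (assumed definition of the column)."""
    return cpu_seconds / timesteps


# (timestep, cumulative CPU seconds) copied from the newest and oldest trickles.
rows = [(440_640, 976_724), (25_920, 57_330)]
for timesteps, cpu_seconds in rows:
    print(timesteps, round(seconds_per_timestep(cpu_seconds, timesteps), 4))
# 440640 2.2166
# 25920 2.2118
```

Both results match the reported averages to four decimal places, which supports the assumed definition without confirming it.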


©2024 climateprediction.net