climateprediction.net home page
Task 14662708

Task 14662708

Name hadam3p_pnw_c88y_1985_1_007943649_1
Workunit 8098761
Created 13 May 2012, 9:08:06 UTC
Sent 13 May 2012, 9:17:45 UTC
Report deadline 25 Apr 2013, 14:37:45 UTC
Received 7 Jul 2012, 0:47:32 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1164541
Run time 8 days 5 hours 20 min 6 sec
CPU time 1 days 18 hours 53 min 30 sec
Validate state Invalid
Credit 2,505.24
Device peak FLOPS 2.02 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4848, selfPID=3916, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3984, selfPID=3984, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4952, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4564, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4848, selfPID=2720, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4048, selfPID=4396, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1388, selfPID=2956, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3484, selfPID=936, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3644, selfPID=4952, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3956, selfPID=2352, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6756, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1220, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 5
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3748, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1100, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3452, selfPID=3032, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1200, selfPID=4204, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3500, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5416, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5500, selfPID=4432, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 10
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:24:17 (3860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4120, selfPID=2912, iMonCtr=1
Model crash detected, will try to restart...
19:48:45 (3328): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:47:40 (7624): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:46:25 (6292): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9776, selfPID=9776, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3948, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 3
Called boinc_finish

</stderr_txt><message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_c88y_1985_1_007943649_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_c88y_1985_1_007943649_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 May 2012 22:46:30 1164541 14662708 hadam3p_pnw_c88y_1985_1_007943649_1 115,296 420,957 3.6511
22 May 2012 22:38:54 1164541 14662708 hadam3p_pnw_c88y_1985_1_007943649_1 103,776 379,811 3.6599
21 May 2012 12:24:22 1164541 14662708 hadam3p_pnw_c88y_1985_1_007943649_1 92,256 338,792 3.6723
20 May 2012 16:01:09 1164541 14662708 hadam3p_pnw_c88y_1985_1_007943649_1 80,736 298,405 3.6961
20 May 2012 01:41:56 1164541 14662708 hadam3p_pnw_c88y_1985_1_007943649_1 69,216 257,618 3.7219
18 May 2012 14:10:49 1164541 14662708 hadam3p_pnw_c88y_1985_1_007943649_1 57,696 214,830 3.7235
17 May 2012 19:43:54 1164541 14662708 hadam3p_pnw_c88y_1985_1_007943649_1 46,176 170,595 3.6945
16 May 2012 17:16:45 1164541 14662708 hadam3p_pnw_c88y_1985_1_007943649_1 34,656 127,316 3.6737
15 May 2012 08:12:07 1164541 14662708 hadam3p_pnw_c88y_1985_1_007943649_1 23,137 86,029 3.7182
15 May 2012 08:12:07 1164541 14662708 hadam3p_pnw_c88y_1985_1_007943649_1 23,136 85,533 3.6970
14 May 2012 05:39:08 1164541 14662708 hadam3p_pnw_c88y_1985_1_007943649_1 11,616 43,244 3.7228


©2024 climateprediction.net