Task 14453668

Name hadam3p_pnw_b3fc_1959_1_007890171_2
Workunit 8045283
Created 18 Apr 2012, 2:55:27 UTC
Sent 18 Apr 2012, 3:02:53 UTC
Report deadline 31 Mar 2013, 8:22:53 UTC
Received 22 Apr 2012, 3:38:53 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 962831
Run time 1 days 20 hours 22 min 49 sec
CPU time 1 days 19 hours 44 min
Validate state Invalid
Credit 1,253.67
Device peak FLOPS 3.11 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 (windows_intelx86)
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<stderr_txt>
06:09:30 (1528): No heartbeat from core client for 30 sec - exiting
06:09:31 (1528): No heartbeat from core client for 30 sec - exiting
06:09:32 (1528): No heartbeat from core client for 30 sec - exiting
06:09:33 (1528): No heartbeat from core client for 30 sec - exiting
06:09:34 (1528): No heartbeat from core client for 30 sec - exiting
06:09:35 (1528): No heartbeat from core client for 30 sec - exiting
06:09:36 (1528): No heartbeat from core client for 30 sec - exiting
06:09:37 (1528): No heartbeat from core client for 30 sec - exiting
06:09:38 (1528): No heartbeat from core client for 30 sec - exiting
06:09:39 (1528): No heartbeat from core client for 30 sec - exiting
06:09:40 (1528): No heartbeat from core client for 30 sec - exiting
06:09:41 (1528): No heartbeat from core client for 30 sec - exiting
06:09:42 (1528): No heartbeat from core client for 30 sec - exiting
06:09:43 (1528): No heartbeat from core client for 30 sec - exiting
06:09:44 (1528): No heartbeat from core client for 30 sec - exiting
06:09:45 (1528): No heartbeat from core client for 30 sec - exiting
06:09:46 (1528): No heartbeat from core client for 30 sec - exiting
06:09:47 (1528): No heartbeat from core client for 30 sec - exiting
06:09:48 (1528): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
06:00:29 (7700): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:00:30 (7700): No heartbeat from core client for 30 sec - exiting
06:02:02 (13736): No heartbeat from core client for 30 sec - exiting
06:02:03 (13736): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=13948, iMonCtr=1
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=2004, iMonCtr=1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11780, selfPID=4116, iMonCtr=1
Model crash detected, will try to restart...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11780, selfPID=11780, iMonCtr=2
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 5
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_b3fc_1959_1_007890171_2_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b3fc_1959_1_007890171_2_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b3fc_1959_1_007890171_2_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b3fc_1959_1_007890171_2_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b3fc_1959_1_007890171_2_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b3fc_1959_1_007890171_2_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b3fc_1959_1_007890171_2_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
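The upload failures in the message block above can be tallied mechanically. The following is a minimal sketch, assuming the message text has been saved as a plain string; the names `SAMPLE_MESSAGE`, `parse_transfer_errors` and `count_by_code` are hypothetical and not part of BOINC or CPDN, and the sample holds only the first two of the seven entries.

```python
import re
from collections import Counter

# Hypothetical sample: the first two <file_xfer_error> entries from the message block.
SAMPLE_MESSAGE = """upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_b3fc_1959_1_007890171_2_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b3fc_1959_1_007890171_2_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
"""

# The message is not a single well-formed XML document (it starts with free
# text and has several top-level elements), so a regex is used instead of an
# XML parser.
XFER_ERROR = re.compile(
    r"<file_xfer_error>\s*"
    r"<file_name>(.*?)</file_name>\s*"
    r"<error_code>(-?\d+)</error_code>\s*"
    r"</file_xfer_error>"
)


def parse_transfer_errors(message):
    """Return a list of (file_name, error_code) pairs found in the message."""
    return [(name, int(code)) for name, code in XFER_ERROR.findall(message)]


def count_by_code(errors):
    """Count how many files failed with each error code."""
    return Counter(code for _, code in errors)


if __name__ == "__main__":
    errors = parse_transfer_errors(SAMPLE_MESSAGE)
    for name, code in errors:
        print(f"{name}: {code}")
    print(dict(count_by_code(errors)))
```

Run against the full message, the same sketch would list all seven zip files (suffixes _6 to _12), each with error code -161.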
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
21 Apr 2012 13:52:05  962831   14453668   hadam3p_pnw_b3fc_1959_1_007890171_2  57,696    140,054         2.4274
21 Apr 2012 04:46:43  962831   14453668   hadam3p_pnw_b3fc_1959_1_007890171_2  46,176    112,110         2.4279
20 Apr 2012 13:34:39  962831   14453668   hadam3p_pnw_b3fc_1959_1_007890171_2  34,656    84,143          2.4279
20 Apr 2012 04:32:26  962831   14453668   hadam3p_pnw_b3fc_1959_1_007890171_2  23,136    56,297          2.4333
19 Apr 2012 12:24:09  962831   14453668   hadam3p_pnw_b3fc_1959_1_007890171_2  11,616    28,414          2.4461
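
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against the five reported rows; it is an inference from the numbers shown, not a documented CPDN formula, and the names `TRICKLES`, `average_sec_per_timestep` and `rows_match` are hypothetical.

```python
# Rows copied from the trickle table:
# (timestep, cpu_time_sec, reported_average_sec_per_ts)
TRICKLES = [
    (57696, 140054, 2.4274),
    (46176, 112110, 2.4279),
    (34656, 84143, 2.4279),
    (23136, 56297, 2.4333),
    (11616, 28414, 2.4461),
]


def average_sec_per_timestep(timestep, cpu_time_sec):
    """Assumed relation: cumulative CPU seconds per cumulative model timestep."""
    return cpu_time_sec / timestep


def rows_match(rows, tolerance=1e-4):
    """Return one boolean per row: does the recomputed average agree with
    the reported one to within the given tolerance?"""
    return [
        abs(average_sec_per_timestep(ts, cpu) - reported) < tolerance
        for ts, cpu, reported in rows
    ]


if __name__ == "__main__":
    for ts, cpu, reported in TRICKLES:
        computed = average_sec_per_timestep(ts, cpu)
        print(f"timestep={ts} computed={computed:.4f} reported={reported:.4f}")
    print("all rows consistent:", all(rows_match(TRICKLES)))
```

The tolerance is there because the page shows the average to four decimal places and may round or truncate from a more precise CPU time than the whole seconds listed.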


©2024 climateprediction.net