climateprediction.net home page
Task 17942128

Task 17942128

Name hadam3p_anz_n34k_2013_1_009520605_0
Workunit 9602348
Created 11 Feb 2015, 19:27:42 UTC
Sent 18 Feb 2015, 9:08:08 UTC
Report deadline 31 Jan 2016, 14:28:08 UTC
Received 23 Feb 2015, 12:35:11 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1330326
Run time 2 days 6 hours 46 min 42 sec
CPU time 1 days 23 hours 2 min 58 sec
Validate state Invalid
Credit 2,000.18
Device peak FLOPS 4.49 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.4.36</core_client_version>
<![CDATA[
<stderr_txt>
03:22:43 (2740): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:22:44 (2740): No heartbeat from core client for 30 sec - exiting
03:22:45 (2740): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
03:31:57 (6796): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:31:59 (6796): No heartbeat from core client for 30 sec - exiting
03:32:00 (6796): No heartbeat from core client for 30 sec - exiting
03:32:01 (6796): No heartbeat from core client for 30 sec - exiting
03:32:02 (6796): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
03:41:14 (1696): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:03:39 (6636): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:14:50 (6840): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:14:51 (6840): No heartbeat from core client for 30 sec - exiting
07:14:52 (6840): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
09:35:36 (380): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4344, selfPID=4344, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1148, selfPID=1148, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
04:53:58 (6580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:33:41 (6804): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:33:42 (6804): No heartbeat from core client for 30 sec - exiting
05:33:43 (6804): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=6532, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3264, selfPID=6204, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_n34k_2013_1_009520605_0_5.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_n34k_2013_1_009520605_0_6.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_n34k_2013_1_009520605_0_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_n34k_2013_1_009520605_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_n34k_2013_1_009520605_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_n34k_2013_1_009520605_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_n34k_2013_1_009520605_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_n34k_2013_1_009520605_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
22 Feb 2015 21:38:38 1330326 17942128 hadam3p_anz_n34k_2013_1_009520605_0 46,379 143,156 3.0867
22 Feb 2015 09:45:40 1330326 17942128 hadam3p_anz_n34k_2013_1_009520605_0 34,859 107,346 3.0794
21 Feb 2015 15:53:26 1330326 17942128 hadam3p_anz_n34k_2013_1_009520605_0 23,339 71,853 3.0787
18 Feb 2015 21:55:28 1330326 17942128 hadam3p_anz_n34k_2013_1_009520605_0 11,819 35,524 3.0057


©2024 climateprediction.net