climateprediction.net home page
Task 16949630

Name hadam3p_anz_rnu0_2012_1_008957467_0
Workunit 9101642
Created 27 Aug 2014, 12:15:24 UTC
Sent 31 Aug 2014, 20:00:57 UTC
Report deadline 14 Aug 2015, 1:20:57 UTC
Received 3 Oct 2014, 18:44:24 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1337772
Run time 1 days 0 hours 44 min 55 sec
CPU time 23 hours 31 min 16 sec
Validate state Invalid
Credit 1,006.54
Device peak FLOPS 3.32 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 (windows_intelx86)
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
21:58:29 (5804): No heartbeat from core client for 30 sec - exiting
21:58:30 (5804): No heartbeat from core client for 30 sec - exiting
21:58:31 (5804): No heartbeat from core client for 30 sec - exiting
22:00:02 (5804): No heartbeat from core client for 30 sec - exiting
22:00:03 (5804): No heartbeat from core client for 30 sec - exiting
22:00:04 (5804): No heartbeat from core client for 30 sec - exiting
22:00:05 (5804): No heartbeat from core client for 30 sec - exiting
22:00:06 (5804): No heartbeat from core client for 30 sec - exiting
22:00:07 (5804): No heartbeat from core client for 30 sec - exiting
22:00:08 (5804): No heartbeat from core client for 30 sec - exiting
22:00:09 (5804): No heartbeat from core client for 30 sec - exiting
22:00:10 (5804): No heartbeat from core client for 30 sec - exiting
22:00:11 (5804): No heartbeat from core client for 30 sec - exiting
22:00:12 (5804): No heartbeat from core client for 30 sec - exiting
22:00:13 (5804): No heartbeat from core client for 30 sec - exiting
22:00:14 (5804): No heartbeat from core client for 30 sec - exiting
22:00:15 (5804): No heartbeat from core client for 30 sec - exiting
22:00:16 (5804): No heartbeat from core client for 30 sec - exiting
22:00:17 (5804): No heartbeat from core client for 30 sec - exiting
22:00:18 (5804): No heartbeat from core client for 30 sec - exiting
22:00:19 (5804): No heartbeat from core client for 30 sec - exiting
22:00:20 (5804): No heartbeat from core client for 30 sec - exiting
22:00:21 (5804): No heartbeat from core client for 30 sec - exiting
22:00:22 (5804): No heartbeat from core client for 30 sec - exiting
22:00:23 (5804): No heartbeat from core client for 30 sec - exiting
22:00:24 (5804): No heartbeat from core client for 30 sec - exiting
22:00:25 (5804): No heartbeat from core client for 30 sec - exiting
22:00:26 (5804): No heartbeat from core client for 30 sec - exiting
22:00:27 (5804): No heartbeat from core client for 30 sec - exiting
22:00:28 (5804): No heartbeat from core client for 30 sec - exiting
22:00:29 (5804): No heartbeat from core client for 30 sec - exiting
22:00:30 (5804): No heartbeat from core client for 30 sec - exiting
22:00:31 (5804): No heartbeat from core client for 30 sec - exiting
22:00:32 (5804): No heartbeat from core client for 30 sec - exiting
22:00:33 (5804): No heartbeat from core client for 30 sec - exiting
22:00:34 (5804): No heartbeat from core client for 30 sec - exiting
22:00:35 (5804): No heartbeat from core client for 30 sec - exiting
22:00:36 (5804): No heartbeat from core client for 30 sec - exiting
22:00:38 (5804): No heartbeat from core client for 30 sec - exiting
22:00:39 (5804): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5952, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:30:37 (5352): No heartbeat from core client for 30 sec - exiting
19:30:38 (5352): No heartbeat from core client for 30 sec - exiting
19:30:39 (5352): No heartbeat from core client for 30 sec - exiting
19:30:40 (5352): No heartbeat from core client for 30 sec - exiting
19:30:41 (5352): No heartbeat from core client for 30 sec - exiting
19:30:42 (5352): No heartbeat from core client for 30 sec - exiting
19:30:43 (5352): No heartbeat from core client for 30 sec - exiting
19:30:44 (5352): No heartbeat from core client for 30 sec - exiting
19:30:46 (5352): No heartbeat from core client for 30 sec - exiting
19:30:47 (5352): No heartbeat from core client for 30 sec - exiting
19:30:48 (5352): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2372, selfPID=5792, iMonCtr=1
Model crash detected, will try to restart...
07:38:48 (5708): No heartbeat from core client for 30 sec - exiting
07:38:49 (5708): No heartbeat from core client for 30 sec - exiting
07:38:50 (5708): No heartbeat from core client for 30 sec - exiting
07:38:51 (5708): No heartbeat from core client for 30 sec - exiting
07:38:52 (5708): No heartbeat from core client for 30 sec - exiting
07:38:53 (5708): No heartbeat from core client for 30 sec - exiting
07:38:54 (5708): No heartbeat from core client for 30 sec - exiting
07:38:55 (5708): No heartbeat from core client for 30 sec - exiting
07:38:57 (5708): No heartbeat from core client for 30 sec - exiting
07:38:58 (5708): No heartbeat from core client for 30 sec - exiting
07:38:59 (5708): No heartbeat from core client for 30 sec - exiting
07:39:00 (5708): No heartbeat from core client for 30 sec - exiting
07:39:01 (5708): No heartbeat from core client for 30 sec - exiting
07:39:02 (5708): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:48:17 (4248): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:48:18 (4248): No heartbeat from core client for 30 sec - exiting
07:48:19 (4248): No heartbeat from core client for 30 sec - exiting
07:48:20 (4248): No heartbeat from core client for 30 sec - exiting
07:48:21 (4248): No heartbeat from core client for 30 sec - exiting
07:48:22 (4248): No heartbeat from core client for 30 sec - exiting
07:48:23 (4248): No heartbeat from core client for 30 sec - exiting
07:48:24 (4248): No heartbeat from core client for 30 sec - exiting
07:48:26 (4248): No heartbeat from core client for 30 sec - exiting
07:48:27 (4248): No heartbeat from core client for 30 sec - exiting
07:48:28 (4248): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=6104, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5672, selfPID=5692, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_rnu0_2012_1_008957467_0_3.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rnu0_2012_1_008957467_0_4.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rnu0_2012_1_008957467_0_5.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rnu0_2012_1_008957467_0_6.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rnu0_2012_1_008957467_0_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rnu0_2012_1_008957467_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rnu0_2012_1_008957467_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rnu0_2012_1_008957467_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rnu0_2012_1_008957467_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rnu0_2012_1_008957467_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
28 Sep 2014 08:54:22 | 1337772 | 16949630 | hadam3p_anz_rnu0_2012_1_008957467_0 | 23,339 | 67,056 | 2.8731
13 Sep 2014 22:54:54 | 1337772 | 16949630 | hadam3p_anz_rnu0_2012_1_008957467_0 | 11,819 | 34,717 | 2.9374
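
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against the two trickle rows above. It is an assumption about how the page derives the column, not a documented formula, and the function name `avg_sec_per_timestep` is hypothetical.

```python
# Assumption: Average (sec/TS) = cumulative CPU seconds / cumulative timesteps.
# The values are copied from the two trickle rows; the helper name is illustrative.

def avg_sec_per_timestep(cpu_seconds: float, timesteps: int) -> float:
    """Return CPU seconds spent per model timestep."""
    if timesteps <= 0:
        raise ValueError("timesteps must be positive")
    return cpu_seconds / timesteps

# (timestep, cpu_seconds) pairs, oldest first
trickles = [(11_819, 34_717), (23_339, 67_056)]

# Cumulative averages, rounded to four places like the table
cumulative = [round(avg_sec_per_timestep(cpu, ts), 4) for ts, cpu in trickles]

# Rate over the interval between the two trickles only
(ts0, cpu0), (ts1, cpu1) = trickles
interval_rate = round(avg_sec_per_timestep(cpu1 - cpu0, ts1 - ts0), 4)

print(cumulative)
print(interval_rate)
```

Under this assumption the cumulative values match the table to four decimal places, and the interval rate shows how quickly the model advanced between the two reports.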

