climateprediction.net home page
Task 16463520

Task 16463520

Name hadam3p_anz_p07u_2012_1_008630953_0
Workunit 8777465
Created 3 Apr 2014, 9:27:08 UTC
Sent 12 Apr 2014, 12:16:28 UTC
Report deadline 25 Mar 2015, 17:36:28 UTC
Received 25 Apr 2014, 21:27:07 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1302489
Run time 8 days 0 hours 39 min 52 sec
CPU time 7 days 15 hours 1 min 28 sec
Validate state Invalid
Credit 3,983.32
Device peak FLOPS 2.33 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3604, selfPID=6584, iMonCtr=1
Model crash detected, will try to restart...
05:03:10 (5772): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:04:03 (4260): No heartbeat from core client for 30 sec - exiting
05:04:04 (4260): No heartbeat from core client for 30 sec - exiting
05:04:05 (4260): No heartbeat from core client for 30 sec - exiting
05:04:06 (4260): No heartbeat from core client for 30 sec - exiting
05:04:07 (4260): No heartbeat from core client for 30 sec - exiting
05:04:08 (4260): No heartbeat from core client for 30 sec - exiting
05:04:09 (4260): No heartbeat from core client for 30 sec - exiting
05:04:10 (4260): No heartbeat from core client for 30 sec - exiting
05:04:11 (4260): No heartbeat from core client for 30 sec - exiting
05:04:12 (4260): No heartbeat from core client for 30 sec - exiting
05:04:13 (4260): No heartbeat from core client for 30 sec - exiting
05:04:14 (4260): No heartbeat from core client for 30 sec - exiting
05:04:15 (4260): No heartbeat from core client for 30 sec - exiting
05:04:16 (4260): No heartbeat from core client for 30 sec - exiting
05:04:17 (4260): No heartbeat from core client for 30 sec - exiting
05:04:18 (4260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
23:43:02 (11012): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:43:03 (11012): No heartbeat from core client for 30 sec - exiting
23:43:04 (11012): No heartbeat from core client for 30 sec - exiting
23:43:05 (11012): No heartbeat from core client for 30 sec - exiting
23:43:06 (11012): No heartbeat from core client for 30 sec - exiting
23:43:07 (11012): No heartbeat from core client for 30 sec - exiting
23:43:08 (11012): No heartbeat from core client for 30 sec - exiting
23:43:09 (11012): No heartbeat from core client for 30 sec - exiting
23:43:10 (11012): No heartbeat from core client for 30 sec - exiting
23:43:11 (11012): No heartbeat from core client for 30 sec - exiting
23:43:12 (11012): No heartbeat from core client for 30 sec - exiting
23:43:13 (11012): No heartbeat from core client for 30 sec - exiting
23:43:14 (11012): No heartbeat from core client for 30 sec - exiting
23:43:15 (11012): No heartbeat from core client for 30 sec - exiting
23:43:16 (11012): No heartbeat from core client for 30 sec - exiting
23:43:17 (11012): No heartbeat from core client for 30 sec - exiting
23:43:18 (11012): No heartbeat from core client for 30 sec - exiting
23:43:19 (11012): No heartbeat from core client for 30 sec - exiting
23:43:20 (11012): No heartbeat from core client for 30 sec - exiting
23:43:21 (11012): No heartbeat from core client for 30 sec - exiting
23:43:22 (11012): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=19352, selfPID=19352, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
05:43:58 (11872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4280, selfPID=4280, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=17432, selfPID=17432, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=17892, selfPID=17892, iMonCtr=2
00:02:34 (10660): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=4632, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4040, selfPID=4040, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4040, selfPID=5052, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_p07u_2012_1_008630953_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_p07u_2012_1_008630953_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_p07u_2012_1_008630953_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_p07u_2012_1_008630953_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
24 Apr 2014 08:44:53 1302489 16463520 hadam3p_anz_p07u_2012_1_008630953_0 92,363 600,436 6.5008
23 Apr 2014 07:30:31 1302489 16463520 hadam3p_anz_p07u_2012_1_008630953_0 80,843 524,602 6.4891
20 Apr 2014 08:08:01 1302489 16463520 hadam3p_anz_p07u_2012_1_008630953_0 69,323 445,216 6.4223
19 Apr 2014 01:33:13 1302489 16463520 hadam3p_anz_p07u_2012_1_008630953_0 57,803 367,749 6.3621
17 Apr 2014 23:25:18 1302489 16463520 hadam3p_anz_p07u_2012_1_008630953_0 46,283 294,710 6.3676
16 Apr 2014 22:52:23 1302489 16463520 hadam3p_anz_p07u_2012_1_008630953_0 34,763 218,756 6.2928
15 Apr 2014 23:49:01 1302489 16463520 hadam3p_anz_p07u_2012_1_008630953_0 23,243 143,231 6.1623
15 Apr 2014 01:12:40 1302489 16463520 hadam3p_anz_p07u_2012_1_008630953_0 11,723 71,103 6.0653


©2024 climateprediction.net