Task 18395395

Name hadam3p_anz_r5ez_2012_1_008736633_1
Workunit 8882611
Created 29 Apr 2015, 10:25:34 UTC
Sent 30 Apr 2015, 21:36:30 UTC
Report deadline 12 Apr 2016, 2:56:30 UTC
Received 25 May 2015, 22:26:25 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1356942
Run time 3 days 3 hours 33 min 22 sec
CPU time 2 days 15 hours 0 min 50 sec
Validate state Invalid
Credit 2,993.82
Device peak FLOPS 3.06 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.4.36</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8356, selfPID=7008, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
23:03:28 (12124): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11072, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6924, selfPID=6924, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1612, selfPID=1612, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8980, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1252, iMonCtr=2
Model crash detected, will try to restart...
16:49:11 (6300): No heartbeat from core client for 30 sec - exiting
16:49:12 (6300): No heartbeat from core client for 30 sec - exiting
16:49:14 (6300): No heartbeat from core client for 30 sec - exiting
16:49:15 (6300): No heartbeat from core client for 30 sec - exiting
16:49:16 (6300): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:15:54 (7404): No heartbeat from core client for 30 sec - exiting
17:15:55 (7404): No heartbeat from core client for 30 sec - exiting
17:15:56 (7404): No heartbeat from core client for 30 sec - exiting
17:15:57 (7404): No heartbeat from core client for 30 sec - exiting
17:15:59 (7404): No heartbeat from core client for 30 sec - exiting
17:16:00 (7404): No heartbeat from core client for 30 sec - exiting
17:16:01 (7404): No heartbeat from core client for 30 sec - exiting
17:16:02 (7404): No heartbeat from core client for 30 sec - exiting
17:16:03 (7404): No heartbeat from core client for 30 sec - exiting
17:16:04 (7404): No heartbeat from core client for 30 sec - exiting
17:16:05 (7404): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:16:06 (7404): No heartbeat from core client for 30 sec - exiting
17:19:19 (424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7496, selfPID=7496, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
18:21:17 (7288): No heartbeat from core client for 30 sec - exiting
18:21:18 (7288): No heartbeat from core client for 30 sec - exiting
18:21:19 (7288): No heartbeat from core client for 30 sec - exiting
18:21:20 (7288): No heartbeat from core client for 30 sec - exiting
18:21:21 (7288): No heartbeat from core client for 30 sec - exiting
18:21:22 (7288): No heartbeat from core client for 30 sec - exiting
18:21:23 (7288): No heartbeat from core client for 30 sec - exiting
18:21:24 (7288): No heartbeat from core client for 30 sec - exiting
18:21:25 (7288): No heartbeat from core client for 30 sec - exiting
18:21:26 (7288): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:21:28 (7288): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal =
18:19:28 (6368): No heartbeat from core client for 30 sec - exiting
18:19:30 (6368): No heartbeat from core client for 30 sec - exiting
18:19:31 (6368): No heartbeat from core client for 30 sec - exiting
18:19:32 (6368): No heartbeat from core client for 30 sec - exiting
18:19:33 (6368): No heartbeat from core client for 30 sec - exiting
18:19:34 (6368): No heartbeat from core client for 30 sec - exiting
18:19:35 (6368): No heartbeat from core client for 30 sec - exiting
18:19:36 (6368): No heartbeat from core client for 30 sec - exiting
18:19:37 (6368): No heartbeat from core client for 30 sec - exiting
18:19:38 (6368): No heartbeat from core client for 30 sec - exiting
18:19:39 (6368): No heartbeat from core client for 30 sec - exiting
18:19:40 (6368): No heartbeat from core client for 30 sec - exiting
18:19:42 (6368): No heartbeat from core client for 30 sec - exiting
18:19:43 (6368): No heartbeat from core client for 30 sec - exiting
18:19:44 (6368): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1440, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6432, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8704, selfPID=7856, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_r5ez_2012_1_008736633_1_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r5ez_2012_1_008736633_1_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r5ez_2012_1_008736633_1_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r5ez_2012_1_008736633_1_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r5ez_2012_1_008736633_1_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r5ez_2012_1_008736633_1_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
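The stderr text above repeats the same "No heartbeat from core client" message many times in a row. A minimal Python sketch for tallying those lines per process follows. It assumes the number in parentheses is the ID of the process that wrote the line, and the names `HEARTBEAT` and `heartbeat_counts` are illustrative only, not part of BOINC or CPDN.

```python
import re
from collections import Counter

# Matches lines such as "16:49:11 (6300): No heartbeat from core client ...".
# Assumption: the parenthesised number is the writing process's ID.
HEARTBEAT = re.compile(r"^\d{2}:\d{2}:\d{2} \((\d+)\): No heartbeat from core client")

def heartbeat_counts(lines):
    """Count 'No heartbeat' messages per parenthesised ID."""
    counts = Counter()
    for line in lines:
        match = HEARTBEAT.match(line)
        if match:
            counts[match.group(1)] += 1
    return counts

# A few lines copied from the stderr text above.
sample = [
    "16:49:11 (6300): No heartbeat from core client for 30 sec - exiting",
    "16:49:12 (6300): No heartbeat from core client for 30 sec - exiting",
    "CPDN Monitor - No 'heartbeat' from BOINC...",
    "17:19:19 (424): No heartbeat from core client for 30 sec - exiting",
]
print(heartbeat_counts(sample))
```

Lines that do not start with a timestamp, such as the "CPDN Monitor" messages, are ignored by the pattern.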
Latest Trickles Received
Time Sent (UTC)        Host ID   Result ID   Result Name                           Timestep   CPU Time (sec)   Average (sec/TS)
23 May 2015 18:45:02   1356942   18395395    hadam3p_anz_r5ez_2012_1_008736633_1   69,419     222,886          3.2107
22 May 2015 08:36:07   1356942   18395395    hadam3p_anz_r5ez_2012_1_008736633_1   57,899     185,849          3.2099
19 May 2015 17:46:04   1356942   18395395    hadam3p_anz_r5ez_2012_1_008736633_1   46,379     147,813          3.1871
10 May 2015 19:50:57   1356942   18395395    hadam3p_anz_r5ez_2012_1_008736633_1   34,859     110,769          3.1776
08 May 2015 19:21:43   1356942   18395395    hadam3p_anz_r5ez_2012_1_008736633_1   23,339     74,491           3.1917
08 May 2015 19:12:51   1356942   18395395    hadam3p_anz_r5ez_2012_1_008736633_1   11,819     37,336           3.1590
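
The Average (sec/TS) figures in the rows above appear to be the cumulative CPU time divided by the timestep count. That definition is inferred from the numbers and is not stated on the page. A minimal Python sketch that recomputes the column under this assumption (the names `trickles` and `average_sec_per_ts` are illustrative):

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle rows above.
trickles = [
    (69419, 222886, 3.2107),
    (57899, 185849, 3.2099),
    (46379, 147813, 3.1871),
    (34859, 110769, 3.1776),
    (23339, 74491, 3.1917),
    (11819, 37336, 3.1590),
]

def average_sec_per_ts(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep

for timestep, cpu_time_sec, reported in trickles:
    computed = round(average_sec_per_ts(cpu_time_sec, timestep), 4)
    print(timestep, computed, reported)
```

Under this assumption each recomputed value agrees with the reported one to four decimal places.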