climateprediction.net home page
Task 17943044

Task 17943044

Name hadam3p_anz_n41y_2013_1_009521511_0
Workunit 9603254
Created 11 Feb 2015, 19:35:15 UTC
Sent 17 Feb 2015, 2:59:19 UTC
Report deadline 30 Jan 2016, 8:19:19 UTC
Received 28 Feb 2015, 16:07:08 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1295575
Run time 3 days 19 hours 8 min 29 sec
CPU time 3 days 18 hours 24 min 1 sec
Validate state Invalid
Credit 4,484.28
Device peak FLOPS 3.60 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
14:52:08 (5124): No heartbeat from core client for 30 sec - exiting
14:52:09 (5124): No heartbeat from core client for 30 sec - exiting
14:52:10 (5124): No heartbeat from core client for 30 sec - exiting
14:52:11 (5124): No heartbeat from core client for 30 sec - exiting
14:52:12 (5124): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:52:13 (5124): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6400, selfPID=4748, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7064, selfPID=4768, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5960, selfPID=4748, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5948, selfPID=4768, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6852, selfPID=4744, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2808, selfPID=4720, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4764, selfPID=4976, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5888, selfPID=4724, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6512, selfPID=4904, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6960, selfPID=5412, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6776, selfPID=5156, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6176, selfPID=4800, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6788, selfPID=4964, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7272, selfPID=5076, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7324, selfPID=7232, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=7116, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7256, selfPID=4948, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_n41y_2013_1_009521511_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_n41y_2013_1_009521511_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_n41y_2013_1_009521511_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
28 Feb 2015 04:02:45 1295575 17943044 hadam3p_anz_n41y_2013_1_009521511_0 103,979 323,935 3.1154
27 Feb 2015 08:04:44 1295575 17943044 hadam3p_anz_n41y_2013_1_009521511_0 92,459 287,848 3.1133
25 Feb 2015 17:04:00 1295575 17943044 hadam3p_anz_n41y_2013_1_009521511_0 80,939 252,405 3.1185
23 Feb 2015 16:07:32 1295575 17943044 hadam3p_anz_n41y_2013_1_009521511_0 69,419 216,765 3.1226
22 Feb 2015 04:07:44 1295575 17943044 hadam3p_anz_n41y_2013_1_009521511_0 57,899 180,966 3.1255
21 Feb 2015 18:03:58 1295575 17943044 hadam3p_anz_n41y_2013_1_009521511_0 46,379 144,886 3.1240
20 Feb 2015 19:40:16 1295575 17943044 hadam3p_anz_n41y_2013_1_009521511_0 34,859 109,684 3.1465
18 Feb 2015 17:49:31 1295575 17943044 hadam3p_anz_n41y_2013_1_009521511_0 23,339 74,328 3.1847
17 Feb 2015 19:46:38 1295575 17943044 hadam3p_anz_n41y_2013_1_009521511_0 11,819 37,730 3.1923


©2024 climateprediction.net