climateprediction.net home page
Task 14215886

Task 14215886

Name hadam3p_eu_a3uj_1972_1_007787611_1
Workunit 7942720
Created 3 Mar 2012, 15:33:12 UTC
Sent 3 Mar 2012, 15:44:43 UTC
Report deadline 13 Feb 2013, 21:04:43 UTC
Received 1 Aug 2012, 4:14:53 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1012549
Run time 4 days 12 hours 36 min 34 sec
CPU time 3 days 17 hours 31 min 2 sec
Validate state Invalid
Credit 1,194.02
Device peak FLOPS 1.47 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4132, selfPID=5420, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5120, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4428, selfPID=4428, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2832, selfPID=2832, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4244, iMonCtr=2
Model crash detected, will try to restart...
C20:36:01 (4444): No heartbeat from core client for 30 sec - exiting
20:36:02 (4444): No heartbeat from core client for 30 sec - exiting
20:36:03 (4444): No heartbeat from core client for 30 sec - exiting
20:36:04 (4444): No heartbeat from core client for 30 sec - exiting
20:36:05 (4444): No heartbeat from core client for 30 sec - exiting
20:36:06 (4444): No heartbeat from core client for 30 sec - exiting
20:36:07 (4444): No heartbeat from core client for 30 sec - exiting
20:36:08 (4444): No heartbeat from core client for 30 sec - exiting
20:36:09 (4444): No heartbeat from core client for 30 sec - exiting
20:36:10 (4444): No heartbeat from core client for 30 sec - exiting
20:36:11 (4444): No heartbeat from core client for 30 sec - exiting
20:36:12 (4444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:36:13 (4444): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6060, selfPID=4292, iMonCtr=1
Model crash detected, will try to restart...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5932, selfPID=4556, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3996, selfPID=4476, iMonCtr=1
Model crash detected, will try to restart...
GGController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5332, selfPID=2392, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4864, selfPID=1032, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1148, selfPID=1148, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1148, selfPID=5128, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_a3uj_1972_1_007787611_1_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_a3uj_1972_1_007787611_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_a3uj_1972_1_007787611_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_a3uj_1972_1_007787611_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_a3uj_1972_1_007787611_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_a3uj_1972_1_007787611_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
26 Jul 2012 11:05:55 1012549 14215886 hadam3p_eu_a3uj_1972_1_007787611_1 69,216 289,442 4.1817
09 Jun 2012 08:11:11 1012549 14215886 hadam3p_eu_a3uj_1972_1_007787611_1 57,698 239,712 4.1546
08 Jun 2012 06:51:57 1012549 14215886 hadam3p_eu_a3uj_1972_1_007787611_1 57,696 239,101 4.1442
22 May 2012 07:11:43 1012549 14215886 hadam3p_eu_a3uj_1972_1_007787611_1 46,176 190,767 4.1313
16 May 2012 10:24:15 1012549 14215886 hadam3p_eu_a3uj_1972_1_007787611_1 34,656 143,006 4.1264
14 May 2012 02:45:17 1012549 14215886 hadam3p_eu_a3uj_1972_1_007787611_1 23,136 95,483 4.1270
22 Apr 2012 11:59:33 1012549 14215886 hadam3p_eu_a3uj_1972_1_007787611_1 11,616 47,296 4.0716


©2024 climateprediction.net