climateprediction.net home page
Task 14001877

Task 14001877

Name hadam3p_eu_97c6_1965_1_007729566_0
Workunit 7884674
Created 26 Jan 2012, 14:30:10 UTC
Sent 9 Feb 2012, 7:53:25 UTC
Report deadline 21 Jan 2013, 13:13:25 UTC
Received 21 Feb 2012, 5:01:03 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1140498
Run time 14 hours 42 min 18 sec
CPU time 13 hours 4 min 25 sec
Validate state Invalid
Credit 200.38
Device peak FLOPS 2.70 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:00:43 (101596): No heartbeat from core client for 30 sec - exiting
22:00:44 (101596): No heartbeat from core client for 30 sec - exiting
22:00:45 (101596): No heartbeat from core client for 30 sec - exiting
22:00:46 (101596): No heartbeat from core client for 30 sec - exiting
22:00:47 (101596): No heartbeat from core client for 30 sec - exiting
22:00:48 (101596): No heartbeat from core client for 30 sec - exiting
22:00:49 (101596): No heartbeat from core client for 30 sec - exiting
22:00:50 (101596): No heartbeat from core client for 30 sec - exiting
22:00:51 (101596): No heartbeat from core client for 30 sec - exiting
22:00:52 (101596): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=105588, selfPID=105588, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=105156, selfPID=105156, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
04:09:12 (100008): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVaCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=104936, selfPID=104936, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
05:46:41 (109416): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=110156,06:09:20 (110208): No heartbeat from core client for 30 sec - exiting
06:09:21 (110208): No heartbeat from core client for 30 sec - exiting
06:09:22 (110208): No heartbeat from core client for 30 sec - exiting
06:09:23 (110208): No heartbeat from core client for 30 sec - exiting
06:09:24 (110208): No heartbeat from core client for 30 sec - exiting
06:09:25 (110208): No heartbeat from core client for 30 sec - exiting
06:09:26 (110208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=108876, selfPID=108876, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=107516, selfPID=107516, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN Monitor - Quit request from BOINC...
07:20:15 (96188): No heartbeat from core client for 30 sec - exiting
07:20:26 (96188): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:49:19 (109228): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=21436, iMonCtr=1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=22244, selfPID=24156, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_97c6_1965_1_007729566_0_2.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_97c6_1965_1_007729566_0_3.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_97c6_1965_1_007729566_0_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_97c6_1965_1_007729566_0_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_97c6_1965_1_007729566_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_97c6_1965_1_007729566_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_97c6_1965_1_007729566_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_97c6_1965_1_007729566_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_97c6_1965_1_007729566_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_97c6_1965_1_007729566_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_97c6_1965_1_007729566_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
16 Feb 2012 13:13:50 1140498 14001877 hadam3p_eu_97c6_1965_1_007729566_0 11,616 29,771 2.5629


©2024 climateprediction.net