climateprediction.net home page
Task 12289299

Task 12289299

Name hadam3p_saf_1tf0_1971_1_007004356_0
Workunit 7207672
Created 24 Nov 2010, 12:13:55 UTC
Sent 26 Jan 2011, 12:14:53 UTC
Report deadline 8 Jan 2012, 17:34:53 UTC
Received 15 Feb 2011, 22:56:44 UTC
Server state Over
Outcome Didn't need
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1070002
Run time 3 days 0 hours 36 min 20 sec
CPU time 2 days 16 hours 26 min 28 sec
Validate state Invalid
Credit 935.95
Device peak FLOPS 2.61 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.08
windows_intelx86
Stderr
<core_client_version>6.6.36</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2800, selfPID=2800, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2832, selfPID=3024, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3460, selfPID=3460, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4852, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1384, selfPID=1384, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4460, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4304, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2604, iMonCtr=2
CCPDN Monitor - Quit request from BOINC...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=936, selfPID=4256, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GlobalController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2872, selfPID=4536, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4948, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=384, iMonCtr=2
11:21:21 (2000): No heartbeat from core client for 30 sec - exiting
11:21:22 (2000): No heartbeat from core client for 30 sec - exiting
11:21:23 (2000): No heartbeat from core client for 30 sec - exiting
11:21:24 (2000): No heartbeat from core client for 30 sec - exiting
11:21:25 (2000): No heartbeat from core client for 30 sec - exiting
11:21:26 (2000): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4380, selfPID=204, iMonCtr=1
Model crash detected, will try to restart...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4860, selfPID=4860, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4860, selfPID=3360, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
23:55:33 (3360): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_saf_1tf0_1971_1_007004356_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1tf0_1971_1_007004356_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1tf0_1971_1_007004356_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1tf0_1971_1_007004356_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1tf0_1971_1_007004356_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1tf0_1971_1_007004356_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1tf0_1971_1_007004356_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
12 Feb 2011 09:46:36 1070002 12289299 hadam3p_saf_1tf0_1971_1_007004356_0 57,696 208,476 3.6134
10 Feb 2011 20:35:35 1070002 12289299 hadam3p_saf_1tf0_1971_1_007004356_0 46,176 166,299 3.6014
07 Feb 2011 21:39:26 1070002 12289299 hadam3p_saf_1tf0_1971_1_007004356_0 34,656 124,436 3.5906
04 Feb 2011 19:04:58 1070002 12289299 hadam3p_saf_1tf0_1971_1_007004356_0 23,136 83,190 3.5957
31 Jan 2011 18:27:35 1070002 12289299 hadam3p_saf_1tf0_1971_1_007004356_0 11,616 42,075 3.6222


©2024 climateprediction.net