climateprediction.net home page
Task 14492092

Task 14492092

Name hadam3p_pnw_ccjt_1960_1_007949224_0
Workunit 8104336
Created 18 Apr 2012, 20:58:47 UTC
Sent 20 Apr 2012, 5:58:22 UTC
Report deadline 2 Apr 2013, 11:18:22 UTC
Received 26 Jun 2012, 11:20:35 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1205639
Run time 2 days 19 hours 1 min 45 sec
CPU time 6 min 30 sec
Validate state Invalid
Credit 753.03
Device peak FLOPS 1.92 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:19:42 (3456): No heartbeat from core client for 30 sec - exiting
12:19:43 (3456): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2792, selfPID=3100, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 3
12:23:19 (3100): No heartbeat from core client for 30 sec - exiting
12:23:21 (3100): No heartbeat from core client for 30 sec - exiting
12:23:22 (3100): No heartbeat from core client for 30 sec - exiting
12:23:23 (3100): No heartbeat from core client for 30 sec - exiting
12:23:24 (3100): No heartbeat from core client for 30 sec - exiting
12:23:25 (3100): No heartbeat from core client for 30 sec - exiting
12:23:26 (3100): No heartbeat from core client for 30 sec - exiting
12:23:27 (3100): No heartbeat from core client for 30 sec - exiting
12:23:28 (3100): No heartbeat from core client for 30 sec - exiting
12:23:29 (3100): No heartbeat from core client for 30 sec - exiting
12:23:30 (3100): No heartbeat from core client for 30 sec - exiting
12:23:32 (3100): No heartbeat from core client for 30 sec - exiting
12:23:33 (3100): No heartbeat from core client for 30 sec - exiting
Called boinc_finish
12:23:42 (3100): No heartbeat from core client for 30 sec - exiting
12:23:43 (3100): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
13:18:45 (1200): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:18:46 (1200): No heartbeat from core client for 30 sec - exiting
13:18:47 (1200): No heartbeat from core client for 30 sec - exiting
13:18:48 (1200): No heartbeat from core client for 30 sec - exiting
13:18:49 (1200): No heartbeat from core client for 30 sec - exiting
13:18:50 (1200): No heartbeat from core client for 30 sec - exiting
13:18:52 (1200): No heartbeat from core client for 30 sec - exiting
13:18:53 (1200): No heartbeat from core client for 30 sec - exiting
13:18:54 (1200): No heartbeat from core client for 30 sec - exiting
13:18:55 (1200): No heartbeat from core client for 30 sec - exiting
13:18:56 (1200): No heartbeat from core client for 30 sec - exiting

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_ccjt_1960_1_007949224_0_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ccjt_1960_1_007949224_0_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ccjt_1960_1_007949224_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ccjt_1960_1_007949224_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ccjt_1960_1_007949224_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ccjt_1960_1_007949224_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ccjt_1960_1_007949224_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ccjt_1960_1_007949224_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ccjt_1960_1_007949224_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 Apr 2012 07:10:14 1205639 14492092 hadam3p_pnw_ccjt_1960_1_007949224_0 34,656 126,672 3.6551
22 Apr 2012 09:41:49 1205639 14492092 hadam3p_pnw_ccjt_1960_1_007949224_0 23,136 84,478 3.6514
21 Apr 2012 12:15:19 1205639 14492092 hadam3p_pnw_ccjt_1960_1_007949224_0 11,616 42,571 3.6649


©2024 climateprediction.net