Task 14467914

Name hadam3p_pnw_buqs_1963_1_007926147_0
Workunit 8081259
Created 18 Apr 2012, 13:37:22 UTC
Sent 3 May 2012, 13:39:26 UTC
Report deadline 15 Apr 2013, 18:59:26 UTC
Received 17 Jun 2012, 20:53:11 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1075479
Run time 19 hours 50 min
CPU time 19 hours 19 min 56 sec
Validate state Invalid
Credit 502.72
Device peak FLOPS 2.92 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 (windows_intelx86)
Stderr
<core_client_version>6.12.33</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4612, selfPID=4116, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
12:39:05 (1112): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4844, iMonCtr=2
Model crash detected, will try to restart...
07:12:36 (3296): No heartbeat from core client for 30 sec - exiting
07:12:37 (3296): No heartbeat from core client for 30 sec - exiting
07:12:38 (3296): No heartbeat from core client for 30 sec - exiting
07:12:39 (3296): No heartbeat from core client for 30 sec - exiting
07:12:40 (3296): No heartbeat from core client for 30 sec - exiting
07:12:41 (3296): No heartbeat from core client for 30 sec - exiting
07:12:42 (3296): No heartbeat from core client for 30 sec - exiting
07:12:43 (3296): No heartbeat from core client for 30 sec - exiting
07:12:44 (3296): No heartbeat from core client for 30 sec - exiting
07:12:46 (3296): No heartbeat from core client for 30 sec - exiting
07:12:47 (3296): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:20:00 (4548): No heartbeat from core client for 30 sec - exiting
07:20:01 (4548): No heartbeat from core client for 30 sec - exiting
07:20:02 (4548): No heartbeat from core client for 30 sec - exiting
07:20:03 (4548): No heartbeat from core client for 30 sec - exiting
07:20:04 (4548): No heartbeat from core client for 30 sec - exiting
07:20:05 (4548): No heartbeat from core client for 30 sec - exiting
07:20:06 (4548): No heartbeat from core client for 30 sec - exiting
07:20:07 (4548): No heartbeat from core client for 30 sec - exiting
07:20:08 (4548): No heartbeat from core client for 30 sec - exiting
07:20:09 (4548): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4524, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2320, selfPID=5320, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=424, selfPID=272, iMonCtr=1
Model crash detected, will try to restart...
11:10:43 (840): No heartbeat from core client for 30 sec - exiting
11:10:44 (840): No heartbeat from core client for 30 sec - exiting
11:10:45 (840): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

SETPOS: Seek Failed: Invalid argument
SETPOS: Unit 63 to Word Address -198 Failed with Error Code -1

Model crashed: SETPOS: Unit 63 to Word Address -198 Failed with Error Code -1
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 2
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_buqs_1963_1_007926147_0_3.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_buqs_1963_1_007926147_0_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_buqs_1963_1_007926147_0_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_buqs_1963_1_007926147_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_buqs_1963_1_007926147_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_buqs_1963_1_007926147_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_buqs_1963_1_007926147_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_buqs_1963_1_007926147_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_buqs_1963_1_007926147_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_buqs_1963_1_007926147_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
14 Jun 2012 14:12:05  1075479  14467914   hadam3p_pnw_buqs_1963_1_007926147_0  23,136    52,669          2.2765
12 Jun 2012 13:26:24  1075479  14467914   hadam3p_pnw_buqs_1963_1_007926147_0  11,620    27,140          2.3356
11 Jun 2012 21:55:18  1075479  14467914   hadam3p_pnw_buqs_1963_1_007926147_0  11,616    26,866          2.3128
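
The Average (sec/TS) column in the trickle table appears to be CPU time divided by timestep. The page itself does not state this definition, so treat it as an assumption inferred from the three rows shown. A minimal sketch that recomputes the column from values copied out of the table (the function name sec_per_timestep is hypothetical):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_avg).
trickles = [
    (23136, 52669, 2.2765),
    (11620, 27140, 2.3356),
    (11616, 26866, 2.3128),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Average CPU seconds per model timestep (assumed definition)."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in trickles:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    # Compare the recomputed value with the value shown on the page.
    print(f"{timestep:>6} {cpu_time_sec:>6} {computed:.4f} (reported {reported})")
```

Under that assumption, each recomputed value agrees with the reported one to four decimal places.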

