Task 12628033

Name hadam3p_pnw_36gr_1959_1_007188531_0
Workunit 7386813
Created 22 Feb 2011, 13:35:32 UTC
Sent 22 Feb 2011, 17:10:33 UTC
Report deadline 4 Feb 2012, 22:30:33 UTC
Received 3 Jun 2011, 7:29:12 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1106458
Run time 2 days 9 hours 26 min 35 sec
CPU time 2 days 7 hours 8 min 40 sec
Validate state Invalid
Credit 1,253.73
Device peak FLOPS 3.01 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.08 (windows_intelx86)
Stderr
<core_client_version>6.12.26</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4292, selfPID=4188, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4868, iMonCtr=2
ControllController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3016, selfPID=3648, iMonCtr=1
Model crash detected, will try to restart...
19:49:42 (3256): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5432, selfPID=5432, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:48:14 (5820): No heartbeat from core client for 30 sec - exiting
19:48:15 (5820): No heartbeat from core client for 30 sec - exiting
19:48:16 (5820): No heartbeat from core client for 30 sec - exiting
19:48:18 (5820): No heartbeat from core client for 30 sec - exiting
19:48:19 (5820): No heartbeat from core client for 30 sec - exiting
19:48:20 (5820): No heartbeat from core client for 30 sec - exiting
19:48:21 (5820): No heartbeat from core client for 30 sec - exiting
19:48:22 (5820): No heartbeat from core client for 30 sec - exiting
19:48:23 (5820): No heartbeat from core client for 30 sec - exiting
19:48:24 (5820): No heartbeat from core client for 30 sec - exiting
19:48:25 (5820): No heartbeat from core client for 30 sec - exiting
19:48:26 (5820): No heartbeat from core client for 30 sec - exiting
19:48:27 (5820): No heartbeat from core client for 30 sec - exiting
19:48:28 (5820): No heartbeat from core client for 30 sec - exiting
19:48:30 (5820): No heartbeat from core client for 30 sec - exiting
19:48:31 (5820): No heartbeat from core client for 30 sec - exiting
19:48:32 (5820): No heartbeat from core client for 30 sec - exiting
19:48:33 (5820): No heartbeat from core client for 30 sec - exiting
19:48:34 (5820): No heartbeat from core client for 30 sec - exiting
19:48:35 (5820): No heartbeat from core client for 30 sec - exiting
19:48:36 (5820): No heartbeat from core client for 30 sec - exiting
19:48:37 (5820): No heartbeat from core client for 30 sec - exiting
19:48:38 (5820): No heartbeat from core client for 30 sec - exiting
19:48:39 (5820): No heartbeat from core client for 30 sec - exiting
19:48:40 (5820): No heartbeat from core client for 30 sec - exiting
19:48:42 (5820): No heartbeat from core client for 30 sec - exiting
19:48:43 (5820): No heartbeat from core client for 30 sec - exiting
19:48:44 (5820): No heartbeat from core client for 30 sec - exiting
19:48:45 (5820): No heartbeat from core client for 30 sec - exiting
19:48:46 (5820): No heartbeat from core client for 30 sec - exiting
19:48:47 (5820): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1736, selfPID=608, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6116, iMonCtr=2
CSuspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1280, selfPID=4816, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:59:12 (1320): No heartbeat from core client for 30 sec - exiting
18:59:14 (1320): No heartbeat from core client for 30 sec - exiting
18:59:15 (1320): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:42:03 (136): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:18:58 (1424): No heartbeat from core client for 30 sec - exiting
22:19:02 (1424): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
18:45:26 (4172): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5832, selfPID=2240, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5952, selfPID=5200, iMonCtr=1
Model crash detected, will try to restart...
20:05:53 (5444): No heartbeat from core client for 30 sec - exiting
20:05:54 (5444): No heartbeat from core client for 30 sec - exiting
20:05:55 (5444): No heartbeat from core client for 30 sec - exiting
20:05:56 (5444): No heartbeat from core client for 30 sec - exiting
20:05:57 (5444): No heartbeat from core client for 30 sec - exiting
20:05:58 (5444): No heartbeat from core client for 30 sec - exiting
20:05:59 (5444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:06:00 (5444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:18:27 (6348): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4868, selfPID=1196, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4964, selfPID=1192, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 5
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=876, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 0
22:49:51 (876): called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_36gr_1959_1_007188531_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_36gr_1959_1_007188531_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_36gr_1959_1_007188531_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_36gr_1959_1_007188531_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_36gr_1959_1_007188531_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_36gr_1959_1_007188531_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_36gr_1959_1_007188531_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)      | Host ID | Result ID | Result Name                         | Timestep | CPU Time (sec) | Average (sec/TS)
22 Apr 2011 12:27:48 | 1106458 | 12628033  | hadam3p_pnw_36gr_1959_1_007188531_0 | 57,699   | 167,814        | 2.9084
22 Apr 2011 12:27:48 | 1106458 | 12628033  | hadam3p_pnw_36gr_1959_1_007188531_0 | 57,696   | 167,434        | 2.9020
17 Mar 2011 20:36:19 | 1106458 | 12628033  | hadam3p_pnw_36gr_1959_1_007188531_0 | 46,176   | 133,889        | 2.8995
12 Mar 2011 19:55:01 | 1106458 | 12628033  | hadam3p_pnw_36gr_1959_1_007188531_0 | 34,658   | 99,971         | 2.8845
12 Mar 2011 15:44:03 | 1106458 | 12628033  | hadam3p_pnw_36gr_1959_1_007188531_0 | 34,656   | 99,614         | 2.8744
11 Mar 2011 20:54:02 | 1106458 | 12628033  | hadam3p_pnw_36gr_1959_1_007188531_0 | 23,136   | 65,755         | 2.8421
09 Mar 2011 21:48:44 | 1106458 | 12628033  | hadam3p_pnw_36gr_1959_1_007188531_0 | 11,616   | 32,001         | 2.7549
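
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The short Python sketch below checks that reading against the seven rows listed above. The names TRICKLES and seconds_per_timestep are illustrative only and are not part of any BOINC or CPDN tool; the division formula is an assumption inferred from the numbers, not something the page states.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
# TRICKLES and seconds_per_timestep are hypothetical names used for illustration.
TRICKLES = [
    (57699, 167814, 2.9084),
    (57696, 167434, 2.9020),
    (46176, 133889, 2.8995),
    (34658, 99971, 2.8845),
    (34656, 99614, 2.8744),
    (23136, 65755, 2.8421),
    (11616, 32001, 2.7549),
]


def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Assumed definition of the Average column: CPU seconds per model timestep."""
    return cpu_time_sec / timestep


if __name__ == "__main__":
    for timestep, cpu_time_sec, reported in TRICKLES:
        computed = seconds_per_timestep(cpu_time_sec, timestep)
        # The page reports four decimal places, so compare within that precision.
        status = "ok" if abs(computed - reported) < 1e-4 else "MISMATCH"
        print(f"{timestep:>6} TS  computed={computed:.4f}  reported={reported:.4f}  {status}")
```

Under this reading, the per-timestep cost rises from the earliest trickle to the latest, which matches the trend in the reported column.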