Task 16950386

Name: hadam3p_anz_rof0_2012_1_008958223_0
Workunit: 9102398
Created: 27 Aug 2014, 12:20:57 UTC
Sent: 31 Aug 2014, 12:29:42 UTC
Report deadline: 13 Aug 2015, 17:49:42 UTC
Received: 19 Sep 2014, 12:34:59 UTC
Server state: Over
Outcome: Computation error
Client state: Compute error
Exit status: 0 (0x00000000)
Computer ID: 1333820
Run time: 3 days 6 hours 16 min 40 sec
CPU time: 2 days 20 hours 5 min 17 sec
Validate state: Invalid
Credit: 1,503.36
Device peak FLOPS: 2.48 GFLOPS
Application version: UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8104, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7344, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
11:45:43 (5772): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7448, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5576, selfPID=6632, iMonCtr=1
Model crash detected, will try to restart...
17:44:18 (4160): No heartbeat from core client for 30 sec - exiting
17:44:19 (4160): No heartbeat from core client for 30 sec - exiting
17:44:21 (4160): No heartbeat from core client for 30 sec - exiting
17:44:22 (4160): No heartbeat from core client for 30 sec - exiting
17:44:23 (4160): No heartbeat from core client for 30 sec - exiting
17:44:24 (4160): No heartbeat from core client for 30 sec - exiting
17:44:25 (4160): No heartbeat from core client for 30 sec - exiting
17:44:26 (4160): No heartbeat from core client for 30 sec - exiting
17:44:27 (4160): No heartbeat from core client for 30 sec - exiting
17:44:28 (4160): No heartbeat from core client for 30 sec - exiting
17:44:29 (4160): No heartbeat from core client for 30 sec - exiting
17:44:30 (4160): No heartbeat from core client for 30 sec - exiting
17:44:31 (4160): No heartbeat from core client for 30 sec - exiting
17:44:32 (4160): No heartbeat from core client for 30 sec - exiting
17:44:33 (4160): No heartbeat from core client for 30 sec - exiting
17:44:34 (4160): No heartbeat from core client for 30 sec - exiting
17:44:35 (4160): No heartbeat from core client for 30 sec - exiting
17:44:36 (4160): No heartbeat from core client for 30 sec - exiting
17:44:37 (4160): No heartbeat from core client for 30 sec - exiting
17:44:38 (4160): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8140, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2252, selfPID=2252, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7592, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=688, selfPID=4136, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
23:35:53 (6656): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7808, selfPID=7808, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1364, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5164, selfPID=6512, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6612, selfPID=3616, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
20:42:14 (6044): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7668, selfPID=7668, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8168, selfPID=5856, iMonCtr=1
Model crash detected, will try to restart...
16:44:30 (5192): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1108, selfPID=1108, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:52:39 (2676): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8036, selfPID=8036, iMonCtr=2
11:11:34 (2804): No heartbeat from core client for 30 sec - exiting
11:11:35 (2804): No heartbeat from core client for 30 sec - exiting
11:11:36 (2804): No heartbeat from core client for 30 sec - exiting
11:11:37 (2804): No heartbeat from core client for 30 sec - exiting
11:11:38 (2804): No heartbeat from core client for 30 sec - exiting
11:11:39 (2804): No heartbeat from core client for 30 sec - exiting
11:11:40 (2804): No heartbeat from core client for 30 sec - exiting
11:11:41 (2804): No heartbeat from core client for 30 sec - exiting
11:11:42 (2804): No heartbeat from core client for 30 sec - exiting
11:11:43 (2804): No heartbeat from core client for 30 sec - exiting
11:11:44 (2804): No heartbeat from core client for 30 sec - exiting
11:11:45 (2804): No heartbeat from core client for 30 sec - exiting
11:11:46 (2804): No heartbeat from core client for 30 sec - exiting
11:11:47 (2804): No heartbeat from core client for 30 sec - exiting
11:11:48 (2804): No heartbeat from core client for 30 sec - exiting
11:11:49 (2804): No heartbeat from core client for 30 sec - exiting
11:11:50 (2804): No heartbeat from core client for 30 sec - exiting
11:11:51 (2804): No heartbeat from core client for 30 sec - exiting
11:11:52 (2804): No heartbeat from core client for 30 sec - exiting
11:11:53 (2804): No heartbeat from core client for 30 sec - exiting
11:11:54 (2804): No heartbeat from core client for 30 sec - exiting
11:11:55 (2804): No heartbeat from core client for 30 sec - exiting
11:11:56 (2804): No heartbeat from core client for 30 sec - exiting
11:11:57 (2804): No heartbeat from core client for 30 sec - exiting
11:11:58 (2804): No heartbeat from core client for 30 sec - exiting
11:11:59 (2804): No heartbeat from core client for 30 sec - exiting
11:12:00 (2804): No heartbeat from core client for 30 sec - exiting
11:12:01 (2804): No heartbeat from core client for 30 sec - exiting
11:12:02 (2804): No heartbeat from core client for 30 sec - exiting
11:12:03 (2804): No heartbeat from core client for 30 sec - exiting
11:12:04 (2804): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:25:48 (6012): No heartbeat from core client for 30 sec - exiting
15:25:49 (6012): No heartbeat from core client for 30 sec - exiting
15:25:50 (6012): No heartbeat from core client for 30 sec - exiting
15:25:51 (6012): No heartbeat from core client for 30 sec - exiting
15:25:53 (6012): No heartbeat from core client for 30 sec - exiting
15:25:54 (6012): No heartbeat from core client for 30 sec - exiting
15:25:55 (6012): No heartbeat from core client for 30 sec - exiting
15:25:56 (6012): No heartbeat from core client for 30 sec - exiting
15:25:57 (6012): No heartbeat from core client for 30 sec - exiting
15:25:58 (6012): No heartbeat from core client for 30 sec - exiting
15:25:59 (6012): No heartbeat from core client for 30 sec - exiting
15:26:00 (6012): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:12:24 (7360): No heartbeat from core client for 30 sec - exiting
20:12:25 (7360): No heartbeat from core client for 30 sec - exiting
20:12:26 (7360): No heartbeat from core client for 30 sec - exiting
20:12:27 (7360): No heartbeat from core client for 30 sec - exiting
20:12:28 (7360): No heartbeat from core client for 30 sec - exiting
20:12:29 (7360): No heartbeat from core client for 30 sec - exiting
20:12:30 (7360): No heartbeat from core client for 30 sec - exiting
20:12:31 (7360): No heartbeat from core client for 30 sec - exiting
20:12:32 (7360): No heartbeat from core client for 30 sec - exiting
20:12:33 (7360): No heartbeat from core client for 30 sec - exiting
20:12:34 (7360): No heartbeat from core client for 30 sec - exiting
20:12:35 (7360): No heartbeat from core client for 30 sec - exiting
20:12:36 (7360): No heartbeat from core client for 30 sec - exiting
20:12:37 (7360): No heartbeat from core client for 30 sec - exiting
20:12:38 (7360): No heartbeat from core client for 30 sec - exiting
20:12:39 (7360): No heartbeat from core client for 30 sec - exiting
20:12:40 (7360): No heartbeat from core client for 30 sec - exiting
20:12:41 (7360): No heartbeat from core client for 30 sec - exiting
20:12:42 (7360): No heartbeat from core client for 30 sec - exiting
20:12:43 (7360): No heartbeat from core client for 30 sec - exiting
20:12:44 (7360): No heartbeat from core client for 30 sec - exiting
20:12:45 (7360): No heartbeat from core client for 30 sec - exiting
20:12:46 (7360): No heartbeat from core client for 30 sec - exiting
20:12:47 (7360): No heartbeat from core client for 30 sec - exiting
20:12:48 (7360): No heartbeat from core client for 30 sec - exiting
20:12:49 (7360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
00:11:09 (5508): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:01:42 (2580): No heartbeat from core client for 30 sec - exiting
10:01:43 (2580): No heartbeat from core client for 30 sec - exiting
10:01:44 (2580): No heartbeat from core client for 30 sec - exiting
10:01:45 (2580): No heartbeat from core client for 30 sec - exiting
10:01:46 (2580): No heartbeat from core client for 30 sec - exiting
10:01:47 (2580): No heartbeat from core client for 30 sec - exiting
10:01:48 (2580): No heartbeat from core client for 30 sec - exiting
10:01:49 (2580): No heartbeat from core client for 30 sec - exiting
10:01:50 (2580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:48:16 (6460): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5004, selfPID=5004, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7108, selfPID=4616, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7696, selfPID=3976, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=556, selfPID=5352, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8168, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8884, selfPID=6440, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7020, selfPID=4776, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=840, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7424, selfPID=6300, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_rof0_2012_1_008958223_0_4.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rof0_2012_1_008958223_0_5.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rof0_2012_1_008958223_0_6.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rof0_2012_1_008958223_0_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rof0_2012_1_008958223_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rof0_2012_1_008958223_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rof0_2012_1_008958223_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rof0_2012_1_008958223_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rof0_2012_1_008958223_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
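
The stderr output above consists of a handful of recurring message types. As a rough aid for reading logs like this one, the sketch below tallies those types. The label names, the `PATTERNS` table, and the `tally` helper are hypothetical and are not part of BOINC or the CPDN application; the sample lines are copied from the start of the log above.

```python
import re
from collections import Counter

# Hypothetical labels mapped to substrings that appear verbatim in the stderr above.
PATTERNS = {
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "quit_request": re.compile(r"Quit request from BOINC"),
    "suspend_request": re.compile(r"Suspend request from BOINC"),
    "model_crash": re.compile(r"Model crash detected"),
}

def tally(lines):
    """Count how many lines match each pattern (first match wins per line)."""
    counts = Counter()
    for line in lines:
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
                break
    return counts

# A short excerpt copied from the log; a real run would read the full stderr text.
sample = [
    "Suspended CPDN Monitor - Suspend request from BOINC...",
    "CPDN Monitor - Quit request from BOINC...",
    "Model crash detected, will try to restart...",
    "11:45:43 (5772): No heartbeat from core client for 30 sec - exiting",
    "CPDN Monitor - No 'heartbeat' from BOINC...",
]

counts = tally(sample)
for label, n in sorted(counts.items()):
    print(label, n)
```

The last sample line (`No 'heartbeat' from BOINC`) is deliberately left unmatched here, since it is the monitor's echo of the preceding client-side message rather than a separate event; whether to count it is a choice, not something the log itself settles.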
Latest Trickles Received
Time Sent (UTC)        Host ID   Result ID   Result Name                           Timestep   CPU Time (sec)   Average (sec/TS)
16 Sep 2014 07:34:22   1333820   16950386    hadam3p_anz_rof0_2012_1_008958223_0   34,859     218,511          6.2684
11 Sep 2014 10:58:38   1333820   16950386    hadam3p_anz_rof0_2012_1_008958223_0   23,339     147,631          6.3255
04 Sep 2014 10:14:07   1333820   16950386    hadam3p_anz_rof0_2012_1_008958223_0   11,819     76,835           6.5010
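
The page does not say how the Average (sec/TS) column is derived. Assuming it is the cumulative CPU time divided by the timestep count, the reported values can be checked with a short sketch; the function name `average_sec_per_timestep` is made up for illustration, and the numbers are copied from the three trickle rows above.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle rows.
trickles = [
    (34859, 218511, 6.2684),
    (23339, 147631, 6.3255),
    (11819, 76835, 6.5010),
]

def average_sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds per completed timestep."""
    return cpu_time_sec / timestep

for timestep, cpu_time_sec, reported in trickles:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # Compare at the four decimal places the page displays.
    agrees = abs(computed - reported) < 5e-5
    print(f"{timestep:>6} {computed:.4f} reported={reported:.4f} agrees={agrees}")
```

Under that assumption all three rows agree with the reported averages to four decimal places, which suggests the per-timestep cost fell slightly as the run progressed.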

