climateprediction.net home page
Task 16797010

Task 16797010

Name hadam3p_eu_p46t_2013_1_008876974_0
Workunit 9022903
Created 9 Jul 2014, 16:49:54 UTC
Sent 11 Jul 2014, 16:41:54 UTC
Report deadline 23 Jun 2015, 22:01:54 UTC
Received 14 Jul 2014, 16:18:07 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1169024
Run time 1 days 14 hours 17 min 52 sec
CPU time 1 days 13 hours 53 min 21 sec
Validate state Invalid
Credit 995.30
Device peak FLOPS 2.44 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=960, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5748, selfPID=5724, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
17:08:46 (7120): No heartbeat from core client for 30 sec - exiting
17:08:47 (7120): No heartbeat from core client for 30 sec - exiting
17:08:48 (7120): No heartbeat from core client for 30 sec - exiting
17:08:50 (7120): No heartbeat from core client for 30 sec - exiting
17:08:51 (7120): No heartbeat from core client for 30 sec - exiting
17:08:52 (7120): No heartbeat from core client for 30 sec - exiting
17:08:53 (7120): No heartbeat from core client for 30 sec - exiting
17:08:54 (7120): No heartbeat from core client for 30 sec - exiting
17:08:55 (7120): No heartbeat from core client for 30 sec - exiting
17:08:56 (7120): No heartbeat from core client for 30 sec - exiting
17:08:57 (7120): No heartbeat from core client for 30 sec - exiting
17:08:58 (7120): No heartbeat from core client for 30 sec - exiting
17:08:59 (7120): No heartbeat from core client for 30 sec - exiting
17:09:01 (7120): No heartbeat from core client for 30 sec - exiting
17:09:02 (7120): No heartbeat from core client for 30 sec - exiting
17:09:03 (7120): No heartbeat from core client for 30 sec - exiting
17:09:04 (7120): No heartbeat from core client for 30 sec - exiting
17:09:05 (7120): No heartbeat from core client for 30 sec - exiting
17:09:06 (7120): No heartbeat from core client for 30 sec - exiting
17:09:07 (7120): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:31:42 (6932): No heartbeat from core client for 30 sec - exiting
17:31:44 (6932): No heartbeat from core client for 30 sec - exiting
17:31:45 (6932): No heartbeat from core client for 30 sec - exiting
17:31:46 (6932): No heartbeat from core client for 30 sec - exiting
17:31:47 (6932): No heartbeat from core client for 30 sec - exiting
17:31:48 (6932): No heartbeat from core client for 30 sec - exiting
17:31:49 (6932): No heartbeat from core client for 30 sec - exiting
17:31:50 (6932): No heartbeat from core client for 30 sec - exiting
17:31:51 (6932): No heartbeat from core client for 30 sec - exiting
17:31:52 (6932): No heartbeat from core client for 30 sec - exiting
17:31:53 (6932): No heartbeat from core client for 30 sec - exiting
17:31:54 (6932): No heartbeat from core client for 30 sec - exiting
17:31:55 (6932): No heartbeat from core client for 30 sec - exiting
17:31:56 (6932): No heartbeat from core client for 30 sec - exiting
17:31:57 (6932): No heartbeat from core client for 30 sec - exiting
17:31:58 (6932): No heartbeat from core client for 30 sec - exiting
17:31:59 (6932): No heartbeat from core client for 30 sec - exiting
17:32:00 (6932): No heartbeat from core client for 30 sec - exiting
17:32:01 (6932): No heartbeat from core client for 30 sec - exiting
17:32:02 (6932): No heartbeat from core client for 30 sec - exiting
17:32:03 (6932): No heartbeat from core client for 30 sec - exiting
17:32:04 (6932): No heartbeat from core client for 30 sec - exiting
17:32:05 (6932): No heartbeat from core client for 30 sec - exiting
17:32:06 (6932): No heartbeat from core client for 30 sec - exiting
17:32:07 (6932): No heartbeat from core client for 30 sec - exiting
17:32:08 (6932): No heartbeat from core client for 30 sec - exiting
17:32:09 (6932): No heartbeat from core client for 30 sec - exiting
17:32:10 (6932): No heartbeat from core client for 30 sec - exiting
17:32:11 (6932): No heartbeat from core client for 30 sec - exiting
17:32:12 (6932): No heartbeat from core client for 30 sec - exiting
17:32:13 (6932): No heartbeat from core client for 30 sec - exiting
17:32:14 (6932): No heartbeat from core client for 30 sec - exiting
17:32:15 (6932): No heartbeat from core client for 30 sec - exiting
17:32:16 (6932): No heartbeat from core client for 30 sec - exiting
17:32:17 (6932): No heartbeat from core client for 30 sec - exiting
17:32:18 (6932): No heartbeat from core client for 30 sec - exiting
17:32:19 (6932): No heartbeat from core client for 30 sec - exiting
17:32:20 (6932): No heartbeat from core client for 30 sec - exiting
17:32:21 (6932): No heartbeat from core client for 30 sec - exiting
17:32:22 (6932): No heartbeat from core client for 30 sec - exiting
17:32:23 (6932): No heartbeat from core client for 30 sec - exiting
17:32:24 (6932): No heartbeat from core client for 30 sec - exiting
17:32:25 (6932): No heartbeat from core client for 30 sec - exiting
17:32:26 (6932): No heartbeat from core client for 30 sec - exiting
17:32:27 (6932): No heartbeat from core client for 30 sec - exiting
17:32:28 (6932): No heartbeat from core client for 30 sec - exiting
17:32:29 (6932): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:32:30 (6932): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5208, selfPID=5144, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3616, iMonCtr=2

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_p46t_2013_1_008876974_0_6.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_p46t_2013_1_008876974_0_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_p46t_2013_1_008876974_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_p46t_2013_1_008876974_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_p46t_2013_1_008876974_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_p46t_2013_1_008876974_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_p46t_2013_1_008876974_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
14 Jul 2014 14:25:50 1169024 16797010 hadam3p_eu_p46t_2013_1_008876974_0 57,696 133,242 2.3094
14 Jul 2014 06:49:59 1169024 16797010 hadam3p_eu_p46t_2013_1_008876974_0 46,176 106,849 2.3140
13 Jul 2014 15:08:28 1169024 16797010 hadam3p_eu_p46t_2013_1_008876974_0 34,656 80,255 2.3158
13 Jul 2014 07:35:19 1169024 16797010 hadam3p_eu_p46t_2013_1_008876974_0 23,152 53,869 2.3268
12 Jul 2014 23:02:00 1169024 16797010 hadam3p_eu_p46t_2013_1_008876974_0 23,139 53,476 2.3111
12 Jul 2014 22:01:45 1169024 16797010 hadam3p_eu_p46t_2013_1_008876974_0 23,136 53,102 2.2952
12 Jul 2014 13:30:22 1169024 16797010 hadam3p_eu_p46t_2013_1_008876974_0 11,616 26,704 2.2989


©2024 climateprediction.net