Task 16647647

Name hadam3p_anz_r0p5_2012_1_008730519_2
Workunit 8876497
Created 23 May 2014, 5:54:32 UTC
Sent 23 May 2014, 5:54:37 UTC
Report deadline 5 May 2015, 11:14:37 UTC
Received 9 Jun 2014, 11:40:10 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 907613
Run time 4 days 13 hours 42 min 47 sec
CPU time 9 hours 56 min 6 sec
Validate state Invalid
Credit 2,993.82
Device peak FLOPS 2.78 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 (windows_intelx86)
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
12:40:55 (4184): No heartbeat from core client for 30 sec - exiting
12:40:56 (4184): No heartbeat from core client for 30 sec - exiting
12:40:57 (4184): No heartbeat from core client for 30 sec - exiting
12:40:58 (4184): No heartbeat from core client for 30 sec - exiting
12:40:59 (4184): No heartbeat from core client for 30 sec - exiting
12:41:00 (4184): No heartbeat from core client for 30 sec - exiting
12:41:02 (4184): No heartbeat from core client for 30 sec - exiting
12:41:03 (4184): No heartbeat from core client for 30 sec - exiting
12:41:04 (4184): No heartbeat from core client for 30 sec - exiting
12:41:05 (4184): No heartbeat from core client for 30 sec - exiting
12:41:06 (4184): No heartbeat from core client for 30 sec - exiting
12:41:07 (4184): No heartbeat from core client for 30 sec - exiting
12:41:08 (4184): No heartbeat from core client for 30 sec - exiting
12:41:09 (4184): No heartbeat from core client for 30 sec - exiting
12:41:10 (4184): No heartbeat from core client for 30 sec - exiting
12:41:11 (4184): No heartbeat from core client for 30 sec - exiting
12:41:12 (4184): No heartbeat from core client for 30 sec - exiting
12:41:14 (4184): No heartbeat from core client for 30 sec - exiting
12:41:15 (4184): No heartbeat from core client for 30 sec - exiting
12:41:16 (4184): No heartbeat from core client for 30 sec - exiting
12:41:17 (4184): No heartbeat from core client for 30 sec - exiting
12:41:18 (4184): No heartbeat from core client for 30 sec - exiting
12:41:19 (4184): No heartbeat from core client for 30 sec - exiting
12:41:20 (4184): No heartbeat from core client for 30 sec - exiting
12:41:21 (4184): No heartbeat from core client for 30 sec - exiting
12:41:22 (4184): No heartbeat from core client for 30 sec - exiting
12:41:23 (4184): No heartbeat from core client for 30 sec - exiting
12:41:24 (4184): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2624, selfPID=2624, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6072, selfPID=6072, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
02:02:30 (7136): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2132, selfPID=5824, iMonCtr=1
Model crash detected, will try to restart...
21:52:24 (4996): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5376, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5932, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4408, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:17:29 (4284): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5016, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=456, selfPID=4244, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5196, selfPID=4240, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5004, selfPID=4420, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5672, selfPID=4552, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6052, selfPID=4356, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5400, selfPID=4380, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
cpdnmonitor: cannot open input file F:\BOINC/projects/climateprediction.net/hadam3p_anz_r0p5_2012_1_008730519/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file F:\BOINC/projects/climateprediction.net/hadam3p_anz_r0p5_2012_1_008730519/dataout/region_restart.day after 11 attempts

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakg.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_r0p5_2012_1_008730519_2_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r0p5_2012_1_008730519_2_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r0p5_2012_1_008730519_2_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r0p5_2012_1_008730519_2_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r0p5_2012_1_008730519_2_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r0p5_2012_1_008730519_2_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
05 Jun 2014 16:34:19  907613   16647647   hadam3p_anz_r0p5_2012_1_008730519_2  69,419    325,877         4.6943
03 Jun 2014 19:28:37  907613   16647647   hadam3p_anz_r0p5_2012_1_008730519_2  57,899    271,717         4.6929
02 Jun 2014 09:27:23  907613   16647647   hadam3p_anz_r0p5_2012_1_008730519_2  46,379    217,595         4.6917
31 May 2014 10:26:48  907613   16647647   hadam3p_anz_r0p5_2012_1_008730519_2  34,859    163,486         4.6899
28 May 2014 10:57:37  907613   16647647   hadam3p_anz_r0p5_2012_1_008730519_2  23,339    109,047         4.6723
24 May 2014 09:41:43  907613   16647647   hadam3p_anz_r0p5_2012_1_008730519_2  11,819    55,085          4.6607
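
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The page does not state this, so the formula below is an assumption, and the names `TRICKLES` and `average_sec_per_timestep` are illustrative only. A minimal sketch that recomputes the column from the six rows above:

```python
# Rows copied from the "Latest Trickles Received" table:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
TRICKLES = [
    (69419, 325877, 4.6943),
    (57899, 271717, 4.6929),
    (46379, 217595, 4.6917),
    (34859, 163486, 4.6899),
    (23339, 109047, 4.6723),
    (11819, 55085, 4.6607),
]


def average_sec_per_timestep(cpu_seconds, timestep):
    """Assumed formula: cumulative CPU seconds divided by timesteps completed."""
    return cpu_seconds / timestep


if __name__ == "__main__":
    for timestep, cpu_seconds, reported in TRICKLES:
        computed = average_sec_per_timestep(cpu_seconds, timestep)
        # The reported column has four decimals, so allow half a unit
        # in the last printed place when comparing.
        agrees = abs(computed - reported) < 5e-5
        print(f"timestep={timestep} computed={computed:.4f} "
              f"reported={reported:.4f} agrees={agrees}")
```

For these six rows the recomputed values agree with the reported column to within rounding, which is consistent with the assumed formula but does not confirm how the server actually derives it.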


©2024 climateprediction.net