climateprediction.net home page
Task 12134759

Task 12134759

Name hadam3p_eu_wmho_1971_1_006862900_0
Workunit 7066216
Created 19 Nov 2010, 10:49:49 UTC
Sent 17 Mar 2011, 14:59:49 UTC
Report deadline 27 Feb 2012, 20:19:49 UTC
Received 20 Apr 2011, 23:50:27 UTC
Server state In progress
Outcome ---
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1090556
Run time 2 days 10 hours 22 min 36 sec
CPU time 2 days 8 hours 57 min 41 sec
Validate state Invalid
Credit 1,194.02
Device peak FLOPS 2.79 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.08
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4504, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6124, selfPID=4104, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5584, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5824, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
14:14:33 (4004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:14:35 (4004): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5440, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5152, selfPID=436, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5568, selfPID=4608, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3656, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5956, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3236, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2456, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5176, selfPID=5840, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_wmho_1971_1_006862900/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_wmho_1971_1_006862900/dataout/region_restart.day after 11 attempts

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
17:43:04 (4384): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_wmho_1971_1_006862900_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_wmho_1971_1_006862900_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_wmho_1971_1_006862900_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_wmho_1971_1_006862900_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_wmho_1971_1_006862900_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_wmho_1971_1_006862900_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
30 Mar 2011 23:15:49 1090556 12134759 hadam3p_eu_wmho_1971_1_006862900_0 69,216 201,021 2.9043
30 Mar 2011 13:48:16 1090556 12134759 hadam3p_eu_wmho_1971_1_006862900_0 57,696 167,739 2.9073
29 Mar 2011 19:56:51 1090556 12134759 hadam3p_eu_wmho_1971_1_006862900_0 46,176 134,592 2.9148
29 Mar 2011 02:06:40 1090556 12134759 hadam3p_eu_wmho_1971_1_006862900_0 34,656 101,286 2.9226
27 Mar 2011 18:13:45 1090556 12134759 hadam3p_eu_wmho_1971_1_006862900_0 23,136 68,205 2.9480
26 Mar 2011 21:47:59 1090556 12134759 hadam3p_eu_wmho_1971_1_006862900_0 11,616 35,244 3.0341


©2024 climateprediction.net