Task 13833054

Name hadam3p_saf_7bbq_2004_1_007622123_0
Workunit 7800442
Created 29 Dec 2011, 19:49:15 UTC
Sent 31 Dec 2011, 16:35:26 UTC
Report deadline 12 Dec 2012, 21:55:26 UTC
Received 26 Jan 2012, 16:45:52 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1180397
Run time 2 days 3 hours 18 min 37 sec
CPU time 1 day 19 hours 51 min 53 sec
Validate state Invalid
Credit 1,122.82
Device peak FLOPS 2.16 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
15:47:07 (7204): No heartbeat from core client for 30 sec - exiting
15:47:08 (7204): No heartbeat from core client for 30 sec - exiting
15:47:09 (7204): No heartbeat from core client for 30 sec - exiting
15:47:10 (7204): No heartbeat from core client for 30 sec - exiting
15:47:11 (7204): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2372, selfPID=2372, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1504, selfPID=1504, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8492, selfPID=8492, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7888, selfPID=7888, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7608, selfPID=7608, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5076, selfPID=5076, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3588, selfPID=3588, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4676, selfPID=4676, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14808, selfPID=14808, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:08:04 (1036): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:08:05 (1036): No heartbeat from core client for 30 sec - exiting
15:08:06 (1036): No heartbeat from core client for 30 sec - exiting
15:08:07 (1036): No heartbeat from core client for 30 sec - exiting
15:08:08 (1036): No heartbeat from core client for 30 sec - exiting
15:08:09 (1036): No heartbeat from core client for 30 sec - exiting
15:08:10 (1036): No heartbeat from core client for 30 sec - exiting
15:08:11 (1036): No heartbeat from core client for 30 sec - exiting
15:08:12 (1036): No heartbeat from core client for 30 sec - exiting
15:08:13 (1036): No heartbeat from core client for 30 sec - exiting
15:08:14 (1036): No heartbeat from core client for 30 sec - exiting
15:08:15 (1036): No heartbeat from core client for 30 sec - exiting
15:08:16 (1036): No heartbeat from core client for 30 sec - exiting
15:08:17 (1036): No heartbeat from core client for 30 sec - exiting
15:08:18 (1036): No heartbeat from core client for 30 sec - exiting
15:08:19 (1036): No heartbeat from core client for 30 sec - exiting
15:08:20 (1036): No heartbeat from core client for 30 sec - exiting
15:08:21 (1036): No heartbeat from core client for 30 sec - exiting
15:08:22 (1036): No heartbeat from core client for 30 sec - exiting
15:08:23 (1036): No heartbeat from core client for 30 sec - exiting
15:08:24 (1036): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5144, selfPID=7500, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9080, selfPID=7232, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9992, selfPID=9992, iMonCtr=2
CPDN Monitor - Quit request from BOINC...

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakm.pipe_dummy 2048
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_saf_7bbq_2004_1_007622123_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_7bbq_2004_1_007622123_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_7bbq_2004_1_007622123_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_7bbq_2004_1_007622123_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_7bbq_2004_1_007622123_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_7bbq_2004_1_007622123_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
24 Jan 2012 09:03:43  1180397  13833054   hadam3p_saf_7bbq_2004_1_007622123_0  69,216    143,319         2.0706
23 Jan 2012 21:21:13  1180397  13833054   hadam3p_saf_7bbq_2004_1_007622123_0  57,696    128,138         2.2209
23 Jan 2012 16:02:05  1180397  13833054   hadam3p_saf_7bbq_2004_1_007622123_0  46,176    112,533         2.4370
21 Jan 2012 13:26:48  1180397  13833054   hadam3p_saf_7bbq_2004_1_007622123_0  34,656    84,849          2.4483
20 Jan 2012 22:22:54  1180397  13833054   hadam3p_saf_7bbq_2004_1_007622123_0  23,136    54,627          2.3611
19 Jan 2012 01:14:56  1180397  13833054   hadam3p_saf_7bbq_2004_1_007622123_0  11,616    27,835          2.3963
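
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below checks that reading against the six rows listed above. The function name `average_sec_per_timestep` and the tuple layout are illustrative assumptions, not part of the climateprediction.net or BOINC software.

```python
# Values copied from the trickle table: (timestep, cpu_time_sec, reported_average).
# The tuple layout is an assumption made for this illustration only.
trickles = [
    (69216, 143319, 2.0706),
    (57696, 128138, 2.2209),
    (46176, 112533, 2.4370),
    (34656, 84849, 2.4483),
    (23136, 54627, 2.3611),
    (11616, 27835, 2.3963),
]


def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return cumulative CPU seconds per model timestep, rounded to 4 places."""
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in trickles:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # Flag any row where the recomputed value disagrees with the table.
    status = "ok" if abs(computed - reported) < 1e-6 else "mismatch"
    print(f"{timestep:>6} TS  {cpu_time_sec:>7} s  {computed:.4f} sec/TS  {status}")
```

Because the figure is cumulative, it reflects the whole run up to each trickle rather than the speed of the most recent interval alone.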


©2024 climateprediction.net