Task 12882782

Name hadam3p_eu_2loj_1965_1_007235876_1
Workunit 7434116
Created 12 May 2011, 14:47:15 UTC
Sent 12 May 2011, 14:53:20 UTC
Report deadline 23 Apr 2012, 20:13:20 UTC
Received 7 Jun 2011, 16:14:35 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1141360
Run time 3 days 5 hours 50 min 29 sec
CPU time 3 days 1 hours 4 min 23 sec
Validate state Invalid
Credit 1,591.48
Device peak FLOPS 2.32 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4776, selfPID=4776, iMonCtr=2
00:41:14 (3840): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4468, selfPID=3680, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=700, selfPID=2660, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4412, selfPID=4108, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4540, selfPID=2820, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5024, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4012, selfPID=4240, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3988, iMonCtr=2
Model crash detected, will try to restart...
15:50:07 (3956): No heartbeat from core client for 30 sec - exiting
15:50:08 (3956): No heartbeat from core client for 30 sec - exiting
15:50:09 (3956): No heartbeat from core client for 30 sec - exiting
15:50:10 (3956): No heartbeat from core client for 30 sec - exiting
15:50:12 (3956): No heartbeat from core client for 30 sec - exiting
15:50:13 (3956): No heartbeat from core client for 30 sec - exiting
15:50:14 (3956): No heartbeat from core client for 30 sec - exiting
15:50:15 (3956): No heartbeat from core client for 30 sec - exiting
15:50:16 (3956): No heartbeat from core client for 30 sec - exiting
15:50:17 (3956): No heartbeat from core client for 30 sec - exiting
15:50:18 (3956): No heartbeat from core client for 30 sec - exiting
15:50:19 (3956): No heartbeat from core client for 30 sec - exiting
15:50:20 (3956): No heartbeat from core client for 30 sec - exiting
15:50:21 (3956): No heartbeat from core client for 30 sec - exiting
15:50:22 (3956): No heartbeat from core client for 30 sec - exiting
15:50:24 (3956): No heartbeat from core client for 30 sec - exiting
15:50:25 (3956): No heartbeat from core client for 30 sec - exiting
15:50:26 (3956): No heartbeat from core client for 30 sec - exiting
15:50:27 (3956): No heartbeat from core client for 30 sec - exiting
15:50:28 (3956): No heartbeat from core client for 30 sec - exiting
15:50:29 (3956): No heartbeat from core client for 30 sec - exiting
15:50:30 (3956): No heartbeat from core client for 30 sec - exiting
15:50:31 (3956): No heartbeat from core client for 30 sec - exiting
15:50:32 (3956): No heartbeat from core client for 30 sec - exiting
15:50:33 (3956): No heartbeat from core client for 30 sec - exiting
15:50:34 (3956): No heartbeat from core client for 30 sec - exiting
15:50:36 (3956): No heartbeat from core client for 30 sec - exiting
15:50:37 (3956): No heartbeat from core client for 30 sec - exiting
15:50:38 (3956): No heartbeat from core client for 30 sec - exiting
15:50:39 (3956): No heartbeat from core client for 30 sec - exiting
15:50:40 (3956): No heartbeat from core client for 30 sec - exiting
15:50:41 (3956): No heartbeat from core client for 30 sec - exiting
15:50:42 (3956): No heartbeat from core client for 30 sec - exiting
15:50:43 (3956): No heartbeat from core client for 30 sec - exiting
15:50:44 (3956): No heartbeat from core client for 30 sec - exiting
15:50:45 (3956): No heartbeat from core client for 30 sec - exiting
15:50:47 (3956): No heartbeat from core client for 30 sec - exiting
15:50:48 (3956): No heartbeat from core client for 30 sec - exiting
15:50:49 (3956): No heartbeat from core client for 30 sec - exiting
15:50:50 (3956): No heartbeat from core client for 30 sec - exiting
15:50:51 (3956): No heartbeat from core client for 30 sec - exiting
15:50:52 (3956): No heartbeat from core client for 30 sec - exiting
15:50:53 (3956): No heartbeat from core client for 30 sec - exiting
15:50:54 (3956): No heartbeat from core client for 30 sec - exiting
15:50:55 (3956): No heartbeat from core client for 30 sec - exiting
15:50:56 (3956): No heartbeat from core client for 30 sec - exiting
15:50:57 (3956): No heartbeat from core client for 30 sec - exiting
15:50:59 (3956): No heartbeat from core client for 30 sec - exiting
15:51:00 (3956): No heartbeat from core client for 30 sec - exiting
15:51:01 (3956): No heartbeat from core client for 30 sec - exiting
15:51:02 (3956): No heartbeat from core client for 30 sec - exiting
15:51:03 (3956): No heartbeat from core client for 30 sec - exiting
15:51:04 (3956): No heartbeat from core client for 30 sec - exiting
15:51:05 (3956): No heartbeat from core client for 30 sec - exiting
15:51:06 (3956): No heartbeat from core client for 30 sec - exiting
15:51:07 (3956): No heartbeat from core client for 30 sec - exiting
15:51:08 (3956): No heartbeat from core client for 30 sec - exiting
15:51:09 (3956): No heartbeat from core client for 30 sec - exiting
15:51:11 (3956): No heartbeat from core client for 30 sec - exiting
15:51:12 (3956): No heartbeat from core client for 30 sec - exiting
15:51:13 (3956): No heartbeat from core client for 30 sec - exiting
15:51:14 (3956): No heartbeat from core client for 30 sec - exiting
15:51:15 (3956): No heartbeat from core client for 30 sec - exiting
15:51:16 (3956): No heartbeat from core client for 30 sec - exiting
15:51:17 (3956): No heartbeat from core client for 30 sec - exiting
15:51:18 (3956): No heartbeat from core client for 30 sec - exiting
15:51:19 (3956): No heartbeat from core client for 30 sec - exiting
15:51:20 (3956): No heartbeat from core client for 30 sec - exiting
15:51:21 (3956): No heartbeat from core client for 30 sec - exiting
15:51:23 (3956): No heartbeat from core client for 30 sec - exiting
15:51:24 (3956): No heartbeat from core client for 30 sec - exiting
15:51:25 (3956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:51:26 (3956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
09:25:08 (5072): No heartbeat from core client for 30 sec - exiting
09:25:10 (5072): No heartbeat from core client for 30 sec - exiting
09:25:11 (5072): No heartbeat from core client for 30 sec - exiting
09:25:12 (5072): No heartbeat from core client for 30 sec - exiting
09:25:13 (5072): No heartbeat from core client for 30 sec - exiting
09:25:14 (5072): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakm.pipe_dummy 2048
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_2loj_1965_1_007235876_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_2loj_1965_1_007235876_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_2loj_1965_1_007235876_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_2loj_1965_1_007235876_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                         Timestep  CPU Time (sec)  Average (sec/TS)
07 Jun 2011 10:50:41  1141360  12882782   hadam3p_eu_2loj_1965_1_007235876_1  92,256    254,362         2.7571
06 Jun 2011 04:33:14  1141360  12882782   hadam3p_eu_2loj_1965_1_007235876_1  80,736    225,682         2.7953
05 Jun 2011 20:24:30  1141360  12882782   hadam3p_eu_2loj_1965_1_007235876_1  69,216    196,526         2.8393
02 Jun 2011 18:28:41  1141360  12882782   hadam3p_eu_2loj_1965_1_007235876_1  57,696    167,585         2.9046
26 May 2011 13:51:43  1141360  12882782   hadam3p_eu_2loj_1965_1_007235876_1  46,176    134,572         2.9143
21 May 2011 21:11:41  1141360  12882782   hadam3p_eu_2loj_1965_1_007235876_1  34,656    101,266         2.9220
15 May 2011 21:11:10  1141360  12882782   hadam3p_eu_2loj_1965_1_007235876_1  23,136    67,870          2.9335
14 May 2011 11:16:48  1141360  12882782   hadam3p_eu_2loj_1965_1_007235876_1  11,621    34,493          2.9682
13 May 2011 11:21:30  1141360  12882782   hadam3p_eu_2loj_1965_1_007235876_1  11,616    34,093          2.9350
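
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The sketch below is only an illustration of that assumed relationship, checked against three rows copied from the table; the function name seconds_per_timestep is hypothetical and not part of any CPDN or BOINC tool.

```python
def seconds_per_timestep(cpu_seconds: float, timesteps: int) -> float:
    """Return average CPU seconds per model timestep.

    Assumption: the page derives "Average (sec/TS)" as CPU time / timestep.
    """
    if timesteps <= 0:
        raise ValueError("timesteps must be positive")
    return cpu_seconds / timesteps


# (timestep, cpu_time_sec, average reported on the page), copied from the table
rows = [
    (92256, 254362, 2.7571),
    (80736, 225682, 2.7953),
    (69216, 196526, 2.8393),
]

# Recompute each average and compare it with the reported value.
for timesteps, cpu_seconds, reported in rows:
    computed = seconds_per_timestep(cpu_seconds, timesteps)
    print(f"{timesteps:>7}  computed={computed:.4f}  reported={reported:.4f}")
```

Under this assumption, the recomputed values agree with the reported column to four decimal places for the rows shown.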

