climateprediction.net home page
Task 12300233

Task 12300233

Name hadam3p_pnw_zs6b_1994_1_007015035_0
Workunit 7218351
Created 24 Nov 2010, 14:54:24 UTC
Sent 15 Jan 2011, 20:34:57 UTC
Report deadline 29 Dec 2011, 1:54:57 UTC
Received 3 Feb 2011, 17:26:09 UTC
Server state Over
Outcome No reply
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1117997
Run time 4 days 1 hours 31 min 57 sec
CPU time 3 days 13 hours 4 min 9 sec
Validate state Invalid
Credit 2,755.56
Device peak FLOPS 3.08 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.08
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6004, selfPID=5544, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
09:39:02 (5736): No heartbeat from core client for 30 sec - exiting
09:39:03 (5736): No heartbeat from core client for 30 sec - exiting
09:39:04 (5736): No heartbeat from core client for 30 sec - exiting
09:39:05 (5736): No heartbeat from core client for 30 sec - exiting
09:39:06 (5736): No heartbeat from core client for 30 sec - exiting
09:39:07 (5736): No heartbeat from core client for 30 sec - exiting
09:39:08 (5736): No heartbeat from core client for 30 sec - exiting
09:39:09 (5736): No heartbeat from core client for 30 sec - exiting
09:39:10 (5736): No heartbeat from core client for 30 sec - exiting
09:39:11 (5736): No heartbeat from core client for 30 sec - exiting
09:39:12 (5736): No heartbeat from core client for 30 sec - exiting
09:39:13 (5736): No heartbeat from core client for 30 sec - exiting
09:39:14 (5736): No heartbeat from core client for 30 sec - exiting
09:39:15 (5736): No heartbeat from core client for 30 sec - exiting
09:39:16 (5736): No heartbeat from core client for 30 sec - exiting
09:39:17 (5736): No heartbeat from core client for 30 sec - exiting
09:39:18 (5736): No heartbeat from core client for 30 sec - exiting
09:39:19 (5736): No heartbeat from core client for 30 sec - exiting
09:39:20 (5736): No heartbeat from core client for 30 sec - exiting
09:39:21 (5736): No heartbeat from core client for 30 sec - exiting
09:39:22 (5736): No heartbeat from core client for 30 sec - exiting
09:39:23 (5736): No heartbeat from core client for 30 sec - exiting
09:39:24 (5736): No heartbeat from core client for 30 sec - exiting
09:39:25 (5736): No heartbeat from core client for 30 sec - exiting
09:39:26 (5736): No heartbeat from core client for 30 sec - exiting
09:39:27 (5736): No heartbeat from core client for 30 sec - exiting
09:39:28 (5736): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:39:29 (5736): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5292, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3472, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2328, iMonCtr=2
09:32:59 (5948): No heartbeat from core client for 30 sec - exiting
09:33:00 (5948): No heartbeat from core client for 30 sec - exiting
09:33:01 (5948): No heartbeat from core client for 30 sec - exiting
09:33:02 (5948): No heartbeat from core client for 30 sec - exiting
09:33:03 (5948): No heartbeat from core client for 30 sec - exiting
09:33:04 (5948): No heartbeat from core client for 30 sec - exiting
09:33:05 (5948): No heartbeat from core client for 30 sec - exiting
09:33:06 (5948): No heartbeat from core client for 30 sec - exiting
09:33:07 (5948): No heartbeat from core client for 30 sec - exiting
09:33:08 (5948): No heartbeat from core client for 30 sec - exiting
09:33:09 (5948): No heartbeat from core client for 30 sec - exiting
09:33:10 (5948): No heartbeat from core client for 30 sec - exiting
09:33:11 (5948): No heartbeat from core client for 30 sec - exiting
09:33:12 (5948): No heartbeat from core client for 30 sec - exiting
09:33:13 (5948): No heartbeat from core client for 30 sec - exiting
09:33:14 (5948): No heartbeat from core client for 30 sec - exiting
09:33:15 (5948): No heartbeat from core client for 30 sec - exiting
09:33:16 (5948): No heartbeat from core client for 30 sec - exiting
09:33:17 (5948): No heartbeat from core client for 30 sec - exiting
09:33:18 (5948): No heartbeat from core client for 30 sec - exiting
09:33:19 (5948): No heartbeat from core client for 30 sec - exiting
09:33:20 (5948): No heartbeat from core client for 30 sec - exiting
09:33:21 (5948): No heartbeat from core client for 30 sec - exiting
09:33:22 (5948): No heartbeat from core client for 30 sec - exiting
09:33:23 (5948): No heartbeat from core client for 30 sec - exiting
09:33:24 (5948): No heartbeat from core client for 30 sec - exiting
09:33:25 (5948): No heartbeat from core client for 30 sec - exiting
09:33:26 (5948): No heartbeat from core client for 30 sec - exiting
09:33:27 (5948): No heartbeat from core client for 30 sec - exiting
09:33:28 (5948): No heartbeat from core client for 30 sec - exiting
09:33:29 (5948): No heartbeat from core client for 30 sec - exiting
09:33:30 (5948): No heartbeat from core client for 30 sec - exiting
09:33:31 (5948): No heartbeat from core client for 30 sec - exiting
09:33:32 (5948): No heartbeat from core client for 30 sec - exiting
09:33:33 (5948): No heartbeat from core client for 30 sec - exiting
09:33:34 (5948): No heartbeat from core client for 30 sec - exiting
09:33:35 (5948): No heartbeat from core client for 30 sec - exiting
09:33:36 (5948): No heartbeat from core client for 30 sec - exiting
09:33:37 (5948): No heartbeat from core client for 30 sec - exiting
09:33:38 (5948): No heartbeat from core client for 30 sec - exiting
09:33:39 (5948): No heartbeat from core client for 30 sec - exiting
09:33:40 (5948): No heartbeat from core client for 30 sec - exiting
09:33:41 (5948): No heartbeat from core client for 30 sec - exiting
09:33:42 (5948): No heartbeat from core client for 30 sec - exiting
09:33:43 (5948): No heartbeat from core client for 30 sec - exiting
09:33:44 (5948): No heartbeat from core client for 30 sec - exiting
09:33:45 (5948): No heartbeat from core client for 30 sec - exiting
09:33:46 (5948): No heartbeat from core client for 30 sec - exiting
09:33:47 (5948): No heartbeat from core client for 30 sec - exiting
09:33:48 (5948): No heartbeat from core client for 30 sec - exiting
09:33:49 (5948): No heartbeat from core client for 30 sec - exiting
09:33:50 (5948): No heartbeat from core client for 30 sec - exiting
09:33:51 (5948): No heartbeat from core client for 30 sec - exiting
09:33:52 (5948): No heartbeat from core client for 30 sec - exiting
09:33:53 (5948): No heartbeat from core client for 30 sec - exiting
09:33:54 (5948): No heartbeat from core client for 30 sec - exiting
09:33:55 (5948): No heartbeat from core client for 30 sec - exiting
09:33:56 (5948): No heartbeat from core client for 30 sec - exiting
09:33:57 (5948): No heartbeat from core client for 30 sec - exiting
09:33:58 (5948): No heartbeat from core client for 30 sec - exiting
09:33:59 (5948): No heartbeat from core client for 30 sec - exiting
09:34:00 (5948): No heartbeat from core client for 30 sec - exiting
09:34:01 (5948): No heartbeat from core client for 30 sec - exiting
09:34:03 (5948): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2364, selfPID=5592, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5652, selfPID=5448, iMonCtr=1
Model crash detected, will try to restart...
17:35:11 (4840): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=360, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3644, iMonCtr=2
17:11:26 (5564): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3624, selfPID=3624, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5736, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4260, selfPID=5128, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1456, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4724, selfPID=5136, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 11
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4280, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5260, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 11
cpdnmonitor: cannot open input file C:\ProgramData/projects/climateprediction.net/hadam3p_pnw_zs6b_1994_1_007015035/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData/projects/climateprediction.net/hadam3p_pnw_zs6b_1994_1_007015035/dataout/region_restart.day after 11 attempts

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakg.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 2
17:24:50 (5648): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_pnw_zs6b_1994_1_007015035_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
02 Feb 2011 18:47:41 1117997 12300233 hadam3p_pnw_zs6b_1994_1_007015035_0 126,816 287,333 2.2657
31 Jan 2011 23:35:41 1117997 12300233 hadam3p_pnw_zs6b_1994_1_007015035_0 115,296 260,303 2.2577
24 Jan 2011 17:08:48 1117997 12300233 hadam3p_pnw_zs6b_1994_1_007015035_0 103,776 233,977 2.2546
24 Jan 2011 09:29:50 1117997 12300233 hadam3p_pnw_zs6b_1994_1_007015035_0 92,264 208,409 2.2588
24 Jan 2011 00:04:30 1117997 12300233 hadam3p_pnw_zs6b_1994_1_007015035_0 92,256 208,098 2.2557
23 Jan 2011 16:16:18 1117997 12300233 hadam3p_pnw_zs6b_1994_1_007015035_0 80,736 182,269 2.2576
22 Jan 2011 23:13:34 1117997 12300233 hadam3p_pnw_zs6b_1994_1_007015035_0 69,216 156,515 2.2613
22 Jan 2011 15:07:57 1117997 12300233 hadam3p_pnw_zs6b_1994_1_007015035_0 57,696 130,255 2.2576
18 Jan 2011 17:03:23 1117997 12300233 hadam3p_pnw_zs6b_1994_1_007015035_0 46,176 104,122 2.2549
17 Jan 2011 22:36:24 1117997 12300233 hadam3p_pnw_zs6b_1994_1_007015035_0 34,656 78,375 2.2615
17 Jan 2011 14:50:41 1117997 12300233 hadam3p_pnw_zs6b_1994_1_007015035_0 23,136 52,821 2.2831
16 Jan 2011 21:28:53 1117997 12300233 hadam3p_pnw_zs6b_1994_1_007015035_0 11,616 27,202 2.3418


©2024 climateprediction.net