climateprediction.net home page
Task 12253438

Task 12253438

Name hadam3p_pnw_zk3q_1997_1_006970974_0
Workunit 7174290
Created 23 Nov 2010, 12:27:05 UTC
Sent 14 Feb 2011, 1:22:12 UTC
Report deadline 27 Jan 2012, 6:42:12 UTC
Received 3 Jun 2011, 7:29:12 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1106458
Run time 4 days 13 hours 27 min 53 sec
CPU time 4 days 9 hours 53 min 16 sec
Validate state Invalid
Credit 2,755.56
Device peak FLOPS 3.14 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.08
windows_intelx86
Stderr
<core_client_version>6.12.26</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5780, selfPID=5824, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5800, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5652, selfPID=5652, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7704, selfPID=7600, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6116, selfPID=4764, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3928, selfPID=3928, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5940, selfPID=1892, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5796, selfPID=4288, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5488, selfPID=5488, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
19:49:42 (3564): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5356, selfPID=5356, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2992, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2976, selfPID=3792, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5864, selfPID=4604, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5556, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5944, selfPID=4964, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:48:14 (5808): No heartbeat from core client for 30 sec - exiting
19:48:15 (5808): No heartbeat from core client for 30 sec - exiting
19:48:16 (5808): No heartbeat from core client for 30 sec - exiting
19:48:18 (5808): No heartbeat from core client for 30 sec - exiting
19:48:19 (5808): No heartbeat from core client for 30 sec - exiting
19:48:20 (5808): No heartbeat from core client for 30 sec - exiting
19:48:21 (5808): No heartbeat from core client for 30 sec - exiting
19:48:22 (5808): No heartbeat from core client for 30 sec - exiting
19:48:23 (5808): No heartbeat from core client for 30 sec - exiting
19:48:24 (5808): No heartbeat from core client for 30 sec - exiting
19:48:25 (5808): No heartbeat from core client for 30 sec - exiting
19:48:26 (5808): No heartbeat from core client for 30 sec - exiting
19:48:27 (5808): No heartbeat from core client for 30 sec - exiting
19:48:28 (5808): No heartbeat from core client for 30 sec - exiting
19:48:30 (5808): No heartbeat from core client for 30 sec - exiting
19:48:31 (5808): No heartbeat from core client for 30 sec - exiting
19:48:32 (5808): No heartbeat from core client for 30 sec - exiting
19:48:33 (5808): No heartbeat from core client for 30 sec - exiting
19:48:34 (5808): No heartbeat from core client for 30 sec - exiting
19:48:35 (5808): No heartbeat from core client for 30 sec - exiting
19:48:36 (5808): No heartbeat from core client for 30 sec - exiting
19:48:37 (5808): No heartbeat from core client for 30 sec - exiting
19:48:38 (5808): No heartbeat from core client for 30 sec - exiting
19:48:39 (5808): No heartbeat from core client for 30 sec - exiting
19:48:40 (5808): No heartbeat from core client for 30 sec - exiting
19:48:42 (5808): No heartbeat from core client for 30 sec - exiting
19:48:43 (5808): No heartbeat from core client for 30 sec - exiting
19:48:44 (5808): No heartbeat from core client for 30 sec - exiting
19:48:45 (5808): No heartbeat from core client for 30 sec - exiting
19:48:46 (5808): No heartbeat from core client for 30 sec - exiting
19:48:47 (5808): No heartbeat from core client for 30 sec - exiting
19:48:48 (5808): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:59:12 (1968): No heartbeat from core client for 30 sec - exiting
18:59:14 (1968): No heartbeat from core client for 30 sec - exiting
18:59:15 (1968): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:42:03 (5004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:18:57 (3180): No heartbeat from core client for 30 sec - exiting
22:19:02 (3180): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
18:45:26 (4164): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:45:27 (4164): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5876, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5976, selfPID=5192, iMonCtr=1
Model crash detected, will try to restart...
20:05:53 (5436): No heartbeat from core client for 30 sec - exiting
20:05:54 (5436): No heartbeat from core client for 30 sec - exiting
20:05:55 (5436): No heartbeat from core client for 30 sec - exiting
20:05:56 (5436): No heartbeat from core client for 30 sec - exiting
20:05:57 (5436): No heartbeat from core client for 30 sec - exiting
20:05:58 (5436): No heartbeat from core client for 30 sec - exiting
20:05:59 (5436): No heartbeat from core client for 30 sec - exiting
20:06:00 (5436): No heartbeat from core client for 30 sec - exiting
20:06:01 (5436): No heartbeat from core client for 30 sec - exiting
20:06:02 (5436): No heartbeat from core client for 30 sec - exiting
20:06:03 (5436): No heartbeat from core client for 30 sec - exiting
20:06:04 (5436): No heartbeat from core client for 30 sec - exiting
20:06:05 (5436): No heartbeat from core client for 30 sec - exiting
20:06:06 (5436): No heartbeat from core client for 30 sec - exiting
20:06:07 (5436): No heartbeat from core client for 30 sec - exiting
20:06:08 (5436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5940, selfPID=4184, iMonCtr=1
Model crash detected, will try to restart...
00:18:27 (6356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5172, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CCoegionnt Wolrek::rCPD CPDN processnot runt runniniting, bRet Val = 1,= 1, chPID=2816, selfPID=2344, iMonCtMonC
Mode= 2
ash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 11
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_pnw_zk3q_1997_1_006970974/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_pnw_zk3q_1997_1_006970974/dataout/region_restart.day after 11 attempts

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakg.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 4
22:49:49 (4060): called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_zk3q_1997_1_006970974_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Mar 2011 11:19:16 1106458 12253438 hadam3p_pnw_zk3q_1997_1_006970974_0 126,816 361,803 2.8530
17 Mar 2011 20:57:33 1106458 12253438 hadam3p_pnw_zk3q_1997_1_006970974_0 115,296 328,141 2.8461
14 Mar 2011 13:22:21 1106458 12253438 hadam3p_pnw_zk3q_1997_1_006970974_0 103,782 294,512 2.8378
14 Mar 2011 10:40:35 1106458 12253438 hadam3p_pnw_zk3q_1997_1_006970974_0 103,776 294,157 2.8345
12 Mar 2011 07:37:56 1106458 12253438 hadam3p_pnw_zk3q_1997_1_006970974_0 92,256 260,259 2.8211
11 Mar 2011 14:50:44 1106458 12253438 hadam3p_pnw_zk3q_1997_1_006970974_0 80,736 226,813 2.8093
08 Mar 2011 10:22:43 1106458 12253438 hadam3p_pnw_zk3q_1997_1_006970974_0 69,216 194,055 2.8036
08 Mar 2011 10:22:43 1106458 12253438 hadam3p_pnw_zk3q_1997_1_006970974_0 57,696 161,996 2.8078
08 Mar 2011 10:22:43 1106458 12253438 hadam3p_pnw_zk3q_1997_1_006970974_0 46,176 127,997 2.7719
08 Mar 2011 10:22:43 1106458 12253438 hadam3p_pnw_zk3q_1997_1_006970974_0 34,656 94,181 2.7176
26 Feb 2011 08:45:09 1106458 12253438 hadam3p_pnw_zk3q_1997_1_006970974_0 23,136 61,245 2.6472
24 Feb 2011 16:02:01 1106458 12253438 hadam3p_pnw_zk3q_1997_1_006970974_0 11,616 29,669 2.5541


©2024 climateprediction.net