Task 12173094

Name	hadam3p_pnw_yx6f_1998_1_006898063_0
Workunit	7101379
Created	20 Nov 2010, 13:08:06 UTC
Sent	24 Apr 2011, 11:04:05 UTC
Report deadline	5 Apr 2012, 16:24:05 UTC
Received	18 Jun 2011, 0:16:12 UTC
Server state	Over
Outcome	Success
Client state	Done
Exit status	0 (0x00000000)
Computer ID	859116
Run time	8 days 15 hours 18 min 46 sec
CPU time	7 days 20 hours 4 min 8 sec
Validate state	Workunit error - check skipped
Credit	3,003.83
Device peak FLOPS	1.86 GFLOPS
Application version	UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86
Stderr	<core_client_version>6.6.36</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3496, selfPID=2700, iMonCtr=1
Model crash detected, will try to restart...
17:53:04 (2368): No heartbeat from core client for 30 sec - exiting
17:53:05 (2368): No heartbeat from core client for 30 sec - exiting
17:53:06 (2368): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4028, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4508, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3772, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=740, selfPID=3932, iMonCtr=1
Model crash detected, will try to restart...
C19:35:28 (1752): No heartbeat from core client for 30 sec - exiting
19:35:33 (1752): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2364, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3172, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
19:04:50 (1364): No heartbeat from core client for 30 sec - exiting
19:04:52 (1364): No heartbeat from core client for 30 sec - exiting
19:04:53 (1364): No heartbeat from core client for 30 sec - exiting
19:04:54 (1364): No heartbeat from core client for 30 sec - exiting
19:04:55 (1364): No heartbeat from core client for 30 sec - exiting
19:04:56 (1364): No heartbeat from core client for 30 sec - exiting
19:04:57 (1364): No heartbeat from core client for 30 sec - exiting
19:04:58 (1364): No heartbeat from core client for 30 sec - exiting
19:04:59 (1364): No heartbeat from core client for 30 sec - exiting
19:05:00 (1364): No heartbeat from core client for 30 sec - exiting
19:05:01 (1364): No heartbeat from core client for 30 sec - exiting
19:05:02 (1364): No heartbeat from core client for 30 sec - exiting
19:05:03 (1364): No heartbeat from core client for 30 sec - exiting
19:05:04 (1364): No heartbeat from core client for 30 sec - exiting
19:05:05 (1364): No heartbeat from core client for 30 sec - exiting
19:05:06 (1364): No heartbeat from core client for 30 sec - exiting
19:05:07 (1364): No heartbeat from core client for 30 sec - exiting
19:05:08 (1364): No heartbeat from core client for 30 sec - exiting
19:05:09 (1364): No heartbeat from core client for 30 sec - exiting
19:05:10 (1364): No heartbeat from core client for 30 sec - exiting
19:05:11 (1364): No heartbeat from core client for 30 sec - exiting
19:05:12 (1364): No heartbeat from core client for 30 sec - exiting
19:05:13 (1364): No heartbeat from core client for 30 sec - exiting
19:05:14 (1364): No heartbeat from core client for 30 sec - exiting
19:05:15 (1364): No heartbeat from core client for 30 sec - exiting
19:05:16 (1364): No heartbeat from core client for 30 sec - exiting
19:05:17 (1364): No heartbeat from core client for 30 sec - exiting
19:05:18 (1364): No heartbeat from core client for 30 sec - exiting
19:05:19 (1364): No heartbeat from core client for 30 sec - exiting
19:05:20 (1364): No heartbeat from core client for 30 sec - exiting
19:05:21 (1364): No heartbeat from core client for 30 sec - exiting
19:05:22 (1364): No heartbeat from core client for 30 sec - exiting
19:05:23 (1364): No heartbeat from core client for 30 sec - exiting
19:05:24 (1364): No heartbeat from core client for 30 sec - exiting
19:05:25 (1364): No heartbeat from core client for 30 sec - exiting
19:05:26 (1364): No heartbeat from core client for 30 sec - exiting
19:05:27 (1364): No heartbeat from core client for 30 sec - exiting
19:05:28 (1364): No heartbeat from core client for 30 sec - exiting
19:05:29 (1364): No heartbeat from core client for 30 sec - exiting
19:05:30 (1364): No heartbeat from core client for 30 sec - exiting
19:05:31 (1364): No heartbeat from core client for 30 sec - exiting
19:05:32 (1364): No heartbeat from core client for 30 sec - exiting
19:05:33 (1364): No heartbeat from core client for 30 sec - exiting
19:05:34 (1364): No heartbeat from core client for 30 sec - exiting
19:05:35 (1364): No heartbeat from core client for 30 sec - exiting
19:05:36 (1364): No heartbeat from core client for 30 sec - exiting
19:05:37 (1364): No heartbeat from core client for 30 sec - exiting
19:05:38 (1364): No heartbeat from core client for 30 sec - exiting
19:05:39 (1364): No heartbeat from core client for 30 sec - exiting
19:05:40 (1364): No heartbeat from core client for 30 sec - exiting
19:05:41 (1364): No heartbeat from core client for 30 sec - exiting
19:05:42 (1364): No heartbeat from core client for 30 sec - exiting
19:05:43 (1364): No heartbeat from core client for 30 sec - exiting
19:05:44 (1364): No heartbeat from core client for 30 sec - exiting
19:05:45 (1364): No heartbeat from core client for 30 sec - exiting
19:05:46 (1364): No heartbeat from core client for 30 sec - exiting
19:05:47 (1364): No heartbeat from core client for 30 sec - exiting
19:05:48 (1364): No heartbeat from core client for 30 sec - exiting
19:05:49 (1364): No heartbeat from core client for 30 sec - exiting
19:05:50 (1364): No heartbeat from core client for 30 sec - exiting
19:05:51 (1364): No heartbeat from core client for 30 sec - exiting
19:05:52 (1364): No heartbeat from core client for 30 sec - exiting
19:05:53 (1364): No heartbeat from core client for 30 sec - exiting
19:05:54 (1364): No heartbeat from core client for 30 sec - exiting
19:05:55 (1364): No heartbeat from core client for 30 sec - exiting
19:05:56 (1364): No heartbeat from core client for 30 sec - exiting
19:05:57 (1364): No heartbeat from core client for 30 sec - exiting
19:05:58 (1364): No heartbeat from core client for 30 sec - exiting
19:05:59 (1364): No heartbeat from core client for 30 sec - exiting
19:06:00 (1364): No heartbeat from core client for 30 sec - exiting
19:06:01 (1364): No heartbeat from core client for 30 sec - exiting
19:06:02 (1364): No heartbeat from core client for 30 sec - exiting
19:06:03 (1364): No heartbeat from core client for 30 sec - exiting
19:06:04 (1364): No heartbeat from core client for 30 sec - exiting
19:06:05 (1364): No heartbeat from core client for 30 sec - exiting
19:06:06 (1364): No heartbeat from core client for 30 sec - exiting
19:06:07 (1364): No heartbeat from core client for 30 sec - exiting
19:06:08 (1364): No heartbeat from core client for 30 sec - exiting
19:06:09 (1364): No heartbeat from core client for 30 sec - exiting
19:06:10 (1364): No heartbeat from core client for 30 sec - exiting
19:06:11 (1364): No heartbeat from core client for 30 sec - exiting
19:06:12 (1364): No heartbeat from core client for 30 sec - exiting
19:06:13 (1364): No heartbeat from core client for 30 sec - exiting
19:06:14 (1364): No heartbeat from core client for 30 sec - exiting
19:06:15 (1364): No heartbeat from core client for 30 sec - exiting
19:06:16 (1364): No heartbeat from core client for 30 sec - exiting
19:06:17 (1364): No heartbeat from core client for 30 sec - exiting
19:06:18 (1364): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:06:19 (1364): No heartbeat from core client for 30 sec - exiting
19:06:20 (1364): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1436, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4668, selfPID=2712, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1388, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2960, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=848, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4524, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=488, iMonCtr=2
Model crash detected, will try to restart...
07:56:51 (2516): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4532, selfPID=5796, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3684, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2344, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4584, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4592, selfPID=3976, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2600, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1812, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2028, selfPID=4088, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3320, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2776, selfPID=2688, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2420, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1428, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3084, selfPID=3716, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4092, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2508, selfPID=3584, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3724, selfPID=3724, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4360, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4128, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2152, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2368, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2684, selfPID=960, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3528, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3844, selfPID=644, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1512, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5824, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4292, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1292, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2584, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5476, selfPID=4716, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Leaving CPDN_Main::Monitor...
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
19 Jun 2011 21:58:12	859116	12173094	hadam3p_pnw_yx6f_1998_1_006898063_0	138,242	675,125	4.8836
16 Jun 2011 23:55:24	859116	12173094	hadam3p_pnw_yx6f_1998_1_006898063_0	138,240	674,385	4.8784
12 Jun 2011 13:39:57	859116	12173094	hadam3p_pnw_yx6f_1998_1_006898063_0	126,720	616,110	4.8620
06 Jun 2011 14:19:15	859116	12173094	hadam3p_pnw_yx6f_1998_1_006898063_0	115,296	559,205	4.8502
04 Jun 2011 23:31:49	859116	12173094	hadam3p_pnw_yx6f_1998_1_006898063_0	103,776	507,190	4.8874
02 Jun 2011 22:48:38	859116	12173094	hadam3p_pnw_yx6f_1998_1_006898063_0	92,256	452,566	4.9055
30 May 2011 15:41:04	859116	12173094	hadam3p_pnw_yx6f_1998_1_006898063_0	80,736	399,418	4.9472
28 May 2011 12:50:48	859116	12173094	hadam3p_pnw_yx6f_1998_1_006898063_0	69,216	347,706	5.0235
21 May 2011 21:16:43	859116	12173094	hadam3p_pnw_yx6f_1998_1_006898063_0	57,696	294,658	5.1071
17 May 2011 22:16:56	859116	12173094	hadam3p_pnw_yx6f_1998_1_006898063_0	46,176	239,540	5.1875
14 May 2011 13:44:10	859116	12173094	hadam3p_pnw_yx6f_1998_1_006898063_0	34,656	181,139	5.2268
08 May 2011 17:08:17	859116	12173094	hadam3p_pnw_yx6f_1998_1_006898063_0	23,136	121,419	5.2481
01 May 2011 19:00:46	859116	12173094	hadam3p_pnw_yx6f_1998_1_006898063_0	11,616	63,324	5.4514
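
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the timestep count. The short Python sketch below recomputes it for two rows copied from the table. The function name seconds_per_timestep and the rounding to four decimal places are assumptions made for illustration. The server's actual formula is not stated on this page.

```python
def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return cumulative CPU seconds per model timestep, rounded to 4 places.

    Assumption: the table's Average (sec/TS) is a plain quotient of the
    CPU Time (sec) and Timestep columns; the page does not document this.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (timestep, cpu_time_sec) pairs copied from the newest and oldest trickle rows.
TRICKLES = [
    (138242, 675125),
    (11616, 63324),
]

if __name__ == "__main__":
    for timestep, cpu_time_sec in TRICKLES:
        average = seconds_per_timestep(cpu_time_sec, timestep)
        print(f"{timestep:>7} timesteps  {cpu_time_sec:>7} s  {average:.4f} s/TS")
```

Comparing the recomputed values against the table is a quick sanity check that the CPU time and timestep columns are cumulative totals rather than per-trickle increments.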