Task 15436676

Name	hadcm3n_zg4g_1920_40_008244913_1
Workunit	8400037
Created	15 Nov 2012, 17:24:39 UTC
Sent	15 Nov 2012, 17:24:48 UTC
Report deadline	15 Feb 2013, 0:51:59 UTC
Received	30 Nov 2012, 10:53:05 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1169024
Run time	8 days 12 hours 13 min 5 sec
CPU time	8 days 10 hours 12 min 59 sec
Validate state	Invalid
Credit	5,287.68
Device peak FLOPS	2.47 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2988, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5012, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6116, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3992, iMonCtr=1 Model crash detected, will try to restart... 08:16:36 (5388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6004, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6004, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6004, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6004, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6004, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6004, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
29 Nov 2012 13:38:37	1169024	15436676	hadcm3n_zg4g_1920_40_008244913_1	440,640	695,308	1.5780
28 Nov 2012 14:53:14	1169024	15436676	hadcm3n_zg4g_1920_40_008244913_1	414,720	654,164	1.5774
27 Nov 2012 18:15:04	1169024	15436676	hadcm3n_zg4g_1920_40_008244913_1	388,800	613,287	1.5774
26 Nov 2012 21:10:35	1169024	15436676	hadcm3n_zg4g_1920_40_008244913_1	362,880	572,549	1.5778
26 Nov 2012 10:30:30	1169024	15436676	hadcm3n_zg4g_1920_40_008244913_1	336,960	531,679	1.5779
25 Nov 2012 14:01:32	1169024	15436676	hadcm3n_zg4g_1920_40_008244913_1	311,040	490,882	1.5782
24 Nov 2012 17:25:30	1169024	15436676	hadcm3n_zg4g_1920_40_008244913_1	285,120	450,086	1.5786
23 Nov 2012 16:15:07	1169024	15436676	hadcm3n_zg4g_1920_40_008244913_1	259,200	409,232	1.5788
22 Nov 2012 18:57:58	1169024	15436676	hadcm3n_zg4g_1920_40_008244913_1	233,280	368,312	1.5788
21 Nov 2012 21:50:07	1169024	15436676	hadcm3n_zg4g_1920_40_008244913_1	207,360	327,320	1.5785
21 Nov 2012 10:28:48	1169024	15436676	hadcm3n_zg4g_1920_40_008244913_1	181,440	286,675	1.5800
20 Nov 2012 15:21:58	1169024	15436676	hadcm3n_zg4g_1920_40_008244913_1	155,520	246,036	1.5820
19 Nov 2012 17:45:34	1169024	15436676	hadcm3n_zg4g_1920_40_008244913_1	129,600	205,345	1.5845
18 Nov 2012 21:36:51	1169024	15436676	hadcm3n_zg4g_1920_40_008244913_1	103,680	164,262	1.5843
18 Nov 2012 10:04:59	1169024	15436676	hadcm3n_zg4g_1920_40_008244913_1	77,760	123,033	1.5822
17 Nov 2012 13:26:00	1169024	15436676	hadcm3n_zg4g_1920_40_008244913_1	51,840	81,791	1.5778
16 Nov 2012 14:41:35	1169024	15436676	hadcm3n_zg4g_1920_40_008244913_1	25,920	40,973	1.5807