Task 13318965

Name	hadcm3n_o1gx_1940_40_007433211_0
Workunit	7630714
Created	31 Aug 2011, 21:29:02 UTC
Sent	1 Sep 2011, 0:39:25 UTC
Report deadline	1 Dec 2011, 8:06:36 UTC
Received	27 Sep 2011, 8:38:17 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1099430
Run time	10 days 1 hours 5 min 35 sec
CPU time	9 days 12 hours 13 min 30 sec
Validate state	Invalid
Credit	7,776.00
Device peak FLOPS	2.99 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:56:56 (5004): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:56:58 (5004): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5872, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5872, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5872, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5872, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5872, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5872, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
25 Sep 2011 05:38:45	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	648,000	816,360	1.2598
24 Sep 2011 20:04:52	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	622,080	783,146	1.2589
24 Sep 2011 10:27:30	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	596,160	750,003	1.2581
23 Sep 2011 23:55:38	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	570,240	716,761	1.2569
23 Sep 2011 13:59:54	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	544,320	683,824	1.2563
22 Sep 2011 21:20:44	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	518,400	650,849	1.2555
22 Sep 2011 11:40:30	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	492,480	618,181	1.2552
22 Sep 2011 01:02:46	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	466,560	585,689	1.2553
21 Sep 2011 15:27:02	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	440,640	553,236	1.2555
21 Sep 2011 04:08:43	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	414,720	521,005	1.2563
20 Sep 2011 18:25:09	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	388,800	488,746	1.2571
20 Sep 2011 08:53:36	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	362,880	456,374	1.2576
19 Sep 2011 23:15:53	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	336,960	424,162	1.2588
19 Sep 2011 10:54:47	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	311,040	391,695	1.2593
18 Sep 2011 09:27:33	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	285,120	359,166	1.2597
17 Sep 2011 22:53:28	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	259,200	326,555	1.2599
17 Sep 2011 09:40:41	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	233,280	293,980	1.2602
16 Sep 2011 23:50:46	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	207,360	261,067	1.2590
16 Sep 2011 14:07:35	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	181,440	227,924	1.2562
16 Sep 2011 04:04:25	1099430	13318965	hadcm3n_o1gx_1940_40_007433211_0	155,520	194,478	1.2505