Task 15588642

Name	hadcm3n_4hcw_1940_40_008303110_0
Workunit	8454245
Created	6 Feb 2013, 21:38:51 UTC
Sent	6 Feb 2013, 21:39:07 UTC
Report deadline	9 May 2013, 5:06:18 UTC
Received	6 Mar 2013, 16:59:47 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1210909
Run time	10 days 9 hours 23 min 39 sec
CPU time	10 days 7 hours 56 min 3 sec
Validate state	Invalid
Credit	7,464.96
Device peak FLOPS	2.69 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.60</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 06:30:59 (2104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3984, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3984, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3984, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3984, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3984, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3984, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
05 Mar 2013 07:35:33	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	622,080	863,231	1.3877
04 Mar 2013 22:02:24	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	596,160	828,919	1.3904
04 Mar 2013 12:19:27	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	570,240	794,137	1.3926
04 Mar 2013 02:06:37	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	544,320	757,904	1.3924
03 Mar 2013 16:05:44	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	518,400	721,784	1.3923
03 Mar 2013 06:00:14	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	492,480	685,561	1.3921
02 Mar 2013 19:59:56	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	466,560	649,704	1.3925
02 Mar 2013 09:58:48	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	440,640	613,915	1.3932
01 Mar 2013 23:59:28	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	414,720	578,087	1.3939
01 Mar 2013 13:05:21	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	388,800	542,328	1.3949
01 Mar 2013 03:11:09	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	362,880	506,969	1.3971
28 Feb 2013 17:24:33	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	336,960	471,919	1.4005
28 Feb 2013 07:37:28	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	311,040	437,084	1.4052
27 Feb 2013 21:54:24	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	285,120	402,304	1.4110
27 Feb 2013 12:07:35	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	259,200	367,352	1.4173
27 Feb 2013 02:20:20	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	233,280	332,334	1.4246
26 Feb 2013 16:05:58	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	207,360	296,799	1.4313
26 Feb 2013 15:05:43	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	181,440	259,752	1.4316
25 Feb 2013 19:18:24	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	155,520	222,671	1.4318
25 Feb 2013 09:04:50	1210909	15588642	hadcm3n_4hcw_1940_40_008303110_0	129,600	185,530	1.4316