Task 16824275

Name	hadcm3n_8epm_1980_40_008728485_1
Workunit	8874463
Created	26 Jul 2014, 0:26:34 UTC
Sent	26 Jul 2014, 0:26:44 UTC
Report deadline	25 Oct 2014, 7:53:55 UTC
Received	8 Sep 2014, 21:39:15 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1323789
Run time	5 days 12 hours 36 min 25 sec
CPU time	2 days 23 hours 28 min 21 sec
Validate state	Invalid
Credit	8,398.08
Device peak FLOPS	3.69 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:29:12 (7672): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:29:13 (7672): No heartbeat from core client for 30 sec - exiting
07:29:14 (7672): No heartbeat from core client for 30 sec - exiting
07:29:15 (7672): No heartbeat from core client for 30 sec - exiting
07:29:17 (7672): No heartbeat from core client for 30 sec - exiting
07:29:18 (7672): No heartbeat from core client for 30 sec - exiting
07:29:19 (7672): No heartbeat from core client for 30 sec - exiting
07:29:20 (7672): No heartbeat from core client for 30 sec - exiting
08:36:15 (5100): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:36:16 (5100): No heartbeat from core client for 30 sec - exiting
08:36:17 (5100): No heartbeat from core client for 30 sec - exiting
BUFFOUT: C I/O Error - Return code = 32
Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8840, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8840, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8840, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8840, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8840, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
08 Sep 2014 21:44:38	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	699,840	460,019	0.6573
08 Sep 2014 21:44:38	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	673,920	442,660	0.6568
08 Sep 2014 21:44:38	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	648,000	425,264	0.6563
08 Sep 2014 21:44:38	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	622,080	407,884	0.6557
08 Sep 2014 21:44:38	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	596,160	390,504	0.6550
08 Sep 2014 21:44:38	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	570,240	373,131	0.6543
08 Sep 2014 21:44:38	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	544,320	355,756	0.6536
29 Aug 2014 09:25:46	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	518,400	338,526	0.6530
29 Aug 2014 09:25:20	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	492,480	321,485	0.6528
29 Aug 2014 09:24:38	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	466,560	304,449	0.6525
29 Aug 2014 01:38:13	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	440,640	287,370	0.6522
26 Aug 2014 22:27:03	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	414,720	270,316	0.6518
26 Aug 2014 07:26:16	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	388,800	253,240	0.6513
26 Aug 2014 01:58:38	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	362,880	236,354	0.6513
25 Aug 2014 21:42:45	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	336,960	219,487	0.6514
25 Aug 2014 16:17:19	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	311,040	202,629	0.6515
25 Aug 2014 12:02:40	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	285,120	185,671	0.6512
25 Aug 2014 07:33:20	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	259,200	168,800	0.6512
25 Aug 2014 02:16:34	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	233,280	151,954	0.6514
24 Aug 2014 21:43:34	1323789	16824275	hadcm3n_8epm_1980_40_008728485_1	207,360	135,180	0.6519
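
Judging from the figures in the table, the Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. This is an inference from the numbers, not something the page states. A minimal Python sketch of that arithmetic follows; the function name average_sec_per_timestep is hypothetical, and the sample rows are copied from the table above.

```python
def average_sec_per_timestep(cpu_time_sec: float, timestep: int) -> float:
    """Return CPU seconds per model timestep, rounded to 4 decimal places.

    Assumption: the page's "Average (sec/TS)" column is simply
    cumulative CPU time / cumulative timestep count.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (timestep, cpu_time_sec) pairs copied from the trickle table
rows = [
    (699_840, 460_019),  # most recent trickle
    (673_920, 442_660),
    (207_360, 135_180),  # oldest trickle shown
]

for timestep, cpu_time_sec in rows:
    print(timestep, average_sec_per_timestep(cpu_time_sec, timestep))
```

Successive trickles in the table are 25,920 timesteps apart, so the slow rise in the average reflects slightly more CPU time per timestep later in the run.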