Task 13125414

Name	hadcm3n_yljd_1900_40_007360771_1
Workunit	7558201
Created	6 Jul 2011, 15:14:52 UTC
Sent	7 Jul 2011, 16:56:29 UTC
Report deadline	7 Oct 2011, 0:23:40 UTC
Received	3 Sep 2011, 16:31:07 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	893745
Run time	14 days 4 hours 50 min 1 sec
CPU time	13 days 1 hours 45 min 5 sec
Validate state	Invalid
Credit	9,331.20
Device peak FLOPS	2.91 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.18</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5124, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1 Model crash detected, will try to restart... 17:54:39 (4148): No heartbeat from core client for 30 sec - exiting 17:54:40 (4148): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:54:44 (2320): Can't acquire lockfile (32) - waiting 35s Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2320, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5376, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5172, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5036, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4916, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5540, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5876, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
03 Sep 2011 15:31:54	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	777,600	1,129,496	1.4525
31 Aug 2011 19:50:17	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	751,680	1,091,682	1.4523
08 Aug 2011 20:12:29	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	725,760	1,053,966	1.4522
07 Aug 2011 11:52:17	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	699,840	1,016,075	1.4519
07 Aug 2011 00:49:16	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	673,920	978,415	1.4518
06 Aug 2011 13:49:44	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	648,000	940,941	1.4521
04 Aug 2011 18:31:43	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	622,080	903,473	1.4523
31 Jul 2011 19:34:01	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	596,160	865,037	1.4510
31 Jul 2011 08:12:08	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	570,240	826,798	1.4499
30 Jul 2011 20:46:24	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	544,320	788,431	1.4485
30 Jul 2011 09:07:27	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	518,400	749,170	1.4452
29 Jul 2011 21:32:31	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	492,480	710,596	1.4429
27 Jul 2011 20:39:19	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	466,560	672,493	1.4414
25 Jul 2011 23:04:17	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	440,640	634,354	1.4396
25 Jul 2011 22:00:32	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	414,720	596,386	1.4380
25 Jul 2011 21:13:04	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	388,800	558,410	1.4362
25 Jul 2011 20:45:06	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	362,880	520,147	1.4334
25 Jul 2011 20:27:35	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	336,960	482,368	1.4315
25 Jul 2011 18:55:43	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	311,040	444,547	1.4292
25 Jul 2011 18:20:12	893745	13125414	hadcm3n_yljd_1900_40_007360771_1	285,120	407,057	1.4277