Task 17255369

Name	hadcm3n_sccz_1940_40_009113398_0
Workunit	9243734
Created	22 Oct 2014, 15:17:01 UTC
Sent	23 Oct 2014, 14:54:43 UTC
Report deadline	22 Jan 2015, 22:21:54 UTC
Received	10 Dec 2014, 13:02:04 UTC
Server state	Over
Outcome	Success
Client state	Done
Exit status	0 (0x00000000)
Computer ID	1169024
Run time	18 days 16 hours 38 min 37 sec
CPU time	18 days 13 hours 38 min 59 sec
Validate state	Valid
Credit	12,441.60
Device peak FLOPS	2.43 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6976, iMonCtr=1
Model crash detected, will try to restart...
09:00:53 (6652): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:55:09 (6900): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:48:14 (5028): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6932, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6932, iMonCtr=1
Model crash detected, will try to restart...
08:53:49 (6424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:37:14 (6476): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:00:05 (7048): No heartbeat from core client for 30 sec - exiting
08:00:06 (7048): No heartbeat from core client for 30 sec - exiting
08:00:07 (7048): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6916, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6916, iMonCtr=1
Model crash detected, will try to restart...
07:22:52 (6604): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6448, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6448, iMonCtr=1
Model crash detected, will try to restart...
07:27:38 (3884): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
13:54:19 (5448): No heartbeat from core client for 30 sec - exiting
13:54:20 (5448): No heartbeat from core client for 30 sec - exiting
13:54:21 (5448): No heartbeat from core client for 30 sec - exiting
13:54:22 (5448): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:22:51 (6084): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6836, iMonCtr=1
Model crash detected, will try to restart...
07:53:17 (3648): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:57:19 (5048): No heartbeat from core client for 30 sec - exiting
07:57:20 (5048): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6644, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6852, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6756, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6928, iMonCtr=1
Model crash detected, will try to restart...
08:40:44 (5924): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4656, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
08:39:50 (6996): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:37:56 (6404): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Called boinc_finish
</stderr_txt>
]]>
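The stderr output repeats a small set of message types (model crash and restart, missed heartbeat, suspend request, I/O error, finish). As a rough sketch for tallying them, the snippet below counts each type in a short excerpt copied from the log. The category names and the `tally` helper are illustrative assumptions, not part of BOINC or CPDN.

```python
import re
from collections import Counter

# Short excerpt copied from the stderr output (not the full log).
sample = (
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
    "checkPID=0, selfPID=6976, iMonCtr=1 "
    "Model crash detected, will try to restart... "
    "09:00:53 (6652): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "Suspended CPDN Monitor - Suspend request from BOINC... "
    "BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 "
    "Called boinc_finish"
)

# Hypothetical category names mapped to substrings seen in the log.
PATTERNS = {
    "model_crash": r"Model crash detected",
    "no_heartbeat": r"No heartbeat from core client",
    "suspend": r"Suspend request from BOINC",
    "io_error": r"BUFFIN: C I/O Error",
    "finished": r"Called boinc_finish",
}

def tally(text):
    """Count non-overlapping occurrences of each message type in text."""
    return Counter({name: len(re.findall(pat, text))
                    for name, pat in PATTERNS.items()})

counts = tally(sample)
for name, n in counts.items():
    print(f"{name}: {n}")
```

Running `tally` over the complete stderr text instead of the excerpt would give the totals for this task.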

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
10 Dec 2014 11:55:50	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	1,036,800	1,603,707	1.5468
09 Dec 2014 14:45:57	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	1,010,880	1,563,387	1.5466
08 Dec 2014 17:37:18	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	984,960	1,523,068	1.5463
07 Dec 2014 20:03:49	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	959,040	1,482,697	1.5460
06 Dec 2014 20:32:06	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	933,120	1,442,188	1.5456
06 Dec 2014 09:18:34	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	907,200	1,401,855	1.5453
05 Dec 2014 11:15:47	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	881,280	1,361,596	1.5450
04 Dec 2014 11:19:15	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	855,360	1,321,406	1.5449
03 Dec 2014 13:49:59	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	829,440	1,281,240	1.5447
02 Dec 2014 16:12:47	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	803,520	1,241,096	1.5446
01 Dec 2014 17:57:52	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	777,600	1,200,893	1.5444
30 Nov 2014 20:41:42	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	751,680	1,167,236	1.5528
30 Nov 2014 09:20:22	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	725,760	1,126,728	1.5525
29 Nov 2014 10:58:03	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	699,840	1,086,331	1.5523
28 Nov 2014 07:13:30	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	673,920	1,046,141	1.5523
27 Nov 2014 11:46:26	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	648,000	1,005,695	1.5520
26 Nov 2014 15:12:33	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	622,080	965,277	1.5517
25 Nov 2014 18:22:51	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	596,160	924,869	1.5514
24 Nov 2014 21:45:09	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	570,240	884,495	1.5511
24 Nov 2014 10:45:46	1169024	17255369	hadcm3n_sccz_1940_40_009113398_0	544,320	844,262	1.5510
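
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that assumption against a few rows copied from the table. The `sec_per_timestep` helper and the four-decimal rounding are assumptions made for illustration.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_avg).
rows = [
    (1_036_800, 1_603_707, 1.5468),
    (1_010_880, 1_563_387, 1.5466),
    (777_600, 1_200_893, 1.5444),
    (751_680, 1_167_236, 1.5528),
    (544_320, 844_262, 1.5510),
]

def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return round(cpu_time_sec / timestep, 4)

# Collect any row whose reported average differs from the computed one
# by more than the table's four-decimal rounding.
mismatches = [
    (ts, cpu, reported)
    for ts, cpu, reported in rows
    if abs(cpu / ts - reported) >= 1e-4
]

for ts, cpu, reported in rows:
    print(ts, sec_per_timestep(cpu, ts), reported)
print("mismatches:", mismatches)
```

For the sampled rows the computed values agree with the reported column to four decimal places, which is consistent with the assumed definition.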