Task 15671569

Name	hadcm3n_4ff5_1980_40_008324125_2
Workunit	8475260
Created	18 Mar 2013, 21:45:17 UTC
Sent	18 Mar 2013, 21:45:27 UTC
Report deadline	18 Jun 2013, 5:12:38 UTC
Received	25 Apr 2013, 16:55:10 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1265083
Run time	8 days 18 hours 49 min 42 sec
CPU time	7 days 11 hours 6 min 40 sec
Validate state	Invalid
Credit	5,287.68
Device peak FLOPS	3.35 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6788, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3980, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3980, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3980, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3980, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3980, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3980, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish 16:51:03 (3980): No heartbeat from core client for 30 sec - exiting 16:51:04 (3980): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3708, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 14:00:22 (2752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2988, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:50:54 (3448): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4928, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4256, iMonCtr=1 Model crash detected, will try to restart... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
24 Apr 2013 13:30:00	1265083	15671569	hadcm3n_4ff5_1980_40_008324125_2	440,640	717,619	1.6286
24 Apr 2013 02:58:13	1265083	15671569	hadcm3n_4ff5_1980_40_008324125_2	414,720	680,439	1.6407
23 Apr 2013 16:26:02	1265083	15671569	hadcm3n_4ff5_1980_40_008324125_2	388,800	644,066	1.6565
18 Apr 2013 22:45:10	1265083	15671569	hadcm3n_4ff5_1980_40_008324125_2	362,880	607,479	1.6740
17 Apr 2013 16:05:53	1265083	15671569	hadcm3n_4ff5_1980_40_008324125_2	336,960	571,957	1.6974
15 Apr 2013 22:59:11	1265083	15671569	hadcm3n_4ff5_1980_40_008324125_2	311,040	535,321	1.7211
12 Apr 2013 19:32:03	1265083	15671569	hadcm3n_4ff5_1980_40_008324125_2	285,120	498,790	1.7494
11 Apr 2013 17:37:23	1265083	15671569	hadcm3n_4ff5_1980_40_008324125_2	259,200	462,314	1.7836
10 Apr 2013 05:47:07	1265083	15671569	hadcm3n_4ff5_1980_40_008324125_2	233,280	425,584	1.8243
09 Apr 2013 19:05:29	1265083	15671569	hadcm3n_4ff5_1980_40_008324125_2	207,360	389,186	1.8769
08 Apr 2013 16:21:04	1265083	15671569	hadcm3n_4ff5_1980_40_008324125_2	181,440	353,268	1.9470
05 Apr 2013 06:13:34	1265083	15671569	hadcm3n_4ff5_1980_40_008324125_2	155,520	317,974	2.0446
04 Apr 2013 19:48:11	1265083	15671569	hadcm3n_4ff5_1980_40_008324125_2	129,600	283,224	2.1854
04 Apr 2013 00:57:39	1265083	15671569	hadcm3n_4ff5_1980_40_008324125_2	103,680	249,727	2.4086
26 Mar 2013 22:57:15	1265083	15671569	hadcm3n_4ff5_1980_40_008324125_2	77,760	104,725	1.3468
25 Mar 2013 21:15:38	1265083	15671569	hadcm3n_4ff5_1980_40_008324125_2	51,840	69,507	1.3408
22 Mar 2013 00:16:20	1265083	15671569	hadcm3n_4ff5_1980_40_008324125_2	25,920	35,501	1.3696