Task 11026541

Name	hadsm3dhet2_jpu7_006596481_3
Workunit	6799854
Created	15 Mar 2010, 12:02:44 UTC
Sent	30 Sep 2010, 12:39:05 UTC
Report deadline	12 Sep 2011, 17:59:05 UTC
Received	6 Oct 2010, 3:49:21 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1076018
Run time	4 days 8 hours 15 min 26 sec
CPU time	4 days 4 hours 10 min 59 sec
Validate state	Invalid
Credit	1,488.65
Device peak FLOPS	2.31 GFLOPS
Application version	UK Met Office HadSM3 Slab Model v6.07 windows_intelx86
Stderr	<core_client_version>6.10.18</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3184, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6140, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6140, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6140, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6140, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6140, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6140, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
06 Oct 2010 03:45:22	1076018	11026541	hadsm3dhet2_jpu7_006596481_3	162,030	346,403	2.1379
06 Oct 2010 03:45:22	1076018	11026541	hadsm3dhet2_jpu7_006596481_3	151,228	324,102	2.1431
06 Oct 2010 03:45:22	1076018	11026541	hadsm3dhet2_jpu7_006596481_3	140,426	301,296	2.1456
04 Oct 2010 22:19:07	1076018	11026541	hadsm3dhet2_jpu7_006596481_3	129,624	277,920	2.1440
04 Oct 2010 14:03:15	1076018	11026541	hadsm3dhet2_jpu7_006596481_3	118,822	254,247	2.1397
04 Oct 2010 04:40:36	1076018	11026541	hadsm3dhet2_jpu7_006596481_3	108,020	229,979	2.1290
03 Oct 2010 20:13:27	1076018	11026541	hadsm3dhet2_jpu7_006596481_3	97,218	207,086	2.1301
03 Oct 2010 12:06:29	1076018	11026541	hadsm3dhet2_jpu7_006596481_3	86,416	184,548	2.1356
03 Oct 2010 03:04:26	1076018	11026541	hadsm3dhet2_jpu7_006596481_3	75,614	160,289	2.1198
02 Oct 2010 18:46:38	1076018	11026541	hadsm3dhet2_jpu7_006596481_3	64,812	137,424	2.1203
02 Oct 2010 10:25:21	1076018	11026541	hadsm3dhet2_jpu7_006596481_3	54,010	114,533	2.1206
02 Oct 2010 03:50:53	1076018	11026541	hadsm3dhet2_jpu7_006596481_3	43,208	91,240	2.1116
01 Oct 2010 17:44:27	1076018	11026541	hadsm3dhet2_jpu7_006596481_3	32,406	68,686	2.1195
01 Oct 2010 09:40:01	1076018	11026541	hadsm3dhet2_jpu7_006596481_3	21,604	46,170	2.1371
01 Oct 2010 02:58:42	1076018	11026541	hadsm3dhet2_jpu7_006596481_3	10,802	22,553	2.0879