climateprediction.net home page
Task 11026694

Task 11026694

Name hadsm3dhet2_jpum_006596496_6
Workunit 6799869
Created 15 Mar 2010, 12:02:45 UTC
Sent 30 Sep 2010, 11:46:05 UTC
Report deadline 12 Sep 2011, 17:06:05 UTC
Received 3 Jan 2011, 5:25:07 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1033866
Run time
CPU time 31 days 5 hours 4 min 21 sec
Validate state Invalid
Credit 5,954.60
Device peak FLOPS 1.47 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2520, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3332, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3796, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9852, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
MainError:	03:26:00 AM	No files match the supplied pattern.
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7408, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3980, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6148, iMonCtr=1
Model crash detected, will try to restart...
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
MainError:	10:46:04 AM	No files match the supplied pattern.
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...

Model crashed: (null)

Model crashed: (null)

Model crashed: (null)

Model crashed: (null)

Model crashed: (null)

Model crashed: (null)
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
01 Jan 2011 04:52:27 1033866 11026694 hadsm3dhet2_jpum_006596496_6 129,624 2,670,478 4.1203
30 Dec 2010 07:12:58 1033866 11026694 hadsm3dhet2_jpum_006596496_6 118,822 2,630,200 4.1270
29 Dec 2010 00:06:19 1033866 11026694 hadsm3dhet2_jpum_006596496_6 108,020 2,587,375 4.1298
27 Dec 2010 11:35:28 1033866 11026694 hadsm3dhet2_jpum_006596496_6 97,218 2,545,339 4.1340
26 Dec 2010 11:35:02 1033866 11026694 hadsm3dhet2_jpum_006596496_6 86,416 2,498,183 4.1298
23 Dec 2010 19:38:49 1033866 11026694 hadsm3dhet2_jpum_006596496_6 75,614 2,450,846 4.1252
21 Dec 2010 23:09:37 1033866 11026694 hadsm3dhet2_jpum_006596496_6 64,812 2,401,222 4.1166
19 Dec 2010 18:16:05 1033866 11026694 hadsm3dhet2_jpum_006596496_6 54,010 2,352,783 4.1096
16 Dec 2010 18:41:21 1033866 11026694 hadsm3dhet2_jpum_006596496_6 43,208 2,306,829 4.1068
15 Dec 2010 00:46:50 1033866 11026694 hadsm3dhet2_jpum_006596496_6 32,406 2,259,039 4.1006
13 Dec 2010 12:01:26 1033866 11026694 hadsm3dhet2_jpum_006596496_6 21,604 2,210,323 4.0924
11 Dec 2010 19:01:04 1033866 11026694 hadsm3dhet2_jpum_006596496_6 10,802 2,162,431 4.0855
10 Dec 2010 10:49:59 1033866 11026694 hadsm3dhet2_jpum_006596496_6 259,248 2,117,668 4.0843
06 Dec 2010 21:20:20 1033866 11026694 hadsm3dhet2_jpum_006596496_6 248,446 2,069,457 4.0762
05 Dec 2010 09:10:13 1033866 11026694 hadsm3dhet2_jpum_006596496_6 237,644 2,022,269 4.0698
03 Dec 2010 12:24:19 1033866 11026694 hadsm3dhet2_jpum_006596496_6 226,842 1,977,149 4.0675
01 Dec 2010 12:28:03 1033866 11026694 hadsm3dhet2_jpum_006596496_6 216,040 1,932,878 4.0668
29 Nov 2010 18:51:59 1033866 11026694 hadsm3dhet2_jpum_006596496_6 205,238 1,887,943 4.0646
28 Nov 2010 19:37:04 1033866 11026694 hadsm3dhet2_jpum_006596496_6 194,436 1,846,885 4.0709
27 Nov 2010 07:44:39 1033866 11026694 hadsm3dhet2_jpum_006596496_6 183,634 1,803,580 4.0724


©2024 climateprediction.net