climateprediction.net home page
Task 11014325

Task 11014325

Name hadsm3dhet2_jow9_006595259_7
Workunit 6798632
Created 15 Mar 2010, 12:01:03 UTC
Sent 4 Oct 2010, 15:06:37 UTC
Report deadline 16 Sep 2011, 20:26:37 UTC
Received 4 Nov 2010, 15:16:19 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 921061
Run time
CPU time 4 days 12 hours 20 min 46 sec
Validate state Invalid
Credit 1,885.62
Device peak FLOPS 1.43 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>5.10.30</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2736, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2736, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2736, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2736, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2736, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2736, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
31 Oct 2010 15:15:24 921061 11014325 hadsm3dhet2_jow9_006595259_7 205,238 386,114 1.8813
30 Oct 2010 12:00:41 921061 11014325 hadsm3dhet2_jow9_006595259_7 194,436 366,326 1.8840
29 Oct 2010 08:34:14 921061 11014325 hadsm3dhet2_jow9_006595259_7 183,634 342,382 1.8645
27 Oct 2010 12:37:21 921061 11014325 hadsm3dhet2_jow9_006595259_7 172,832 318,449 1.8425
25 Oct 2010 16:09:19 921061 11014325 hadsm3dhet2_jow9_006595259_7 162,030 297,046 1.8333
22 Oct 2010 16:14:12 921061 11014325 hadsm3dhet2_jow9_006595259_7 151,228 278,684 1.8428
21 Oct 2010 22:49:45 921061 11014325 hadsm3dhet2_jow9_006595259_7 140,426 260,615 1.8559
21 Oct 2010 15:08:01 921061 11014325 hadsm3dhet2_jow9_006595259_7 129,624 242,341 1.8696
20 Oct 2010 15:09:36 921061 11014325 hadsm3dhet2_jow9_006595259_7 118,822 224,052 1.8856
19 Oct 2010 15:11:02 921061 11014325 hadsm3dhet2_jow9_006595259_7 108,020 204,520 1.8934
18 Oct 2010 16:20:49 921061 11014325 hadsm3dhet2_jow9_006595259_7 97,218 185,043 1.9034
16 Oct 2010 16:18:38 921061 11014325 hadsm3dhet2_jow9_006595259_7 86,416 164,914 1.9084
15 Oct 2010 16:05:36 921061 11014325 hadsm3dhet2_jow9_006595259_7 75,614 145,717 1.9271
13 Oct 2010 15:18:13 921061 11014325 hadsm3dhet2_jow9_006595259_7 64,812 126,413 1.9505
11 Oct 2010 15:07:58 921061 11014325 hadsm3dhet2_jow9_006595259_7 54,010 104,058 1.9266
10 Oct 2010 16:09:11 921061 11014325 hadsm3dhet2_jow9_006595259_7 43,208 79,357 1.8366
10 Oct 2010 16:09:11 921061 11014325 hadsm3dhet2_jow9_006595259_7 32,406 54,569 1.6839
10 Oct 2010 16:09:11 921061 11014325 hadsm3dhet2_jow9_006595259_7 21,604 37,119 1.7182
10 Oct 2010 16:09:11 921061 11014325 hadsm3dhet2_jow9_006595259_7 10,802 19,802 1.8332


©2024 climateprediction.net