Task 11923248

Name hadsm3dhet2_u4co_006670033_12
Workunit 6873287
Created 5 Oct 2010, 16:02:55 UTC
Sent 5 Oct 2010, 16:10:56 UTC
Report deadline 17 Sep 2011, 21:30:56 UTC
Received 8 Nov 2010, 15:14:11 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1105550
Run time 13 days 3 hours 50 min 59 sec
CPU time 11 days 18 hours 41 min 3 sec
Validate state Invalid
Credit 7,046.28
Device peak FLOPS 2.92 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
error:  cannot delete old hadsm3dhet2_u4co_006670033/jobs/climate.spin
error:  cannot delete old hadsm3dhet2_u4co_006670033/jobs/climate.cont
error:  cannot delete old hadsm3dhet2_u4co_006670033/jobs/climate.doub
error:  cannot delete old hadsm3dhet2_u4co_006670033/jobs/ncatts.cpdc
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
error:  cannot delete old hadsm3dhet2_u4co_006670033/jobs/climate.spin
error:  cannot delete old hadsm3dhet2_u4co_006670033/jobs/climate.cont
error:  cannot delete old hadsm3dhet2_u4co_006670033/jobs/climate.doub
error:  cannot delete old hadsm3dhet2_u4co_006670033/jobs/ncatts.cpdc
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
MainError:	08:54:05 AM	No files match the supplied pattern.
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
MainError:	09:44:26 AM	No files match the supplied pattern.
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4696, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4696, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4696, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4696, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4696, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4696, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                    Timestep  CPU Time (sec)  Average (sec/TS)
06 Nov 2010 22:18:56  1105550  11923248   hadsm3dhet2_u4co_006670033_12  248,446   1,007,071       1.3131
06 Nov 2010 06:45:44  1105550  11923248   hadsm3dhet2_u4co_006670033_12  237,644   993,098         1.3134
04 Nov 2010 10:25:55  1105550  11923248   hadsm3dhet2_u4co_006670033_12  226,842   979,042         1.3136
03 Nov 2010 13:43:15  1105550  11923248   hadsm3dhet2_u4co_006670033_12  216,040   964,923         1.3136
03 Nov 2010 05:05:30  1105550  11923248   hadsm3dhet2_u4co_006670033_12  205,238   951,437         1.3146
02 Nov 2010 11:04:40  1105550  11923248   hadsm3dhet2_u4co_006670033_12  194,436   937,703         1.3153
01 Nov 2010 23:09:07  1105550  11923248   hadsm3dhet2_u4co_006670033_12  183,634   924,146         1.3162
01 Nov 2010 09:17:34  1105550  11923248   hadsm3dhet2_u4co_006670033_12  172,832   910,429         1.3169
31 Oct 2010 20:44:48  1105550  11923248   hadsm3dhet2_u4co_006670033_12  162,030   897,030         1.3181
31 Oct 2010 08:48:04  1105550  11923248   hadsm3dhet2_u4co_006670033_12  151,228   883,232         1.3188
30 Oct 2010 21:49:06  1105550  11923248   hadsm3dhet2_u4co_006670033_12  140,426   869,728         1.3199
30 Oct 2010 10:01:55  1105550  11923248   hadsm3dhet2_u4co_006670033_12  129,624   855,776         1.3204
29 Oct 2010 06:09:17  1105550  11923248   hadsm3dhet2_u4co_006670033_12  118,822   841,775         1.3208
28 Oct 2010 08:59:38  1105550  11923248   hadsm3dhet2_u4co_006670033_12  108,020   827,822         1.3213
27 Oct 2010 09:23:46  1105550  11923248   hadsm3dhet2_u4co_006670033_12  97,218    813,968         1.3220
26 Oct 2010 23:16:41  1105550  11923248   hadsm3dhet2_u4co_006670033_12  86,416    800,011         1.3225
26 Oct 2010 11:02:44  1105550  11923248   hadsm3dhet2_u4co_006670033_12  75,614    785,983         1.3230
26 Oct 2010 03:38:11  1105550  11923248   hadsm3dhet2_u4co_006670033_12  64,812    772,208         1.3238
25 Oct 2010 17:19:34  1105550  11923248   hadsm3dhet2_u4co_006670033_12  54,010    757,947         1.3239
25 Oct 2010 11:50:43  1105550  11923248   hadsm3dhet2_u4co_006670033_12  43,208    743,673         1.3240
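
The page does not say how the Average (sec/TS) column is derived. As a rough cross-check, the sketch below computes the CPU seconds spent per timestep between consecutive trickles. It assumes the CPU Time column is cumulative CPU seconds for the task. The three rows are copied from the table above; the function name `incremental_sec_per_ts` is hypothetical and not part of BOINC or CPDN.

```python
# (timestep, cumulative CPU time in seconds), newest first,
# copied from the three most recent trickle rows.
# Assumption: the "CPU Time (sec)" column is cumulative for the task.
trickles = [
    (248_446, 1_007_071),
    (237_644, 993_098),
    (226_842, 979_042),
]


def incremental_sec_per_ts(rows):
    """Return CPU seconds per timestep between consecutive trickles.

    Rows must be ordered newest first, as in the table.
    """
    rates = []
    for (ts_new, cpu_new), (ts_old, cpu_old) in zip(rows, rows[1:]):
        rates.append((cpu_new - cpu_old) / (ts_new - ts_old))
    return rates


rates = incremental_sec_per_ts(trickles)
for rate in rates:
    print(f"{rate:.4f} CPU sec per timestep")
```

Under that assumption the per-interval rates land near 1.3 sec/TS, in the same range as the listed averages, and every trickle in the table is 10,802 timesteps after the previous one.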


©2024 climateprediction.net