climateprediction.net home page
Task 11058425

Task 11058425

Name hadsm3dhet2_jsar_006599669_7
Workunit 6803042
Created 15 Mar 2010, 12:06:48 UTC
Sent 21 Jun 2010, 20:51:49 UTC
Report deadline 4 Jun 2011, 2:11:49 UTC
Received 14 Nov 2010, 14:46:19 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 866996
Run time 10 days 0 hours 11 min 57 sec
CPU time 8 days 22 hours 8 min 13 sec
Validate state Invalid
Credit 2,878.06
Device peak FLOPS 1.74 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.6.36</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5296, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=5320, selfPID=5320, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3352, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5216, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4960, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CMainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
MainError:	02:33:22 AM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Sforrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
11 Nov 2010 13:01:37 866996 11058425 hadsm3dhet2_jsar_006599669_7 54,010 761,853 2.4320
09 Nov 2010 18:48:48 866996 11058425 hadsm3dhet2_jsar_006599669_7 43,208 737,004 2.4367
08 Nov 2010 05:27:10 866996 11058425 hadsm3dhet2_jsar_006599669_7 32,406 707,526 2.4259
07 Nov 2010 21:04:39 866996 11058425 hadsm3dhet2_jsar_006599669_7 21,604 683,147 2.4324
07 Nov 2010 12:57:41 866996 11058425 hadsm3dhet2_jsar_006599669_7 10,802 658,099 2.4370
05 Nov 2010 04:37:06 866996 11058425 hadsm3dhet2_jsar_006599669_7 259,248 632,452 2.4396
04 Nov 2010 18:21:29 866996 11058425 hadsm3dhet2_jsar_006599669_7 248,446 607,091 2.4436
31 Oct 2010 09:24:42 866996 11058425 hadsm3dhet2_jsar_006599669_7 237,644 580,533 2.4429
15 Oct 2010 13:58:01 866996 11058425 hadsm3dhet2_jsar_006599669_7 226,842 553,780 2.4413
13 Oct 2010 17:53:03 866996 11058425 hadsm3dhet2_jsar_006599669_7 216,040 528,609 2.4468
26 Sep 2010 07:03:38 866996 11058425 hadsm3dhet2_jsar_006599669_7 205,238 503,563 2.4536
19 Sep 2010 13:28:35 866996 11058425 hadsm3dhet2_jsar_006599669_7 194,436 477,543 2.4560
15 Sep 2010 15:17:04 866996 11058425 hadsm3dhet2_jsar_006599669_7 183,634 449,885 2.4499
10 Sep 2010 09:23:44 866996 11058425 hadsm3dhet2_jsar_006599669_7 172,832 419,671 2.4282
05 Sep 2010 10:43:55 866996 11058425 hadsm3dhet2_jsar_006599669_7 162,030 392,083 2.4198
01 Sep 2010 13:31:32 866996 11058425 hadsm3dhet2_jsar_006599669_7 151,228 363,335 2.4026
29 Aug 2010 18:46:44 866996 11058425 hadsm3dhet2_jsar_006599669_7 140,426 335,932 2.3922
21 Aug 2010 19:00:38 866996 11058425 hadsm3dhet2_jsar_006599669_7 129,624 308,431 2.3794
21 Aug 2010 11:39:38 866996 11058425 hadsm3dhet2_jsar_006599669_7 118,822 284,734 2.3963
13 Aug 2010 15:27:13 866996 11058425 hadsm3dhet2_jsar_006599669_7 108,020 253,158 2.3436


©2024 climateprediction.net