climateprediction.net home page
Task 10984612

Task 10984612

Name hadsm3dhet2_jmlq_006592288_5
Workunit 6795661
Created 15 Mar 2010, 11:57:15 UTC
Sent 15 Oct 2010, 17:46:47 UTC
Report deadline 27 Sep 2011, 23:06:47 UTC
Received 9 Apr 2011, 9:01:55 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1107328
Run time 19 days 7 hours 47 min 48 sec
CPU time 18 days 2 hours 58 min 46 sec
Validate state Invalid
Credit 5,656.87
Device peak FLOPS 1.64 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6084, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
MainError:	08:09:28 AM	No files match the supplied pattern.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4744, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5144, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=664, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1532, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5836, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6412, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5512, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CCPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6736, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CCPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5140, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4504, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5292, iMonCtr=1
Model crash detected, will try to restart...
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
MainError:	07:27:48 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4876, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5424, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4120, iMonCtr=1
Model crash detected, will try to restart...
CCPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5252, iMonCtr=1
Model crash detected, will try to restart...
CCPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6028, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4504, iMonCtr=1
Model crash detected, will try to restart...
CSuspended CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4764, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4480, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=628, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=628, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=628, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=628, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=628, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=628, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jmlq_006592288/dataout/restart.day

Model crashed: (null)
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jmlq_006592288/dataout/restart.day

Model crashed: (null)
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jmlq_006592288/dataout/restart.day

Model crashed: (null)
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jmlq_006592288/dataout/restart.day

Model crashed: (null)
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jmlq_006592288/dataout/restart.day

Model crashed: (null)
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jmlq_006592288/dataout/restart.day

Model crashed: (null)
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
07 Apr 2011 20:42:35 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 97,218 1,539,522 2.5004
06 Apr 2011 20:22:26 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 86,416 1,510,398 2.4969
05 Apr 2011 06:18:34 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 75,614 1,482,378 2.4951
04 Apr 2011 08:02:36 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 64,812 1,448,821 2.4838
02 Apr 2011 11:06:37 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 54,010 1,422,587 2.4848
01 Apr 2011 17:49:34 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 43,208 1,391,614 2.4775
31 Mar 2011 20:00:03 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 32,406 1,361,391 2.4712
30 Mar 2011 22:50:02 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 21,604 1,333,001 2.4681
29 Mar 2011 20:27:46 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 10,802 1,302,252 2.4603
28 Mar 2011 19:29:45 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 259,248 1,267,409 2.4444
27 Mar 2011 20:57:36 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 248,446 1,233,014 2.4287
26 Mar 2011 22:42:01 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 237,644 1,205,680 2.4264
25 Mar 2011 19:30:32 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 226,842 1,179,123 2.4257
24 Mar 2011 07:50:21 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 216,040 1,153,840 2.4277
23 Mar 2011 18:16:18 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 205,238 1,128,675 2.4299
23 Mar 2011 04:16:04 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 194,436 1,098,090 2.4204
22 Mar 2011 13:54:40 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 183,634 1,071,077 2.4184
21 Mar 2011 16:12:56 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 172,832 1,040,185 2.4074
20 Mar 2011 21:21:25 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 162,030 1,009,443 2.3961
17 Mar 2011 16:03:21 1107328 10984612 hadsm3dhet2_jmlq_006592288_5 151,228 980,254 2.3881


©2024 climateprediction.net