climateprediction.net home page
Task 10931698

Name hadsm3dhet2_u71b_006587239_6
Workunit 6790613
Created 12 Mar 2010, 9:05:17 UTC
Sent 13 Mar 2010, 1:23:37 UTC
Report deadline 23 Feb 2011, 6:43:37 UTC
Received 11 Nov 2010, 18:05:49 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 888167
Run time
CPU time 32 days 16 hours 44 min 55 sec
Validate state Invalid
Credit 4,168.22
Device peak FLOPS 0.00 GFLOPS
Application version ---
Stderr
<core_client_version>6.2.19</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7664, iMonCtr=1
Model crash detected, will try to restart...
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
MainError:	08:47:27 PM	No files match the supplied pattern.
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9572, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6284, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4920, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9028, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9312, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6180, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5364, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1616, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7028, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7508, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=812, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3104, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6796, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=804, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1040, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1040, iMonCtr=1
Model crash detected, will try to restart...

</stderr_txt>
]]>
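The stderr text above is long and repetitive, so a tally by message type is easier to read than the raw log. The following is a minimal sketch, not part of BOINC or CPDN: the function name `summarize_stderr`, the labels in `PATTERNS`, and the short `sample` string are all hypothetical, and only the matched substrings are copied from the log.

```python
from collections import Counter

# Hypothetical labels mapped to substrings copied from the stderr text.
PATTERNS = {
    "heartbeat_lost": "No heartbeat from core client",
    "monitor_heartbeat": "CPDN Monitor - No 'heartbeat' from BOINC",
    "process_not_running": "CPDN process is not running",
    "restart_attempt": "Model crash detected, will try to restart",
    "no_files_match": "No files match the supplied pattern",
}


def summarize_stderr(text: str) -> Counter:
    """Count how many lines contain each known message substring."""
    counts = Counter()
    for line in text.splitlines():
        for label, needle in PATTERNS.items():
            if needle in line:
                counts[label] += 1
                break  # each line is counted under one label only
    return counts


# A short excerpt in the same shape as the log, for illustration only.
sample = """No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7664, iMonCtr=1
Model crash detected, will try to restart...
MainError: 08:47:27 PM No files match the supplied pattern.
MainError: 08:47:27 PM No files match the supplied pattern.
"""

for label, n in summarize_stderr(sample).items():
    print(f"{label}: {n}")
```

Substring matching is used instead of exact line comparison so that lines with stray leading characters or differing process IDs are still counted.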
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                   Timestep  CPU Time (sec)  Average (sec/TS)
13 Oct 2010 18:13:52  888167   10931698   hadsm3dhet2_u71b_006587239_6  194,436   2,436,684       5.3709
17 Jul 2010 03:40:51  888167   10931698   hadsm3dhet2_u71b_006587239_6  183,634   1,731,295       3.9092
10 May 2010 01:36:46  888167   10931698   hadsm3dhet2_u71b_006587239_6  172,832   1,027,212       2.3774
13 Apr 2010 18:55:25  888167   10931698   hadsm3dhet2_u71b_006587239_6  162,030   564,202         1.3393
12 Apr 2010 23:40:04  888167   10931698   hadsm3dhet2_u71b_006587239_6  151,228   549,753         1.3393
12 Apr 2010 19:11:20  888167   10931698   hadsm3dhet2_u71b_006587239_6  140,426   535,367         1.3395
12 Apr 2010 01:12:07  888167   10931698   hadsm3dhet2_u71b_006587239_6  129,624   520,874         1.3394
11 Apr 2010 20:42:14  888167   10931698   hadsm3dhet2_u71b_006587239_6  118,822   506,425         1.3395
10 Apr 2010 23:22:15  888167   10931698   hadsm3dhet2_u71b_006587239_6  108,020   491,882         1.3393
05 Apr 2010 22:17:46  888167   10931698   hadsm3dhet2_u71b_006587239_6  97,218    477,282         1.3389
05 Apr 2010 17:46:34  888167   10931698   hadsm3dhet2_u71b_006587239_6  86,416    462,835         1.3390
04 Apr 2010 01:18:16  888167   10931698   hadsm3dhet2_u71b_006587239_6  75,614    448,455         1.3392
03 Apr 2010 20:29:34  888167   10931698   hadsm3dhet2_u71b_006587239_6  64,812    434,231         1.3400
03 Apr 2010 15:56:15  888167   10931698   hadsm3dhet2_u71b_006587239_6  54,010    419,633         1.3396
01 Apr 2010 23:23:02  888167   10931698   hadsm3dhet2_u71b_006587239_6  43,208    405,099         1.3394
01 Apr 2010 18:50:49  888167   10931698   hadsm3dhet2_u71b_006587239_6  32,406    390,564         1.3391
31 Mar 2010 22:44:26  888167   10931698   hadsm3dhet2_u71b_006587239_6  21,604    376,028         1.3389
30 Mar 2010 01:54:01  888167   10931698   hadsm3dhet2_u71b_006587239_6  10,802    361,712         1.3394
29 Mar 2010 20:47:48  888167   10931698   hadsm3dhet2_u71b_006587239_6  259,248   347,134         1.3390
28 Mar 2010 01:41:33  888167   10931698   hadsm3dhet2_u71b_006587239_6  248,446   332,736         1.3393
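
The Average (sec/TS) values in the trickle table look consistent with total CPU time divided by cumulative timesteps, if one assumes the Timestep counter restarts after reaching 259,248 (the value in the 29 Mar 2010 row). That reading is an inference from the numbers, not something the page states. The sketch below checks it against four rows; `PHASE_TIMESTEPS` and `average_sec_per_ts` are hypothetical names introduced for illustration.

```python
# Assumption: the Timestep counter resets after 259,248 steps, so rows sent
# after 29 Mar 2010 have already completed one full pass of that length.
PHASE_TIMESTEPS = 259_248


def average_sec_per_ts(cpu_sec: int, timestep: int, completed_phases: int) -> float:
    """CPU seconds per timestep, counting timesteps from earlier phases."""
    total_timesteps = completed_phases * PHASE_TIMESTEPS + timestep
    return cpu_sec / total_timesteps


# (timestep, cpu_sec, completed_phases, listed_average) copied from the table.
rows = [
    (248_446, 332_736, 0, 1.3393),    # 28 Mar 2010
    (259_248, 347_134, 0, 1.3390),    # 29 Mar 2010
    (10_802, 361_712, 1, 1.3394),     # 30 Mar 2010
    (194_436, 2_436_684, 1, 5.3709),  # 13 Oct 2010
]

for timestep, cpu_sec, phases, listed in rows:
    computed = average_sec_per_ts(cpu_sec, timestep, phases)
    print(f"timestep={timestep:>7} computed={computed:.4f} listed={listed:.4f}")
```

Under this assumption, the rise from about 1.34 to 5.37 sec/TS between April and October 2010 reflects CPU time accumulating much faster than timesteps over that period.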

