climateprediction.net home page
Task 13639702

Task 13639702

Name hadcm3n_t3xn_1940_40_007542845_3
Workunit 7740077
Created 16 Nov 2011, 19:16:28 UTC
Sent 16 Nov 2011, 19:20:43 UTC
Report deadline 16 Feb 2012, 2:47:54 UTC
Received 17 Jan 2012, 15:39:32 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1076220
Run time 10 days 10 hours 15 min 55 sec
CPU time 9 days 23 hours 34 min 12 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 3.54 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2280, iMonCtr=1
Model crash detected, will try to restart...
13:00:45 (6760): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:04:51 (3076): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
18:09:09 (5336): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:26:22 (4700): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
13:33:38 (4216): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
13:33:40 (4216): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
14:56:13 (1100): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3280, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3280, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3280, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3280, iMonCtr=1
Model crash detected, will try to restart...
10:22:19 (3280): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6360, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6360, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
17 Jan 2012 05:12:24 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 777,600 845,296 1.0871
16 Jan 2012 20:45:18 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 751,680 816,557 1.0863
16 Jan 2012 10:50:10 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 725,760 787,826 1.0855
13 Jan 2012 00:51:00 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 699,840 758,804 1.0843
12 Jan 2012 03:06:41 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 673,920 729,365 1.0823
11 Jan 2012 17:22:05 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 648,000 700,227 1.0806
11 Jan 2012 02:22:28 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 622,080 671,335 1.0792
09 Jan 2012 14:39:49 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 596,160 642,161 1.0772
09 Jan 2012 02:16:45 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 570,240 614,676 1.0779
08 Jan 2012 16:16:36 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 544,320 586,308 1.0771
08 Jan 2012 05:09:11 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 518,400 558,812 1.0780
07 Jan 2012 20:52:03 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 492,480 531,397 1.0790
07 Jan 2012 12:27:59 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 466,560 504,249 1.0808
07 Jan 2012 04:26:23 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 440,640 476,814 1.0821
06 Jan 2012 21:04:28 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 414,720 449,633 1.0842
06 Jan 2012 12:39:02 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 388,800 420,930 1.0826
06 Jan 2012 03:16:54 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 362,880 393,561 1.0845
04 Jan 2012 02:49:42 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 336,960 365,477 1.0846
03 Jan 2012 13:17:21 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 311,040 337,144 1.0839
29 Dec 2011 20:45:47 1076220 13639702 hadcm3n_t3xn_1940_40_007542845_3 285,120 309,403 1.0852


©2024 climateprediction.net