climateprediction.net home page
Task 13498798

Task 13498798

Name hadcm3n_yfcm_1900_40_007352752_2
Workunit 7550182
Created 15 Oct 2011, 3:40:31 UTC
Sent 15 Oct 2011, 3:40:42 UTC
Report deadline 14 Jan 2012, 11:07:53 UTC
Received 31 Oct 2011, 19:54:34 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1060690
Run time 15 days 16 hours 49 min 55 sec
CPU time 14 days 15 hours 59 min 7 sec
Validate state Invalid
Credit 8,398.08
Device peak FLOPS 2.82 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
23:33:00 (1488): No heartbeat from core client for 30 sec - exiting
23:33:01 (1488): No heartbeat from core client for 30 sec - exiting
23:33:02 (1488): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2680, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
23:54:57 (5780): No heartbeat from core client for 30 sec - exiting
23:54:58 (5780): No heartbeat from core client for 30 sec - exiting
23:54:59 (5780): No heartbeat from core client for 30 sec - exiting
23:55:00 (5780): No heartbeat from core client for 30 sec - exiting
23:55:01 (5780): No heartbeat from core client for 30 sec - exiting
23:55:02 (5780): No heartbeat from core client for 30 sec - exiting
23:55:03 (5780): No heartbeat from core client for 30 sec - exiting
23:55:04 (5780): No heartbeat from core client for 30 sec - exiting
23:55:05 (5780): No heartbeat from core client for 30 sec - exiting
23:55:06 (5780): No heartbeat from core client for 30 sec - exiting
23:55:07 (5780): No heartbeat from core client for 30 sec - exiting
23:55:08 (5780): No heartbeat from core client for 30 sec - exiting
23:55:09 (5780): No heartbeat from core client for 30 sec - exiting
23:55:10 (5780): No heartbeat from core client for 30 sec - exiting
23:55:11 (5780): No heartbeat from core client for 30 sec - exiting
23:55:12 (5780): No heartbeat from core client for 30 sec - exiting
23:55:13 (5780): No heartbeat from core client for 30 sec - exiting
23:55:14 (5780): No heartbeat from core client for 30 sec - exiting
23:55:15 (5780): No heartbeat from core client for 30 sec - exiting
23:55:16 (5780): No heartbeat from core client for 30 sec - exiting
23:55:17 (5780): No heartbeat from core client for 30 sec - exiting
23:55:18 (5780): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9276, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9276, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9276, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9276, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9276, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
31 Oct 2011 19:35:45 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 699,840 1,231,168 1.7592
31 Oct 2011 19:17:18 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 673,920 1,186,005 1.7599
31 Oct 2011 18:54:23 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 648,000 1,140,734 1.7604
31 Oct 2011 18:36:01 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 622,080 1,095,953 1.7618
31 Oct 2011 18:19:24 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 596,160 1,049,906 1.7611
31 Oct 2011 17:35:10 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 570,240 1,002,715 1.7584
31 Oct 2011 17:17:57 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 544,320 957,418 1.7589
31 Oct 2011 16:55:44 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 518,400 910,731 1.7568
31 Oct 2011 16:38:54 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 492,480 865,856 1.7582
31 Oct 2011 15:37:58 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 466,560 820,687 1.7590
31 Oct 2011 14:56:09 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 440,640 776,663 1.7626
31 Oct 2011 13:32:57 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 414,720 731,382 1.7636
31 Oct 2011 13:32:57 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 388,800 688,096 1.7698
31 Oct 2011 13:32:57 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 362,880 643,456 1.7732
31 Oct 2011 13:32:57 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 336,960 599,406 1.7789
31 Oct 2011 13:32:57 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 311,040 554,609 1.7831
31 Oct 2011 13:32:57 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 285,120 509,577 1.7872
31 Oct 2011 13:32:57 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 259,200 465,145 1.7945
31 Oct 2011 13:32:57 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 233,280 419,841 1.7997
31 Oct 2011 13:32:57 1060690 13498798 hadcm3n_yfcm_1900_40_007352752_2 207,360 375,417 1.8105


©2024 climateprediction.net