Task 12732366


Name hadcm3n_o07e_1900_40_007195597_0
Workunit 7393877
Created 28 Mar 2011, 13:56:13 UTC
Sent 3 Apr 2011, 9:11:39 UTC
Report deadline 3 Jul 2011, 16:38:50 UTC
Received 3 May 2011, 6:47:25 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1120434
Run time 12 days 2 hours 18 min 19 sec
CPU time 11 days 17 hours 54 min 25 sec
Validate state Invalid
Credit 6,842.88
Device peak FLOPS 3.09 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:42:36 (5520): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:10:16 (2428): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6056, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6056, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6056, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6056, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6056, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6056, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
02 May 2011 18:18:57 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 570,240 | 1,009,791 | 1.7708
01 May 2011 17:43:39 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 544,320 | 963,183 | 1.7695
01 May 2011 01:05:36 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 518,400 | 917,468 | 1.7698
30 Apr 2011 10:56:04 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 492,480 | 872,143 | 1.7709
26 Apr 2011 20:46:14 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 466,560 | 825,336 | 1.7690
26 Apr 2011 04:24:19 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 440,640 | 780,342 | 1.7709
23 Apr 2011 21:44:07 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 414,720 | 734,907 | 1.7721
23 Apr 2011 02:31:06 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 388,800 | 688,767 | 1.7715
22 Apr 2011 07:18:26 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 362,880 | 641,839 | 1.7687
21 Apr 2011 02:28:38 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 336,960 | 595,502 | 1.7673
20 Apr 2011 18:23:38 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 311,040 | 550,049 | 1.7684
20 Apr 2011 18:23:38 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 285,120 | 503,841 | 1.7671
20 Apr 2011 18:23:38 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 259,200 | 457,054 | 1.7633
20 Apr 2011 18:23:37 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 233,280 | 409,886 | 1.7571
20 Apr 2011 18:23:37 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 207,360 | 362,965 | 1.7504
12 Apr 2011 10:22:10 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 181,440 | 317,569 | 1.7503
11 Apr 2011 13:25:15 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 155,520 | 272,290 | 1.7508
10 Apr 2011 19:35:15 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 129,600 | 227,407 | 1.7547
09 Apr 2011 21:31:17 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 103,680 | 181,761 | 1.7531
09 Apr 2011 03:49:52 | 1120434 | 12732366 | hadcm3n_o07e_1900_40_007195597_0 | 77,760 | 136,626 | 1.7570
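
The Average (sec/TS) column above appears to be the cumulative CPU time divided by the timestep count. This relationship is inferred from the listed numbers and is not stated on the page itself. A minimal Python sketch that checks it against three rows copied from the table (the helper names `to_int` and `average_sec_per_ts` are hypothetical, introduced only for illustration):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
ROWS = [
    ("570,240", "1,009,791", "1.7708"),
    ("544,320", "963,183", "1.7695"),
    ("77,760", "136,626", "1.7570"),
]


def to_int(text):
    """Convert a comma-grouped number such as '1,009,791' to an int."""
    return int(text.replace(",", ""))


def average_sec_per_ts(cpu_time_sec, timestep):
    """Assumed formula: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep


for ts, cpu, reported in ROWS:
    computed = average_sec_per_ts(to_int(cpu), to_int(ts))
    print(f"timestep={ts:>8}  computed={computed:.4f}  reported={reported}")
```

Under that assumption the computed values agree with the reported column to four decimal places for these rows. The listed timesteps also advance by 25,920 between consecutive trickles.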

