climateprediction.net home page
Task 15642085

Task 15642085

Name hadcm3n_4e12_1940_40_008312757_1
Workunit 8463892
Created 27 Feb 2013, 2:15:39 UTC
Sent 27 Feb 2013, 2:16:03 UTC
Report deadline 29 May 2013, 9:43:14 UTC
Received 2 Apr 2013, 16:22:43 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1073096
Run time 32 days 22 hours 2 min 41 sec
CPU time 20 days 5 hours 30 min 39 sec
Validate state Invalid
Credit 5,287.68
Device peak FLOPS 1.90 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4504, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:00:04 (5196): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3848, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3848, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4200, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4200, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4200, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4200, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2052, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9908, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8936, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5236, iMonCtr=1
Model crash detected, will try to restart...
17:45:51 (4612): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8024, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5060, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5060, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5060, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5060, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5060, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5060, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
31 Mar 2013 23:05:50 1073096 15642085 hadcm3n_4e12_1940_40_008312757_1 440,640 1,670,506 3.7911
29 Mar 2013 23:01:01 1073096 15642085 hadcm3n_4e12_1940_40_008312757_1 414,720 1,571,668 3.7897
28 Mar 2013 03:16:50 1073096 15642085 hadcm3n_4e12_1940_40_008312757_1 388,800 1,471,169 3.7839
26 Mar 2013 12:42:22 1073096 15642085 hadcm3n_4e12_1940_40_008312757_1 362,880 1,370,120 3.7757
25 Mar 2013 01:20:24 1073096 15642085 hadcm3n_4e12_1940_40_008312757_1 336,960 1,274,456 3.7822
21 Mar 2013 19:11:42 1073096 15642085 hadcm3n_4e12_1940_40_008312757_1 311,040 1,190,410 3.8272
19 Mar 2013 01:22:15 1073096 15642085 hadcm3n_4e12_1940_40_008312757_1 285,120 1,105,806 3.8784
16 Mar 2013 13:55:22 1073096 15642085 hadcm3n_4e12_1940_40_008312757_1 259,200 1,016,447 3.9215
15 Mar 2013 06:32:52 1073096 15642085 hadcm3n_4e12_1940_40_008312757_1 233,280 917,047 3.9311
13 Mar 2013 10:40:25 1073096 15642085 hadcm3n_4e12_1940_40_008312757_1 207,360 814,723 3.9290
11 Mar 2013 03:10:45 1073096 15642085 hadcm3n_4e12_1940_40_008312757_1 181,440 697,901 3.8465
09 Mar 2013 12:13:03 1073096 15642085 hadcm3n_4e12_1940_40_008312757_1 155,520 592,258 3.8082
08 Mar 2013 00:18:35 1073096 15642085 hadcm3n_4e12_1940_40_008312757_1 129,600 488,778 3.7714
06 Mar 2013 07:30:31 1073096 15642085 hadcm3n_4e12_1940_40_008312757_1 103,680 385,226 3.7155
04 Mar 2013 19:38:46 1073096 15642085 hadcm3n_4e12_1940_40_008312757_1 77,760 292,722 3.7644
02 Mar 2013 17:46:35 1073096 15642085 hadcm3n_4e12_1940_40_008312757_1 51,840 201,983 3.8963
28 Feb 2013 16:55:46 1073096 15642085 hadcm3n_4e12_1940_40_008312757_1 25,920 110,049 4.2457


©2024 climateprediction.net