climateprediction.net home page
Task 16055472

Task 16055472

Name hadcm3n_oevw_1900_40_008474415_2
Workunit 8625254
Created 3 Oct 2013, 18:20:17 UTC
Sent 3 Oct 2013, 18:41:35 UTC
Report deadline 3 Jan 2014, 2:08:46 UTC
Received 2 Nov 2013, 7:12:05 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1295275
Run time 4 days 5 hours 21 min 14 sec
CPU time 3 days 13 hours 45 min 42 sec
Validate state Invalid
Credit 2,177.28
Device peak FLOPS 3.36 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
Het apparaat herkent de opdracht niet.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5652, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5004, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1396, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4624, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4808, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4816, iMonCtr=1
Model crash detected, will try to restart...
08:04:05 (4888): No heartbeat from core client for 30 sec - exiting
08:04:06 (4888): No heartbeat from core client for 30 sec - exiting
08:04:07 (4888): No heartbeat from core client for 30 sec - exiting
08:04:08 (4888): No heartbeat from core client for 30 sec - exiting
08:04:09 (4888): No heartbeat from core client for 30 sec - exiting
08:04:10 (4888): No heartbeat from core client for 30 sec - exiting
08:04:11 (4888): No heartbeat from core client for 30 sec - exiting
08:04:13 (4888): No heartbeat from core client for 30 sec - exiting
08:04:14 (4888): No heartbeat from core client for 30 sec - exiting
08:04:15 (4888): No heartbeat from core client for 30 sec - exiting
08:04:16 (4888): No heartbeat from core client for 30 sec - exiting
08:04:17 (4888): No heartbeat from core client for 30 sec - exiting
08:04:18 (4888): No heartbeat from core client for 30 sec - exiting
08:04:19 (4888): No heartbeat from core client for 30 sec - exiting
08:04:20 (4888): No heartbeat from core client for 30 sec - exiting
08:04:21 (4888): No heartbeat from core client for 30 sec - exiting
08:04:22 (4888): No heartbeat from core client for 30 sec - exiting
08:04:23 (4888): No heartbeat from core client for 30 sec - exiting
08:04:25 (4888): No heartbeat from core client for 30 sec - exiting
08:04:26 (4888): No heartbeat from core client for 30 sec - exiting
08:04:27 (4888): No heartbeat from core client for 30 sec - exiting
08:04:28 (4888): No heartbeat from core client for 30 sec - exiting
08:04:29 (4888): No heartbeat from core client for 30 sec - exiting
08:04:30 (4888): No heartbeat from core client for 30 sec - exiting
08:04:31 (4888): No heartbeat from core client for 30 sec - exiting
08:04:32 (4888): No heartbeat from core client for 30 sec - exiting
08:04:33 (4888): No heartbeat from core client for 30 sec - exiting
08:04:34 (4888): No heartbeat from core client for 30 sec - exiting
08:04:35 (4888): No heartbeat from core client for 30 sec - exiting
08:04:37 (4888): No heartbeat from core client for 30 sec - exiting
08:04:38 (4888): No heartbeat from core client for 30 sec - exiting
08:04:39 (4888): No heartbeat from core client for 30 sec - exiting
08:04:40 (4888): No heartbeat from core client for 30 sec - exiting
08:04:41 (4888): No heartbeat from core client for 30 sec - exiting
08:04:42 (4888): No heartbeat from core client for 30 sec - exiting
08:04:43 (4888): No heartbeat from core client for 30 sec - exiting
08:04:44 (4888): No heartbeat from core client for 30 sec - exiting
08:04:45 (4888): No heartbeat from core client for 30 sec - exiting
08:04:46 (4888): No heartbeat from core client for 30 sec - exiting
08:04:48 (4888): No heartbeat from core client for 30 sec - exiting
08:04:49 (4888): No heartbeat from core client for 30 sec - exiting
08:04:50 (4888): No heartbeat from core client for 30 sec - exiting
08:04:51 (4888): No heartbeat from core client for 30 sec - exiting
08:04:52 (4888): No heartbeat from core client for 30 sec - exiting
08:04:53 (4888): No heartbeat from core client for 30 sec - exiting
08:04:54 (4888): No heartbeat from core client for 30 sec - exiting
08:04:55 (4888): No heartbeat from core client for 30 sec - exiting
08:04:56 (4888): No heartbeat from core client for 30 sec - exiting
08:04:57 (4888): No heartbeat from core client for 30 sec - exiting
08:04:58 (4888): No heartbeat from core client for 30 sec - exiting
08:05:00 (4888): No heartbeat from core client for 30 sec - exiting
08:05:01 (4888): No heartbeat from core client for 30 sec - exiting
08:05:02 (4888): No heartbeat from core client for 30 sec - exiting
08:05:03 (4888): No heartbeat from core client for 30 sec - exiting
08:05:04 (4888): No heartbeat from core client for 30 sec - exiting
08:05:05 (4888): No heartbeat from core client for 30 sec - exiting
08:05:06 (4888): No heartbeat from core client for 30 sec - exiting
08:05:07 (4888): No heartbeat from core client for 30 sec - exiting
08:05:08 (4888): No heartbeat from core client for 30 sec - exiting
08:05:09 (4888): No heartbeat from core client for 30 sec - exiting
08:05:10 (4888): No heartbeat from core client for 30 sec - exiting
08:05:12 (4888): No heartbeat from core client for 30 sec - exiting
08:05:13 (4888): No heartbeat from core client for 30 sec - exiting
08:05:14 (4888): No heartbeat from core client for 30 sec - exiting
08:05:15 (4888): No heartbeat from core client for 30 sec - exiting
08:05:16 (4888): No heartbeat from core client for 30 sec - exiting
08:05:17 (4888): No heartbeat from core client for 30 sec - exiting
08:05:18 (4888): No heartbeat from core client for 30 sec - exiting
08:05:19 (4888): No heartbeat from core client for 30 sec - exiting
08:05:20 (4888): No heartbeat from core client for 30 sec - exiting
08:05:21 (4888): No heartbeat from core client for 30 sec - exiting
08:05:22 (4888): No heartbeat from core client for 30 sec - exiting
08:05:23 (4888): No heartbeat from core client for 30 sec - exiting
08:05:24 (4888): No heartbeat from core client for 30 sec - exiting
08:05:25 (4888): No heartbeat from core client for 30 sec - exiting
08:05:26 (4888): No heartbeat from core client for 30 sec - exiting
08:05:27 (4888): No heartbeat from core client for 30 sec - exiting
08:05:28 (4888): No heartbeat from core client for 30 sec - exiting
08:05:29 (4888): No heartbeat from core client for 30 sec - exiting
08:05:30 (4888): No heartbeat from core client for 30 sec - exiting
08:05:31 (4888): No heartbeat from core client for 30 sec - exiting
08:05:32 (4888): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:05:33 (4888): No heartbeat from core client for 30 sec - exiting
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3528, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
01 Nov 2013 10:27:33 1295275 16055472 hadcm3n_oevw_1900_40_008474415_2 181,440 292,999 1.6149
30 Oct 2013 19:03:24 1295275 16055472 hadcm3n_oevw_1900_40_008474415_2 155,520 249,952 1.6072
27 Oct 2013 15:10:17 1295275 16055472 hadcm3n_oevw_1900_40_008474415_2 129,600 208,133 1.6060
27 Oct 2013 02:05:47 1295275 16055472 hadcm3n_oevw_1900_40_008474415_2 103,680 166,741 1.6082
26 Oct 2013 13:49:39 1295275 16055472 hadcm3n_oevw_1900_40_008474415_2 77,760 124,992 1.6074
25 Oct 2013 06:17:01 1295275 16055472 hadcm3n_oevw_1900_40_008474415_2 51,840 83,245 1.6058
21 Oct 2013 14:22:20 1295275 16055472 hadcm3n_oevw_1900_40_008474415_2 25,920 41,597 1.6048


©2024 climateprediction.net