climateprediction.net home page
Task 16152353

Task 16152353

Name hadcm3n_851d_1980_40_008464485_1
Workunit 8615324
Created 21 Dec 2013, 3:29:10 UTC
Sent 21 Dec 2013, 3:29:15 UTC
Report deadline 22 Mar 2014, 10:56:26 UTC
Received 24 Dec 2013, 5:39:00 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1307334
Run time 2 days 23 hours 22 min 11 sec
CPU time 2 days 22 hours 48 min 49 sec
Validate state Invalid
Credit 1,866.24
Device peak FLOPS 2.80 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.2.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4524, iMonCtr=1
Model crash detected, will try to restart...
16:51:37 (3392): No heartbeat from core client for 30 sec - exiting
16:51:39 (3392): No heartbeat from core client for 30 sec - exiting
16:51:40 (3392): No heartbeat from core client for 30 sec - exiting
16:51:41 (3392): No heartbeat from core client for 30 sec - exiting
16:51:42 (3392): No heartbeat from core client for 30 sec - exiting
16:51:43 (3392): No heartbeat from core client for 30 sec - exiting
16:51:44 (3392): No heartbeat from core client for 30 sec - exiting
16:51:45 (3392): No heartbeat from core client for 30 sec - exiting
16:51:46 (3392): No heartbeat from core client for 30 sec - exiting
16:51:47 (3392): No heartbeat from core client for 30 sec - exiting
16:51:48 (3392): No heartbeat from core client for 30 sec - exiting
16:51:50 (3392): No heartbeat from core client for 30 sec - exiting
16:51:51 (3392): No heartbeat from core client for 30 sec - exiting
16:51:52 (3392): No heartbeat from core client for 30 sec - exiting
16:51:53 (3392): No heartbeat from core client for 30 sec - exiting
16:51:54 (3392): No heartbeat from core client for 30 sec - exiting
16:51:55 (3392): No heartbeat from core client for 30 sec - exiting
16:51:56 (3392): No heartbeat from core client for 30 sec - exiting
16:51:57 (3392): No heartbeat from core client for 30 sec - exiting
16:51:58 (3392): No heartbeat from core client for 30 sec - exiting
16:51:59 (3392): No heartbeat from core client for 30 sec - exiting
16:52:00 (3392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:52:02 (3392): No heartbeat from core client for 30 sec - exiting
16:52:40 (4676): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:53:32 (4872): No heartbeat from core client for 30 sec - exiting
16:53:33 (4872): No heartbeat from core client for 30 sec - exiting
16:53:34 (4872): No heartbeat from core client for 30 sec - exiting
16:53:35 (4872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:53:36 (4872): No heartbeat from core client for 30 sec - exiting
16:53:37 (4872): No heartbeat from core client for 30 sec - exiting
16:53:39 (4872): No heartbeat from core client for 30 sec - exiting
16:53:40 (4872): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4884, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4884, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
17:15:19 (4588): No heartbeat from core client for 30 sec - exiting
17:15:20 (4588): No heartbeat from core client for 30 sec - exiting
17:15:21 (4588): No heartbeat from core client for 30 sec - exiting
17:15:22 (4588): No heartbeat from core client for 30 sec - exiting
17:15:23 (4588): No heartbeat from core client for 30 sec - exiting
17:15:24 (4588): No heartbeat from core client for 30 sec - exiting
17:15:26 (4588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
23:23:02 (4580): No heartbeat from core client for 30 sec - exiting
23:23:03 (4580): No heartbeat from core client for 30 sec - exiting
23:23:04 (4580): No heartbeat from core client for 30 sec - exiting
23:23:05 (4580): No heartbeat from core client for 30 sec - exiting
23:23:06 (4580): No heartbeat from core client for 30 sec - exiting
23:23:07 (4580): No heartbeat from core client for 30 sec - exiting
23:23:09 (4580): No heartbeat from core client for 30 sec - exiting
23:23:10 (4580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    
23:31:51 (4280): No heartbeat from core client for 30 sec - exiting
23:31:53 (4280): No heartbeat from core client for 30 sec - exiting
23:31:54 (4280): No heartbeat from core client for 30 sec - exiting
23:31:55 (4280): No heartbeat from core client for 30 sec - exiting
23:31:56 (4280): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED.                                                                                                                                                                                                                     tmp/pipe_dummy                                                                  2048    
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1060, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1060, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1060, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1060, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 Dec 2013 22:05:57 1307334 16152353 hadcm3n_851d_1980_40_008464485_1 155,520 239,755 1.5416
23 Dec 2013 10:56:43 1307334 16152353 hadcm3n_851d_1980_40_008464485_1 129,600 199,559 1.5398
22 Dec 2013 23:53:48 1307334 16152353 hadcm3n_851d_1980_40_008464485_1 103,680 159,632 1.5397
22 Dec 2013 13:45:52 1307334 16152353 hadcm3n_851d_1980_40_008464485_1 77,760 119,579 1.5378
22 Dec 2013 02:42:56 1307334 16152353 hadcm3n_851d_1980_40_008464485_1 51,840 79,814 1.5396
21 Dec 2013 15:39:00 1307334 16152353 hadcm3n_851d_1980_40_008464485_1 25,920 40,339 1.5563


©2024 climateprediction.net