Task 15939597

Name hadcm3n_ziy9_1960_40_008361916_2
Workunit 8512775
Created 25 Aug 2013, 10:49:20 UTC
Sent 25 Aug 2013, 10:50:04 UTC
Report deadline 24 Nov 2013, 18:17:15 UTC
Received 30 Sep 2013, 13:30:50 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1276039
Run time 18 days 4 hours 49 min 21 sec
CPU time 14 days 18 hours 55 min 33 sec
Validate state Invalid
Credit 4,976.64
Device peak FLOPS 1.35 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3668, iMonCtr=1
Model crash detected, will try to restart...
08:46:37 (4228): No heartbeat from core client for 30 sec - exiting
08:46:38 (4228): No heartbeat from core client for 30 sec - exiting
08:46:39 (4228): No heartbeat from core client for 30 sec - exiting
08:46:40 (4228): No heartbeat from core client for 30 sec - exiting
08:46:41 (4228): No heartbeat from core client for 30 sec - exiting
08:46:42 (4228): No heartbeat from core client for 30 sec - exiting
08:46:43 (4228): No heartbeat from core client for 30 sec - exiting
08:46:44 (4228): No heartbeat from core client for 30 sec - exiting
08:46:45 (4228): No heartbeat from core client for 30 sec - exiting
08:46:46 (4228): No heartbeat from core client for 30 sec - exiting
08:46:47 (4228): No heartbeat from core client for 30 sec - exiting
08:46:49 (4228): No heartbeat from core client for 30 sec - exiting
08:46:50 (4228): No heartbeat from core client for 30 sec - exiting
08:46:51 (4228): No heartbeat from core client for 30 sec - exiting
08:46:52 (4228): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3996, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2176, iMonCtr=1
Model crash detected, will try to restart...
08:56:50 (3920): No heartbeat from core client for 30 sec - exiting
08:56:51 (3920): No heartbeat from core client for 30 sec - exiting
08:56:53 (3920): No heartbeat from core client for 30 sec - exiting
08:56:54 (3920): No heartbeat from core client for 30 sec - exiting
08:56:55 (3920): No heartbeat from core client for 30 sec - exiting
08:56:59 (3920): No heartbeat from core client for 30 sec - exiting
08:57:00 (3920): No heartbeat from core client for 30 sec - exiting
08:57:01 (3920): No heartbeat from core client for 30 sec - exiting
08:57:06 (3920): No heartbeat from core client for 30 sec - exiting
08:57:07 (3920): No heartbeat from core client for 30 sec - exiting
08:57:08 (3920): No heartbeat from core client for 30 sec - exiting
08:57:09 (3920): No heartbeat from core client for 30 sec - exiting
08:57:10 (3920): No heartbeat from core client for 30 sec - exiting
08:57:12 (3920): No heartbeat from core client for 30 sec - exiting
08:57:13 (3920): No heartbeat from core client for 30 sec - exiting
08:57:14 (3920): No heartbeat from core client for 30 sec - exiting
08:57:15 (3920): No heartbeat from core client for 30 sec - exiting
08:57:16 (3920): No heartbeat from core client for 30 sec - exiting
08:57:17 (3920): No heartbeat from core client for 30 sec - exiting
08:57:18 (3920): No heartbeat from core client for 30 sec - exiting
08:57:19 (3920): No heartbeat from core client for 30 sec - exiting
08:57:20 (3920): No heartbeat from core client for 30 sec - exiting
08:57:22 (3920): No heartbeat from core client for 30 sec - exiting
08:57:24 (3920): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:57:25 (3920): No heartbeat from core client for 30 sec - exiting
08:57:27 (3920): No heartbeat from core client for 30 sec - exiting
10:51:15 (3840): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3952, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3104, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3952, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3876, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3556, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2180, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4056, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3728, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3920, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4920, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5644, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3356, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5908, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4012, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3896, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
20:35:46 (640): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5868, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3820, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3336, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2988, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3648, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4848, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITHEAD: I/O error  tmp/pipe_dummy  2048
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA  tmp/pipe_dummy  2048
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA  tmp/pipe_dummy  2048
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA  tmp/pipe_dummy  2048
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA  tmp/pipe_dummy  2048
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA  tmp/pipe_dummy  2048
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
29 Sep 2013 19:59:24  1276039  15939597   hadcm3n_ziy9_1960_40_008361916_2   414,720       1,272,841            3.0692
28 Sep 2013 08:49:45  1276039  15939597   hadcm3n_ziy9_1960_40_008361916_2   388,800       1,193,284            3.0691
26 Sep 2013 13:59:49  1276039  15939597   hadcm3n_ziy9_1960_40_008361916_2   362,880       1,113,765            3.0692
23 Sep 2013 16:04:54  1276039  15939597   hadcm3n_ziy9_1960_40_008361916_2   336,960       1,034,161            3.0691
20 Sep 2013 11:09:57  1276039  15939597   hadcm3n_ziy9_1960_40_008361916_2   311,040         954,372            3.0683
18 Sep 2013 15:13:51  1276039  15939597   hadcm3n_ziy9_1960_40_008361916_2   285,120         872,522            3.0602
16 Sep 2013 21:40:16  1276039  15939597   hadcm3n_ziy9_1960_40_008361916_2   259,200         793,072            3.0597
14 Sep 2013 09:48:19  1276039  15939597   hadcm3n_ziy9_1960_40_008361916_2   233,280         714,060            3.0610
12 Sep 2013 10:19:53  1276039  15939597   hadcm3n_ziy9_1960_40_008361916_2   207,360         637,537            3.0745
09 Sep 2013 18:03:14  1276039  15939597   hadcm3n_ziy9_1960_40_008361916_2   181,440         559,587            3.0841
07 Sep 2013 18:47:05  1276039  15939597   hadcm3n_ziy9_1960_40_008361916_2   155,520         480,819            3.0917
06 Sep 2013 09:02:12  1276039  15939597   hadcm3n_ziy9_1960_40_008361916_2   129,600         401,586            3.0987
03 Sep 2013 16:33:46  1276039  15939597   hadcm3n_ziy9_1960_40_008361916_2   103,680         321,098            3.0970
01 Sep 2013 14:26:30  1276039  15939597   hadcm3n_ziy9_1960_40_008361916_2    77,760         239,302            3.0774
30 Aug 2013 16:14:32  1276039  15939597   hadcm3n_ziy9_1960_40_008361916_2    51,840         158,293            3.0535
27 Aug 2013 16:28:13  1276039  15939597   hadcm3n_ziy9_1960_40_008361916_2    25,920          78,467            3.0273
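
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count reached at that trickle. That reading is an assumption drawn from the column headers, not something the page states. A minimal Python sketch under that assumption (the function name seconds_per_timestep is hypothetical) recomputes the value for three of the rows above:

```python
# Sketch only: assumes Average (sec/TS) = CPU Time (sec) / Timestep,
# which is inferred from the table headers rather than documented.

def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return cumulative CPU seconds per model timestep, to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)

# (timestep, cpu_time_sec, reported_average) copied from the trickle table.
rows = [
    (414_720, 1_272_841, 3.0692),
    (388_800, 1_193_284, 3.0691),
    (25_920, 78_467, 3.0273),
]

for timestep, cpu_time_sec, reported in rows:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    print(f"{timestep:>7} TS: computed {computed:.4f}, reported {reported:.4f}")
```

For these three rows the computed ratio matches the reported figure to four decimal places, which is consistent with the assumed definition.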


©2024 climateprediction.net