Task 22015693

Name hadam4h_a0wp_209411_5_895_012063799_1
Workunit 12063799
Created 2 Feb 2021, 19:20:25 UTC
Sent 2 Feb 2021, 21:19:40 UTC
Report deadline 16 Jan 2022, 2:39:40 UTC
Received 13 Mar 2021, 5:48:58 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1513803
Run time 8 days 7 hours 27 min 28 sec
CPU time 8 days 5 hours 46 min 8 sec
Validate state Invalid
Credit 13,636.74
Device peak FLOPS 5.03 GFLOPS
Application version UK Met Office HadAM4 at N216 resolution v8.52
i686-pc-linux-gnu
Peak working set size 1,357.10 MB
Peak swap size 1,378.46 MB
Peak disk usage 162.81 MB
Stderr
<core_client_version>7.16.6</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/xnnuj.pipe_dummy

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/xnnuj.pipe_dummy
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7846, iMonCtr=1
Model crash detected, will try to restart...
02:50:31 (7846): No heartbeat from client for 30 sec - exiting
02:50:31 (7846): timer handler: client dead, exiting
02:50:32 (7846): No heartbeat from client for 30 sec - exiting
02:50:32 (7846): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=77713, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=77713, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=77713, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=77713, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=77713, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=77713, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
16:48:28 (77713): called boinc_finish(22)

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                            Timestep  CPU Time (sec)  Average (sec/TS)
22 Feb 2021 11:10:37  1513803  22015693   hadam4h_a0wp_209411_5_895_012063799_1  17,483    563,946         32.2568
08 Feb 2021 04:05:52  1513803  22015693   hadam4h_a0wp_209411_5_895_012063799_1  8,843     290,796         32.8843
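
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. That reading is an inference from the two rows above, not something the page states. A minimal Python sketch that rechecks the arithmetic, where the function name sec_per_timestep is hypothetical and the numbers are copied from the trickle rows:

```python
# Trickle rows copied from the table: (timestep, cpu_time_sec, reported_average)
trickles = [
    (17483, 563946, 32.2568),
    (8843, 290796, 32.8843),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds divided by cumulative timesteps.

    Assumption: this is how the Average (sec/TS) column is derived.
    """
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in trickles:
    computed = round(sec_per_timestep(cpu_time_sec, timestep), 4)
    # Print the computed value next to the value reported on the page.
    print(timestep, computed, reported)
```

Under that assumption, both rows reproduce the reported averages to four decimal places, so the task was running at roughly 32 to 33 CPU seconds per timestep before it failed.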

