Task 15781141

Name hadcm3n_zm5n_1960_40_008367751_0
Workunit 8518610
Created 13 May 2013, 7:14:47 UTC
Sent 13 May 2013, 7:15:05 UTC
Report deadline 12 Aug 2013, 14:42:16 UTC
Received 3 Jun 2013, 2:45:54 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1159643
Run time 1 day 22 hours 40 min 35 sec
CPU time 1 day 18 hours 43 min 52 sec
Validate state Invalid
Credit 2,177.28
Device peak FLOPS 3.43 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=39768, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:44:21 (4268): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:17:44 (4140): No heartbeat from core client for 30 sec - exiting
20:17:46 (4140): No heartbeat from core client for 30 sec - exiting
20:17:47 (4140): No heartbeat from core client for 30 sec - exiting
20:17:48 (4140): No heartbeat from core client for 30 sec - exiting
20:17:49 (4140): No heartbeat from core client for 30 sec - exiting
20:17:50 (4140): No heartbeat from core client for 30 sec - exiting
20:17:51 (4140): No heartbeat from core client for 30 sec - exiting
20:17:52 (4140): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/zm5nko.pjg6c10
Error converting file to netcdf: dataout/zm5nko.pig6c10
Error converting file to netcdf: dataout/zm5nko.pfg6c10
Error converting file to netcdf: dataout/zm5nka.phg6c10
Error converting file to netcdf: dataout/zm5nka.pgg6c10
Error converting file to netcdf: dataout/zm5nka.peg6c10
Error converting file to netcdf: dataout/zm5nka.pdg6c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4696, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4216, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4216, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4216, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4216, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4216, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
27 May 2013 18:45:32  1159643  15781141   hadcm3n_zm5n_1960_40_008367751_0  181,440   138,982         0.7660
26 May 2013 16:42:44  1159643  15781141   hadcm3n_zm5n_1960_40_008367751_0  155,520   119,067         0.7656
26 May 2013 09:05:15  1159643  15781141   hadcm3n_zm5n_1960_40_008367751_0  129,600   99,292          0.7661
23 May 2013 06:27:15  1159643  15781141   hadcm3n_zm5n_1960_40_008367751_0  103,680   79,351          0.7653
20 May 2013 14:27:56  1159643  15781141   hadcm3n_zm5n_1960_40_008367751_0  77,760    59,414          0.7641
19 May 2013 16:40:10  1159643  15781141   hadcm3n_zm5n_1960_40_008367751_0  51,840    39,506          0.7621
13 May 2013 13:16:41  1159643  15781141   hadcm3n_zm5n_1960_40_008367751_0  25,920    19,733          0.7613
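
The Average (sec/TS) column above appears to be the cumulative CPU time divided by the timestep count reached at each trickle. The following is a minimal sketch that checks this reading against the seven rows in the table. The names `TRICKLES`, `sec_per_timestep`, and `rows_consistent` are illustrative only and are not part of BOINC or the CPDN software.

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
TRICKLES = [
    (181_440, 138_982, 0.7660),
    (155_520, 119_067, 0.7656),
    (129_600, 99_292, 0.7661),
    (103_680, 79_351, 0.7653),
    (77_760, 59_414, 0.7641),
    (51_840, 39_506, 0.7621),
    (25_920, 19_733, 0.7613),
]


def sec_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Average CPU seconds spent per model timestep so far (assumed definition)."""
    return cpu_seconds / timestep


def rows_consistent(rows, tolerance=5e-5):
    """Return True if every reported average matches CPU time / timestep.

    The tolerance is half a unit in the fourth decimal place, because the
    table reports the average to four decimals.
    """
    return all(
        abs(sec_per_timestep(cpu, ts) - reported) <= tolerance
        for ts, cpu, reported in rows
    )


if __name__ == "__main__":
    for ts, cpu, reported in TRICKLES:
        print(f"{ts:>7} steps: computed {sec_per_timestep(cpu, ts):.4f}, reported {reported:.4f}")
    print("all rows consistent:", rows_consistent(TRICKLES))
```

Under this assumption, each trickle covers a further 25,920 timesteps, and the per-timestep cost stays close to 0.76 to 0.77 seconds across all seven trickles.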

