climateprediction.net home page
Task 15861455

Task 15861455

Name hadcm3n_z8em_1960_40_008317342_3
Workunit 8468477
Created 24 Jun 2013, 13:38:11 UTC
Sent 24 Jun 2013, 13:51:44 UTC
Report deadline 23 Sep 2013, 21:18:55 UTC
Received 18 Jul 2013, 11:47:59 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1117371
Run time 15 days 23 hours 11 min 17 sec
CPU time 8 days 20 hours 7 min 24 sec
Validate state Invalid
Credit 1,866.24
Device peak FLOPS 1.32 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
08:43:14 (1800): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:55:12 (924): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:55:13 (924): No heartbeat from core client for 30 sec - exiting
18:37:37 (5496): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:35:19 (2484): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:34:07 (4320): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:32:52 (1012): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:31:41 (4432): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:30:39 (5328): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:29:37 (5224): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:28:20 (2692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:27:18 (3948): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:24:49 (6008): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:23:26 (4812): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:22:15 (5616): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:21:07 (1532): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:16:27 (5452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/z8emko.pjg5c10
Error converting file to netcdf: dataout/z8emko.pig5c10
Error converting file to netcdf: dataout/z8emko.pfg5c10
Error converting file to netcdf: dataout/z8emka.phg5c10
Error converting file to netcdf: dataout/z8emka.pgg5c10
Error converting file to netcdf: dataout/z8emka.peg5c10
Error converting file to netcdf: dataout/z8emka.pdg5c10
22:14:19 (4768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3404, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3404, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3404, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3404, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3404, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3404, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
07 Jul 2013 14:27:11 1117371 15861455 hadcm3n_z8em_1960_40_008317342_3 155,520 663,097 4.2637
04 Jul 2013 18:35:29 1117371 15861455 hadcm3n_z8em_1960_40_008317342_3 129,600 559,572 4.3177
03 Jul 2013 03:19:08 1117371 15861455 hadcm3n_z8em_1960_40_008317342_3 103,680 448,180 4.3227
02 Jul 2013 11:23:50 1117371 15861455 hadcm3n_z8em_1960_40_008317342_3 77,760 334,368 4.3000
28 Jun 2013 12:25:09 1117371 15861455 hadcm3n_z8em_1960_40_008317342_3 51,840 225,059 4.3414
26 Jun 2013 21:02:38 1117371 15861455 hadcm3n_z8em_1960_40_008317342_3 25,920 112,090 4.3245


©2024 climateprediction.net