climateprediction.net home page
Task 15854655

Name hadcm3n_3bjc_2020_40_008393120_0
Workunit 8543979
Created 21 Jun 2013, 21:10:07 UTC
Sent 24 Jun 2013, 2:14:21 UTC
Report deadline 23 Sep 2013, 9:41:32 UTC
Received 9 Jul 2013, 15:31:12 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1169946
Run time 1 days 23 hours 29 min 13 sec
CPU time 1 days 20 hours 25 min 37 sec
Validate state Invalid
Credit 1,555.20
Device peak FLOPS 3.28 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
20:26:34 (844): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:44:41 (4608): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:58:07 (6772): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:27:00 (3440): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:26:12 (5248): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:07:43 (1112): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:33:03 (4592): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:47:20 (5512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:27:12 (4572): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:10:44 (6328): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:56:42 (6276): No heartbeat from core client for 30 sec - exiting
21:56:43 (6276): No heartbeat from core client for 30 sec - exiting
21:56:44 (6276): No heartbeat from core client for 30 sec - exiting
21:56:45 (6276): No heartbeat from core client for 30 sec - exiting
21:56:46 (6276): No heartbeat from core client for 30 sec - exiting
21:56:48 (6276): No heartbeat from core client for 30 sec - exiting
21:56:49 (6276): No heartbeat from core client for 30 sec - exiting
21:56:50 (6276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/3bjcko.pjm5c10
Error converting file to netcdf: dataout/3bjcko.pim5c10
Error converting file to netcdf: dataout/3bjcko.pfm5c10
Error converting file to netcdf: dataout/3bjcka.phm5c10
Error converting file to netcdf: dataout/3bjcka.pgm5c10
Error converting file to netcdf: dataout/3bjcka.pem5c10
Error converting file to netcdf: dataout/3bjcka.pdm5c10
11:47:46 (6508): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:11:53 (6464): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:37:53 (160): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6092, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6092, iMonCtr=1
Model crash detected, will try to restart...
08:38:02 (5736): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4008, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4008, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4008, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4008, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4008, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
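
The recurring messages in a stderr dump like the one above can be tallied to get a quick overview. The following is a rough Python sketch only: the category names, the regular expressions, and the `tally_stderr` function are illustrative assumptions, not part of BOINC or the CPDN application, and the sample is a handful of lines copied from the log.

```python
import re
from collections import Counter

# Hypothetical categories; each pattern is taken from message text seen in the log.
PATTERNS = {
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "buffin_io_error": re.compile(r"^BUFFIN: C I/O Error"),
    "netcdf_conversion_error": re.compile(r"^Error converting file to netcdf"),
    "model_crash": re.compile(r"^Model crash detected"),
}

def tally_stderr(text: str) -> Counter:
    """Count how many lines fall into each category (first matching pattern wins)."""
    counts = Counter()
    for line in text.splitlines():
        for name, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[name] += 1
                break
    return counts

# A few lines copied from the stderr section, used here as sample input.
sample = """\
20:26:34 (844): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
Error converting file to netcdf: dataout/3bjcko.pjm5c10
Model crash detected, will try to restart...
"""

for category, count in tally_stderr(sample).items():
    print(category, count)
```

Running the same function over the full stderr text would show how often each kind of message occurred before the task gave up.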
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
07 Jul 2013 05:00:13  1169946  15854655   hadcm3n_3bjc_2020_40_008393120_0  129,600   156,318         1.2062
06 Jul 2013 19:23:31  1169946  15854655   hadcm3n_3bjc_2020_40_008393120_0  103,680   124,843         1.2041
06 Jul 2013 05:25:35  1169946  15854655   hadcm3n_3bjc_2020_40_008393120_0  77,760    93,127          1.1976
04 Jul 2013 14:05:03  1169946  15854655   hadcm3n_3bjc_2020_40_008393120_0  51,840    61,840          1.1929
25 Jun 2013 17:21:36  1169946  15854655   hadcm3n_3bjc_2020_40_008393120_0  25,920    30,494          1.1765
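
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. That relationship is inferred from the figures rather than stated on the page, so the Python sketch below treats it as an assumption and simply checks it against the five rows; the `trickles` list and the `average_sec_per_ts` helper are illustrative names.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average)
trickles = [
    (129_600, 156_318, 1.2062),
    (103_680, 124_843, 1.2041),
    (77_760, 93_127, 1.1976),
    (51_840, 61_840, 1.1929),
    (25_920, 30_494, 1.1765),
]

def average_sec_per_ts(cpu_time_sec: int, timestep: int) -> float:
    """CPU seconds per model timestep, rounded to 4 decimal places like the table."""
    return round(cpu_time_sec / timestep, 4)

for timestep, cpu_time_sec, reported in trickles:
    computed = average_sec_per_ts(cpu_time_sec, timestep)
    print(f"{timestep:>7}  computed {computed:.4f}  reported {reported:.4f}")
```

Under this assumption the computed values match the reported column for every row, and the slow rise from 1.1765 to 1.2062 means the later timesteps cost slightly more CPU time on average.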