climateprediction.net home page
Task 13639817

Task 13639817

Name hadcm3n_ym4j_1900_40_007524810_3
Workunit 7722285
Created 16 Nov 2011, 20:11:38 UTC
Sent 18 Nov 2011, 10:32:38 UTC
Report deadline 17 Feb 2012, 17:59:49 UTC
Received 17 Dec 2011, 12:48:26 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1114641
Run time 4 days 16 hours 58 min
CPU time 4 days 8 hours 29 min 45 sec
Validate state Invalid
Credit 2,177.28
Device peak FLOPS 2.62 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
06:17:45 (4200): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3064, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
20:01:16 (5988): No heartbeat from core client for 30 sec - exiting
20:01:17 (5988): No heartbeat from core client for 30 sec - exiting
20:01:18 (5988): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
09:23:54 (4440): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:25:26 (3436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:21:46 (5428): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/ym4jko.pja3c10
Error converting file to netcdf: dataout/ym4jko.pia3c10
Error converting file to netcdf: dataout/ym4jko.pfa3c10
Error converting file to netcdf: dataout/ym4jka.pha3c10
Error converting file to netcdf: dataout/ym4jka.pga3c10
Error converting file to netcdf: dataout/ym4jka.pea3c10
Error converting file to netcdf: dataout/ym4jka.pda3c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:00:45 (1968): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:34:32 (5876): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:32:33 (5180): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:16:37 (5036): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:10:43 (5984): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:47:13 (5224): No heartbeat from core client for 30 sec - exiting
06:47:14 (5224): No heartbeat from core client for 30 sec - exiting
06:47:15 (5224): No heartbeat from core client for 30 sec - exiting
06:47:16 (5224): No heartbeat from core client for 30 sec - exiting
06:47:17 (5224): No heartbeat from core client for 30 sec - exiting
06:47:18 (5224): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:46:22 (5512): No heartbeat from core client for 30 sec - exiting
17:46:23 (5512): No heartbeat from core client for 30 sec - exiting
17:46:24 (5512): No heartbeat from core client for 30 sec - exiting
17:46:25 (5512): No heartbeat from core client for 30 sec - exiting
17:46:26 (5512): No heartbeat from core client for 30 sec - exiting
17:46:27 (5512): No heartbeat from core client for 30 sec - exiting
17:46:28 (5512): No heartbeat from core client for 30 sec - exiting
17:46:29 (5512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
18:01:07 (5732): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:00:18 (5160): No heartbeat from core client for 30 sec - exiting
19:00:19 (5160): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3320, iMonCtr=1
Model crash detected, will try to restart...
19:32:54 (1920): No heartbeat from core client for 30 sec - exiting
19:32:55 (1920): No heartbeat from core client for 30 sec - exiting
19:32:56 (1920): No heartbeat from core client for 30 sec - exiting
19:32:57 (1920): No heartbeat from core client for 30 sec - exiting
19:32:58 (1920): No heartbeat from core client for 30 sec - exiting
19:32:59 (1920): No heartbeat from core client for 30 sec - exiting
19:33:00 (1920): No heartbeat from core client for 30 sec - exiting
19:33:01 (1920): No heartbeat from core client for 30 sec - exiting
19:33:02 (1920): No heartbeat from core client for 30 sec - exiting
19:33:03 (1920): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
06:06:04 (5972): No heartbeat from core client for 30 sec - exiting
06:06:05 (5972): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5636, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5636, iMonCtr=1
Model crash detected, will try to restart...
06:43:43 (5636): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7156, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1528, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1528, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1528, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
14 Dec 2011 21:35:57 1114641 13639817 hadcm3n_ym4j_1900_40_007524810_3 181,440 353,120 1.9462
11 Dec 2011 02:00:11 1114641 13639817 hadcm3n_ym4j_1900_40_007524810_3 155,520 302,276 1.9436
04 Dec 2011 18:07:29 1114641 13639817 hadcm3n_ym4j_1900_40_007524810_3 129,600 251,628 1.9416
02 Dec 2011 03:10:26 1114641 13639817 hadcm3n_ym4j_1900_40_007524810_3 103,680 200,749 1.9362
26 Nov 2011 19:41:05 1114641 13639817 hadcm3n_ym4j_1900_40_007524810_3 77,760 150,330 1.9333
25 Nov 2011 02:04:15 1114641 13639817 hadcm3n_ym4j_1900_40_007524810_3 51,840 100,297 1.9347
21 Nov 2011 00:36:29 1114641 13639817 hadcm3n_ym4j_1900_40_007524810_3 25,920 50,097 1.9328


©2024 climateprediction.net