climateprediction.net home page
Task 15525017

Task 15525017

Name hadcm3n_o3iw_2140_40_008269668_2
Workunit 8424792
Created 7 Jan 2013, 5:50:29 UTC
Sent 7 Jan 2013, 5:50:35 UTC
Report deadline 8 Apr 2013, 13:17:46 UTC
Received 19 Jan 2013, 4:49:05 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1259207
Run time 7 days 6 hours 6 min 24 sec
CPU time 5 days 21 hours 58 min 51 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 3.39 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14892, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14892, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
01:58:53 (6088): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:00:47 (1732): No heartbeat from core client for 30 sec - exiting
02:00:52 (1732): No heartbeat from core client for 30 sec - exiting
02:00:53 (1732): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	02:50:49 PM	No files match the supplied pattern.
MainError:	02:50:49 PM	No files match the supplied pattern.
MainError:	08:38:39 PM	No files match the supplied pattern.
MainError:	08:38:39 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	02:29:22 AM	No files match the supplied pattern.
MainError:	02:29:22 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	08:25:57 AM	No files match the supplied pattern.
MainError:	08:25:57 AM	No files match the supplied pattern.
MainError:	02:19:06 PM	No files match the supplied pattern.
MainError:	02:19:06 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8572, selfPID=8572, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	08:18:22 PM	No files match the supplied pattern.
MainError:	08:18:22 PM	No files match the supplied pattern.
MainError:	02:09:29 AM	No files match the supplied pattern.
MainError:	02:09:29 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	08:02:49 AM	No files match the supplied pattern.
MainError:	08:02:49 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	02:10:37 PM	No files match the supplied pattern.
MainError:	02:10:37 PM	No files match the supplied pattern.
MainError:	08:15:11 PM	No files match the supplied pattern.
MainError:	08:15:11 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Error converting file to netcdf: dataout/o3iwka.ph11c10
Error converting file to netcdf: dataout/o3iwka.pg11c10
Error converting file to netcdf: dataout/o3iwka.pe11c10
MainError:	02:20:56 AM	No files match the supplied pattern.
MainError:	02:20:56 AM	No files match the supplied pattern.
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
19 Jan 2013 02:23:48 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 777,600 620,736 0.7983
18 Jan 2013 20:16:18 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 751,680 598,859 0.7967
18 Jan 2013 14:36:14 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 725,760 577,036 0.7951
18 Jan 2013 08:04:55 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 699,840 555,567 0.7938
18 Jan 2013 02:13:15 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 673,920 534,423 0.7930
17 Jan 2013 20:21:05 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 648,000 513,403 0.7923
17 Jan 2013 14:21:04 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 622,080 492,504 0.7917
17 Jan 2013 08:29:33 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 596,160 471,384 0.7907
17 Jan 2013 02:33:16 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 570,240 450,239 0.7896
16 Jan 2013 21:26:33 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 544,320 429,330 0.7887
16 Jan 2013 14:50:47 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 518,400 408,548 0.7881
16 Jan 2013 08:52:36 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 492,480 387,947 0.7877
15 Jan 2013 20:01:37 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 466,560 367,898 0.7885
15 Jan 2013 11:59:16 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 440,640 347,574 0.7888
15 Jan 2013 06:12:57 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 414,720 326,995 0.7885
15 Jan 2013 00:51:47 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 388,800 306,592 0.7886
14 Jan 2013 19:17:43 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 362,880 286,421 0.7893
14 Jan 2013 13:40:31 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 336,960 266,314 0.7903
14 Jan 2013 07:53:34 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 311,040 245,823 0.7903
14 Jan 2013 01:52:03 1259207 15525017 hadcm3n_o3iw_2140_40_008269668_2 285,120 225,335 0.7903


©2024 climateprediction.net