climateprediction.net home page
Task 15796203

Task 15796203

Name hadcm3n_zhsj_1920_40_008316055_2
Workunit 8467190
Created 26 May 2013, 3:11:00 UTC
Sent 26 May 2013, 3:11:09 UTC
Report deadline 25 Aug 2013, 10:38:20 UTC
Received 13 Jun 2013, 8:46:59 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1276667
Run time 14 days 9 hours 6 min 28 sec
CPU time 14 days 5 hours 47 min 46 sec
Validate state Invalid
Credit 12,441.60
Device peak FLOPS 2.98 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18364, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:37:19 (4456): No heartbeat from core client for 30 sec - exiting
05:37:20 (4456): No heartbeat from core client for 30 sec - exiting
05:37:21 (4456): No heartbeat from core client for 30 sec - exiting
05:37:22 (4456): No heartbeat from core client for 30 sec - exiting
05:37:23 (4456): No heartbeat from core client for 30 sec - exiting
05:37:24 (4456): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:10:19 (7580): No heartbeat from core client for 30 sec - exiting
21:10:20 (7580): No heartbeat from core client for 30 sec - exiting
21:10:21 (7580): No heartbeat from core client for 30 sec - exiting
21:10:22 (7580): No heartbeat from core client for 30 sec - exiting
21:10:23 (7580): No heartbeat from core client for 30 sec - exiting
21:10:24 (7580): No heartbeat from core client for 30 sec - exiting
21:10:25 (7580): No heartbeat from core client for 30 sec - exiting
21:10:26 (7580): No heartbeat from core client for 30 sec - exiting
21:10:27 (7580): No heartbeat from core client for 30 sec - exiting
21:10:28 (7580): No heartbeat from core client for 30 sec - exiting
21:10:29 (7580): No heartbeat from core client for 30 sec - exiting
21:10:30 (7580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:48:39 (23852): Can't acquire lockfile (32) - waiting 35s
07:48:50 (25324): No heartbeat from core client for 30 sec - exiting
07:48:51 (25324): No heartbeat from core client for 30 sec - exiting
07:48:52 (25324): No heartbeat from core client for 30 sec - exiting
07:48:53 (25324): No heartbeat from core client for 30 sec - exiting
07:48:54 (25324): No heartbeat from core client for 30 sec - exiting
07:48:55 (25324): No heartbeat from core client for 30 sec - exiting
07:48:56 (25324): No heartbeat from core client for 30 sec - exiting
07:48:57 (25324): No heartbeat from core client for 30 sec - exiting
07:48:58 (25324): No heartbeat from core client for 30 sec - exiting
07:48:59 (25324): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
06:09:20 (8060): No heartbeat from core client for 30 sec - exiting
06:09:21 (8060): No heartbeat from core client for 30 sec - exiting
06:09:22 (8060): No heartbeat from core client for 30 sec - exiting
06:09:23 (8060): No heartbeat from core client for 30 sec - exiting
06:09:24 (8060): No heartbeat from core client for 30 sec - exiting
06:09:25 (8060): No heartbeat from core client for 30 sec - exiting
06:09:26 (8060): No heartbeat from core client for 30 sec - exiting
06:09:27 (8060): No heartbeat from core client for 30 sec - exiting
06:09:28 (8060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
13 Jun 2013 06:58:08 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 1,036,800 1,230,511 1.1868
12 Jun 2013 20:42:21 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 1,010,880 1,199,906 1.1870
12 Jun 2013 11:54:43 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 984,960 1,169,268 1.1871
12 Jun 2013 00:37:10 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 959,040 1,138,956 1.1876
11 Jun 2013 05:01:38 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 933,120 1,108,444 1.1879
10 Jun 2013 19:24:15 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 907,200 1,077,944 1.1882
10 Jun 2013 10:58:43 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 881,280 1,047,883 1.1890
10 Jun 2013 02:35:12 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 855,360 1,017,811 1.1899
09 Jun 2013 18:13:47 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 829,440 987,746 1.1909
09 Jun 2013 09:50:03 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 803,520 957,730 1.1919
09 Jun 2013 01:21:47 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 777,600 927,479 1.1927
08 Jun 2013 16:57:31 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 751,680 897,361 1.1938
08 Jun 2013 08:38:37 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 725,760 867,401 1.1952
08 Jun 2013 00:55:07 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 699,840 837,109 1.1961
07 Jun 2013 16:21:59 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 673,920 806,730 1.1971
07 Jun 2013 06:49:35 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 648,000 775,867 1.1973
06 Jun 2013 16:28:19 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 622,080 746,362 1.1998
06 Jun 2013 02:48:41 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 596,160 715,336 1.1999
05 Jun 2013 18:12:47 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 570,240 684,060 1.1996
05 Jun 2013 06:59:36 1276667 15796203 hadcm3n_zhsj_1920_40_008316055_2 544,320 652,281 1.1983


©2024 climateprediction.net