Name | hadcm3n_zhsj_1920_40_008316055_2 |
Workunit | 8467190 |
Created | 26 May 2013, 3:11:00 UTC |
Sent | 26 May 2013, 3:11:09 UTC |
Report deadline | 25 Aug 2013, 10:38:20 UTC |
Received | 13 Jun 2013, 8:46:59 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1276667 |
Run time | 14 days 9 hours 6 min 28 sec |
CPU time | 14 days 5 hours 47 min 46 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 2.98 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18364, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 05:37:19 (4456): No heartbeat from core client for 30 sec - exiting 05:37:20 (4456): No heartbeat from core client for 30 sec - exiting 05:37:21 (4456): No heartbeat from core client for 30 sec - exiting 05:37:22 (4456): No heartbeat from core client for 30 sec - exiting 05:37:23 (4456): No heartbeat from core client for 30 sec - exiting 05:37:24 (4456): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:10:19 (7580): No heartbeat from core client for 30 sec - exiting 21:10:20 (7580): No heartbeat from core client for 30 sec - exiting 21:10:21 (7580): No heartbeat from core client for 30 sec - exiting 21:10:22 (7580): No heartbeat from core client for 30 sec - exiting 21:10:23 (7580): No heartbeat from core client for 30 sec - exiting 21:10:24 (7580): No heartbeat from core client for 30 sec - exiting 21:10:25 (7580): No heartbeat from core client for 30 sec - exiting 21:10:26 (7580): No heartbeat from core client for 30 sec - exiting 21:10:27 (7580): No heartbeat from core client for 30 sec - exiting 21:10:28 (7580): No heartbeat from core client for 30 sec - exiting 21:10:29 (7580): No heartbeat from core client for 30 sec - exiting 21:10:30 (7580): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:48:39 (23852): Can't acquire lockfile (32) - waiting 35s 07:48:50 (25324): No heartbeat from core client for 30 sec - exiting 07:48:51 (25324): No heartbeat from core client for 30 sec - exiting 07:48:52 (25324): No heartbeat from core client for 30 sec - exiting 07:48:53 (25324): No heartbeat from core client for 30 sec - exiting 07:48:54 (25324): No heartbeat from core client for 30 sec - exiting 07:48:55 (25324): No heartbeat from core client for 30 sec - exiting 07:48:56 (25324): No heartbeat from core client for 30 sec - exiting 07:48:57 (25324): No heartbeat from core client for 30 sec - exiting 07:48:58 (25324): No heartbeat from core client for 30 sec - exiting 07:48:59 (25324): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 06:09:20 (8060): No heartbeat from core client for 30 sec - exiting 06:09:21 (8060): No heartbeat from core client for 30 sec - exiting 06:09:22 (8060): No heartbeat from core client for 30 sec - exiting 06:09:23 (8060): No heartbeat from core client for 30 sec - exiting 06:09:24 (8060): No heartbeat from core client for 30 sec - exiting 06:09:25 (8060): No heartbeat from core client for 30 sec - exiting 06:09:26 (8060): No heartbeat from core client for 30 sec - exiting 06:09:27 (8060): No heartbeat from core client for 30 sec - exiting 06:09:28 (8060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zhsj_1920_40_008316055/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
13 Jun 2013 06:58:08 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 1,036,800 | 1,230,511 | 1.1868 |
12 Jun 2013 20:42:21 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 1,010,880 | 1,199,906 | 1.1870 |
12 Jun 2013 11:54:43 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 984,960 | 1,169,268 | 1.1871 |
12 Jun 2013 00:37:10 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 959,040 | 1,138,956 | 1.1876 |
11 Jun 2013 05:01:38 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 933,120 | 1,108,444 | 1.1879 |
10 Jun 2013 19:24:15 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 907,200 | 1,077,944 | 1.1882 |
10 Jun 2013 10:58:43 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 881,280 | 1,047,883 | 1.1890 |
10 Jun 2013 02:35:12 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 855,360 | 1,017,811 | 1.1899 |
09 Jun 2013 18:13:47 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 829,440 | 987,746 | 1.1909 |
09 Jun 2013 09:50:03 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 803,520 | 957,730 | 1.1919 |
09 Jun 2013 01:21:47 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 777,600 | 927,479 | 1.1927 |
08 Jun 2013 16:57:31 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 751,680 | 897,361 | 1.1938 |
08 Jun 2013 08:38:37 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 725,760 | 867,401 | 1.1952 |
08 Jun 2013 00:55:07 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 699,840 | 837,109 | 1.1961 |
07 Jun 2013 16:21:59 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 673,920 | 806,730 | 1.1971 |
07 Jun 2013 06:49:35 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 648,000 | 775,867 | 1.1973 |
06 Jun 2013 16:28:19 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 622,080 | 746,362 | 1.1998 |
06 Jun 2013 02:48:41 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 596,160 | 715,336 | 1.1999 |
05 Jun 2013 18:12:47 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 570,240 | 684,060 | 1.1996 |
05 Jun 2013 06:59:36 | 1276667 | 15796203 | hadcm3n_zhsj_1920_40_008316055_2 | 544,320 | 652,281 | 1.1983 |
©2024 climateprediction.net