Name | hadcm3n_4clh_2020_40_008396266_0 |
Workunit | 8547125 |
Created | 26 Jun 2013, 1:45:55 UTC |
Sent | 27 Jun 2013, 12:38:02 UTC |
Report deadline | 26 Sep 2013, 20:05:13 UTC |
Received | 18 Aug 2013, 0:18:23 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1257384 |
Run time | 39 days 22 hours 6 min 52 sec |
CPU time | 34 days 21 hours 27 min 21 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 1.30 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 13:16:14 (1545): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 09:14:47 (13106): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 Suspended CPDN Monitor - Suspend request from BOINC... 04:29:49 (19933): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:53:17 (26571): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:39:15 (28012): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:15:57 (28989): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 12:37:00 (1713): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
01:18:14 (3507): No heartbeat from core client for 30 sec - exiting 01:18:15 (3507): No heartbeat from core client for 30 sec - exiting 01:18:16 (3507): No heartbeat from core client for 30 sec - exiting 01:18:17 (3507): No heartbeat from core client for 30 sec - exiting 01:18:18 (3507): No heartbeat from core client for 30 sec - exiting 01:18:19 (3507): No heartbeat from core client for 30 sec - exiting 01:18:20 (3507): No heartbeat from core client for 30 sec - exiting 01:18:21 (3507): No heartbeat from core client for 30 sec - exiting 01:18:22 (3507): No heartbeat from core client for 30 sec - exiting 01:18:23 (3507): No heartbeat from core client for 30 sec - exiting 01:18:24 (3507): No heartbeat from core client for 30 sec - exiting 01:18:25 (3507): No heartbeat from core client for 30 sec - exiting 01:18:26 (3507): No heartbeat from core client for 30 sec - exiting 01:18:27 (3507): No heartbeat from core client for 30 sec - exiting 01:18:28 (3507): No heartbeat from core client for 30 sec - exiting 01:18:29 (3507): No heartbeat from core client for 30 sec - exiting 01:18:30 (3507): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_4clh_2020_40_008396266/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_4clh_2020_40_008396266/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_4clh_2020_40_008396266/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_4clh_2020_40_008396266/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_4clh_2020_40_008396266/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_4clh_2020_40_008396266/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_4clh_2020_40_008396266/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_4clh_2020_40_008396266/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_4clh_2020_40_008396266/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_4clh_2020_40_008396266/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file 
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_4clh_2020_40_008396266/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_4clh_2020_40_008396266/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
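The exit status appears above in three forms: 22, 0x00000016 (shortened to 0x16 in the stderr message), and -234. As an arithmetic observation only, the negative figure equals the code minus 256. The sketch below reproduces the three forms; the function name `exit_code_forms` is hypothetical and nothing here is taken from the BOINC source.

```python
from typing import Tuple


def exit_code_forms(code: int) -> Tuple[str, str, int]:
    """Return (8-digit hex, short hex, code - 256) for an exit status in 0..255.

    Assumption: the negative value shown in the log is simply code - 256,
    which matches the figures on this page (22 -> -234).
    """
    if not 0 <= code <= 255:
        raise ValueError("expected an exit status in the range 0..255")
    return f"0x{code:08x}", f"0x{code:x}", code - 256


print(exit_code_forms(22))
```

For the status reported here, the call yields the same three values as the page: `0x00000016`, `0x16` and `-234`.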
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
17 Aug 2013 23:22:33 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 1,036,800 | 3,014,924 | 2.9079 |
17 Aug 2013 02:38:34 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 1,010,880 | 2,940,524 | 2.9089 |
16 Aug 2013 05:58:52 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 984,960 | 2,866,185 | 2.9100 |
15 Aug 2013 09:20:50 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 959,040 | 2,791,909 | 2.9111 |
14 Aug 2013 16:00:14 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 933,120 | 2,717,703 | 2.9125 |
14 Aug 2013 16:00:14 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 907,200 | 2,643,379 | 2.9138 |
14 Aug 2013 16:00:14 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 881,280 | 2,568,995 | 2.9151 |
14 Aug 2013 16:00:14 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 855,360 | 2,494,256 | 2.9160 |
14 Aug 2013 16:00:14 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 829,440 | 2,419,482 | 2.9170 |
30 Jul 2013 09:58:19 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 803,520 | 2,344,551 | 2.9179 |
29 Jul 2013 13:15:55 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 777,600 | 2,269,374 | 2.9184 |
29 Jul 2013 13:15:55 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 751,680 | 2,194,243 | 2.9191 |
29 Jul 2013 13:15:55 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 725,760 | 2,119,958 | 2.9210 |
26 Jul 2013 07:16:55 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 699,840 | 2,045,610 | 2.9230 |
25 Jul 2013 09:25:13 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 673,920 | 1,971,149 | 2.9249 |
24 Jul 2013 11:19:10 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 648,000 | 1,891,781 | 2.9194 |
23 Jul 2013 22:10:20 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 622,080 | 1,812,854 | 2.9142 |
23 Jul 2013 21:48:19 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 596,160 | 1,738,523 | 2.9162 |
23 Jul 2013 21:23:11 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 570,240 | 1,664,121 | 2.9183 |
23 Jul 2013 20:43:58 | 1257384 | 15865082 | hadcm3n_4clh_2020_40_008396266_0 | 544,320 | 1,589,788 | 2.9207 |
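The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. A minimal sketch that checks this against three of the rows above; the function name `seconds_per_timestep` is hypothetical and the rounding rule is an assumption inferred from the figures.

```python
def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals."""
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds, average reported on the page)
rows = [
    (1_036_800, 3_014_924, 2.9079),
    (1_010_880, 2_940_524, 2.9089),
    (544_320, 1_589_788, 2.9207),
]

for timestep, cpu_time, reported in rows:
    computed = seconds_per_timestep(cpu_time, timestep)
    print(timestep, computed, reported)
```

Because the average is cumulative, it changes only slightly from one trickle to the next, even though each trickle adds 25,920 timesteps.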