Name | famous_w00e_599_200_006746702_2 |
Workunit | 6950018 |
Created | 17 Dec 2010, 20:20:31 UTC |
Sent | 17 Dec 2010, 20:20:37 UTC |
Report deadline | 19 Mar 2011, 3:47:48 UTC |
Received | 23 Dec 2010, 13:42:56 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 925834 |
Run time | |
CPU time | 5 days 9 hours 37 min 43 sec |
Validate state | Invalid |
Credit | 710.36 |
Device peak FLOPS | 0.40 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.2.19</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... 08:58:41 (3292): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 14:46:40 (1164): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... 19:11:17 (3728): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:11:18 (3728): No heartbeat from core client for 30 sec - exiting 08:19:26 (2892): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 10:30:51 (2628): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... forrtl: The requested operation cannot be performed on a file with a user-mapped section open. forrtl: severe (38): error during write, unit 6, file C:\Program Files\BOINC\Application Data\projects\climateprediction.net\famous_w00e_599_200_006746702\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown famous_um_6.11_wi 0088196C Unknown Unknown Unknown famous_um_6.11_wi 0080BD3E Unknown Unknown Unknown famous_um_6.11_wi 0080B95B Unknown Unknown Unknown famous_um_6.11_wi 007F0945 Unknown Unknown Unknown famous_um_6.11_wi 007F00F5 Unknown Unknown Unknown famous_um_6.11_wi 007BA86B Unknown Unknown Unknown famous_um_6.11_wi 007BA0FF Unknown Unknown Unknown famous_um_6.11_wi 007BA0A0 Unknown Unknown Unknown famous_um_6.11_wi 00776C15 Unknown Unknown Unknown famous_um_6.11_wi 007CA0F6 Unknown Unknown Unknown kernel32.dll 7C817077 Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3072, iMonCtr=1 Model crash detected, will try to restart... 20:00:29 (3072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 21:01:32 (3896): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:01:33 (3896): No heartbeat from core client for 30 sec - exiting 22:02:47 (2272): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:02:49 (2272): No heartbeat from core client for 30 sec - exiting 23:04:49 (3968): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:04:50 (3968): No heartbeat from core client for 30 sec - exiting 23:50:13 (884): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:50:15 (884): No heartbeat from core client for 30 sec - exiting 23:50:16 (884): No heartbeat from core client for 30 sec - exiting 23:50:17 (884): No heartbeat from core client for 30 sec - exiting 23:50:18 (884): No heartbeat from core client for 30 sec - exiting 23:50:19 (884): No heartbeat from core client for 30 sec - exiting 23:50:20 (884): No heartbeat from core client for 30 sec - exiting 23:50:21 (884): No heartbeat from core client for 30 sec - exiting 23:50:22 (884): No heartbeat from core client for 30 sec - exiting 23:50:24 (884): No heartbeat from core client for 30 sec - exiting 23:50:25 (884): No heartbeat from core client for 30 sec - exiting 00:05:53 (3840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:07:01 (1840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:07:03 (1840): No heartbeat from core client for 30 sec - exiting 02:08:13 (3924): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:08:14 (3924): No heartbeat from core client for 30 sec - exiting 04:22:52 (712): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 05:33:03 (3608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:33:04 (3608): No heartbeat from core client for 30 sec - exiting cpdnmonitor: cannot open input file C:\Program Files\BOINC\Application Data/projects/climateprediction.net/famous_w00e_599_200_006746702/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file C:\Program Files\BOINC\Application Data/projects/climateprediction.net/famous_w00e_599_200_006746702/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file C:\Program Files\BOINC\Application Data/projects/climateprediction.net/famous_w00e_599_200_006746702/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file C:\Program Files\BOINC\Application Data/projects/climateprediction.net/famous_w00e_599_200_006746702/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file C:\Program Files\BOINC\Application Data/projects/climateprediction.net/famous_w00e_599_200_006746702/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file C:\Program Files\BOINC\Application Data/projects/climateprediction.net/famous_w00e_599_200_006746702/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy Sorry, too many model crashes! :-( 05:40:45 (5116): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
23 Dec 2010 11:12:56 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 215,306 | 458,566 | 2.1298 |
23 Dec 2010 11:12:56 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 205,946 | 439,569 | 2.1344 |
22 Dec 2010 22:13:47 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 196,586 | 419,973 | 2.1363 |
22 Dec 2010 16:25:29 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 187,226 | 400,437 | 2.1388 |
22 Dec 2010 12:44:38 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 177,866 | 380,482 | 2.1391 |
22 Dec 2010 12:44:38 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 168,506 | 360,670 | 2.1404 |
21 Dec 2010 22:52:29 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 159,146 | 340,930 | 2.1422 |
21 Dec 2010 16:59:31 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 149,786 | 321,012 | 2.1431 |
21 Dec 2010 11:17:45 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 140,426 | 300,956 | 2.1432 |
21 Dec 2010 10:28:36 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 131,066 | 280,958 | 2.1436 |
20 Dec 2010 23:41:48 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 121,706 | 260,955 | 2.1441 |
20 Dec 2010 18:20:51 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 112,346 | 240,923 | 2.1445 |
20 Dec 2010 12:11:25 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 102,986 | 220,699 | 2.1430 |
20 Dec 2010 10:07:34 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 93,626 | 200,429 | 2.1407 |
20 Dec 2010 00:27:59 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 84,266 | 180,209 | 2.1386 |
19 Dec 2010 18:41:55 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 74,906 | 160,129 | 2.1377 |
19 Dec 2010 14:50:05 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 65,546 | 140,083 | 2.1372 |
19 Dec 2010 14:50:05 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 56,186 | 120,063 | 2.1369 |
19 Dec 2010 14:50:05 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 46,826 | 100,055 | 2.1367 |
18 Dec 2010 19:47:24 | 925834 | 12392289 | famous_w00e_599_200_006746702_2 | 37,466 | 80,070 | 2.1371 |
©2024 climateprediction.net