|
Name | famous_umop_1599_200_006662244_4 |
Workunit | 6865616 |
Created | 10 Jun 2010, 15:17:36 UTC |
Sent | 7 Jul 2010, 13:20:39 UTC |
Report deadline | 6 Oct 2010, 20:47:50 UTC |
Received | 8 Jul 2010, 7:30:51 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 821727 |
Run time | 15 hours 19 min 26 sec |
CPU time | 15 hours 19 min 26 sec |
Validate state | Invalid |
Credit | 555.96 |
Device peak FLOPS | 3.13 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>5.10.13</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 16:19:12 (2456): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 17:17:16 (2620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 MainError: 03:19:03 PM No files match the supplied pattern. MainError: 03:19:03 PM No files match the supplied pattern. 17:21:22 (3832): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:38:26 (3844): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:01:21 (432): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:51:59 (2856): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 MainError: 11:53:27 PM No files match the supplied pattern. MainError: 11:53:27 PM No files match the supplied pattern. 01:54:02 (3436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 01:59:04 (1772): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:10:27 (3332): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:14:17 (4056): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:00:37 (3348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 03:04:56 (3876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:14:10 (3688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 04:16:12 (2616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:17:13 (3236): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:18:14 (200): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:19:16 (656): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:20:19 (2508): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:21:21 (2136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:22:22 (536): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:23:23 (292): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:24:24 (368): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:25:25 (3000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:26:28 (2556): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:59:27 (472): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 05:13:16 (3528): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:00:56 (3228): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 06:06:06 (876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:16:03 (2136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:05:55 (2912): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 07:11:16 (392): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:21:08 (1996): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:10:27 (1504): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 08:14:21 (864): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:16:41 (3060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:21:57 (688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:25:45 (3044): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... cpdnmonitor: cannot open input file \\Dns\Boinc\johnpc\BOINC/projects/climateprediction.net/famous_umop_1599_200_006662244/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file \\Dns\Boinc\johnpc\BOINC/projects/climateprediction.net/famous_umop_1599_200_006662244/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file \\Dns\Boinc\johnpc\BOINC/projects/climateprediction.net/famous_umop_1599_200_006662244/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file \\Dns\Boinc\johnpc\BOINC/projects/climateprediction.net/famous_umop_1599_200_006662244/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file \\Dns\Boinc\johnpc\BOINC/projects/climateprediction.net/famous_umop_1599_200_006662244/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file \\Dns\Boinc\johnpc\BOINC/projects/climateprediction.net/famous_umop_1599_200_006662244/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy Sorry, too many model crashes! :-( 09:27:55 (580): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
08 Jul 2010 07:23:08 | 821727 | 11562079 | famous_umop_1599_200_006662244_4 | 168,506 | 54,814 | 0.3253 |
08 Jul 2010 06:23:29 | 821727 | 11562079 | famous_umop_1599_200_006662244_4 | 159,146 | 51,764 | 0.3253 |
08 Jul 2010 05:07:43 | 821727 | 11562079 | famous_umop_1599_200_006662244_4 | 149,786 | 48,570 | 0.3243 |
08 Jul 2010 04:03:50 | 821727 | 11562079 | famous_umop_1599_200_006662244_4 | 140,426 | 45,310 | 0.3227 |
08 Jul 2010 03:04:42 | 821727 | 11562079 | famous_umop_1599_200_006662244_4 | 131,066 | 42,049 | 0.3208 |
08 Jul 2010 02:07:53 | 821727 | 11562079 | famous_umop_1599_200_006662244_4 | 121,706 | 38,748 | 0.3184 |
08 Jul 2010 00:05:42 | 821727 | 11562079 | famous_umop_1599_200_006662244_4 | 112,346 | 35,688 | 0.3177 |
07 Jul 2010 23:04:37 | 821727 | 11562079 | famous_umop_1599_200_006662244_4 | 102,986 | 32,432 | 0.3149 |
07 Jul 2010 23:04:37 | 821727 | 11562079 | famous_umop_1599_200_006662244_4 | 93,626 | 29,515 | 0.3152 |
07 Jul 2010 23:04:37 | 821727 | 11562079 | famous_umop_1599_200_006662244_4 | 84,266 | 26,640 | 0.3161 |
07 Jul 2010 23:04:37 | 821727 | 11562079 | famous_umop_1599_200_006662244_4 | 74,906 | 23,743 | 0.3170 |
07 Jul 2010 23:04:37 | 821727 | 11562079 | famous_umop_1599_200_006662244_4 | 65,546 | 20,823 | 0.3177 |
07 Jul 2010 23:04:37 | 821727 | 11562079 | famous_umop_1599_200_006662244_4 | 56,186 | 17,955 | 0.3196 |
07 Jul 2010 23:04:37 | 821727 | 11562079 | famous_umop_1599_200_006662244_4 | 46,826 | 15,033 | 0.3210 |
07 Jul 2010 23:04:37 | 821727 | 11562079 | famous_umop_1599_200_006662244_4 | 37,466 | 12,174 | 0.3249 |
07 Jul 2010 23:04:37 | 821727 | 11562079 | famous_umop_1599_200_006662244_4 | 28,106 | 9,310 | 0.3312 |
07 Jul 2010 15:27:46 | 821727 | 11562079 | famous_umop_1599_200_006662244_4 | 18,746 | 6,498 | 0.3466 |
07 Jul 2010 14:24:41 | 821727 | 11562079 | famous_umop_1599_200_006662244_4 | 9,386 | 3,248 | 0.3460 |
©2024 climateprediction.net