|
Name | famous_v2uw_1799_200_006731145_1 |
Workunit | 6934486 |
Created | 24 Oct 2010, 4:44:37 UTC |
Sent | 24 Oct 2010, 23:04:05 UTC |
Report deadline | 24 Jan 2011, 6:31:16 UTC |
Received | 25 Oct 2010, 23:00:46 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1106244 |
Run time | 10 hours 37 min 55 sec |
CPU time | 10 hours 26 min 23 sec |
Validate state | Invalid |
Credit | 370.67 |
Device peak FLOPS | 2.86 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 03:39:42 (3252): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:56:45 (2488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:56:46 (2488): No heartbeat from core client for 30 sec - exiting 06:56:47 (2488): No heartbeat from core client for 30 sec - exiting 06:56:48 (2488): No heartbeat from core client for 30 sec - exiting 06:56:49 (2488): No heartbeat from core client for 30 sec - exiting 06:56:50 (2488): No heartbeat from core client for 30 sec - exiting 06:56:51 (2488): No heartbeat from core client for 30 sec - exiting 06:56:52 (2488): No heartbeat from core client for 30 sec - exiting 06:56:53 (2488): No heartbeat from core client for 30 sec - exiting 06:59:24 (2284): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:59:25 (2284): No heartbeat from core client for 30 sec - exiting 07:00:07 (1100): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:01:00 (3024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:01:01 (3024): No heartbeat from core client for 30 sec - exiting 07:01:02 (3024): No heartbeat from core client for 30 sec - exiting 07:01:03 (3024): No heartbeat from core client for 30 sec - exiting 07:01:04 (3024): No heartbeat from core client for 30 sec - exiting 09:32:01 (1400): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:32:58 (3924): No heartbeat from core client for 30 sec - exiting 09:32:59 (3924): No heartbeat from core client for 30 sec - exiting 09:33:00 (3924): No heartbeat from core client for 30 sec - exiting 09:33:01 (3924): No heartbeat from core client for 30 sec - exiting 09:33:02 (3924): No heartbeat from core client for 30 sec - exiting 09:33:03 (3924): No heartbeat from core client for 30 sec - exiting 09:33:04 (3924): No heartbeat from core client for 30 sec - exiting 09:33:05 (3924): No heartbeat from core client for 30 sec - exiting 09:33:06 (3924): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 09:36:11 (3772): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 09:38:24 (2828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 12:08:14 (3896): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:08:15 (3896): No heartbeat from core client for 30 sec - exiting 12:08:16 (3896): No heartbeat from core client for 30 sec - exiting 12:08:17 (3896): No heartbeat from core client for 30 sec - exiting 12:08:18 (3896): No heartbeat from core client for 30 sec - exiting 12:08:19 (3896): No heartbeat from core client for 30 sec - exiting cpdnmonitor: cannot open input file D:\BOINC/projects/climateprediction.net/famous_v2uw_1799_200_006731145/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy cpdnmonitor: cannot open input file D:\BOINC/projects/climateprediction.net/famous_v2uw_1799_200_006731145/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy cpdnmonitor: cannot open input file D:\BOINC/projects/climateprediction.net/famous_v2uw_1799_200_006731145/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy cpdnmonitor: cannot open input file D:\BOINC/projects/climateprediction.net/famous_v2uw_1799_200_006731145/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy cpdnmonitor: cannot open input file D:\BOINC/projects/climateprediction.net/famous_v2uw_1799_200_006731145/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy cpdnmonitor: cannot open input file D:\BOINC/projects/climateprediction.net/famous_v2uw_1799_200_006731145/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy Sorry, too many model crashes! :-( 12:08:35 (3248): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
25 Oct 2010 23:02:59 | 1106244 | 11944808 | famous_v2uw_1799_200_006731145_1 | 112,346 | 36,869 | 0.3282 |
25 Oct 2010 23:02:59 | 1106244 | 11944808 | famous_v2uw_1799_200_006731145_1 | 102,986 | 34,392 | 0.3339 |
25 Oct 2010 23:02:59 | 1106244 | 11944808 | famous_v2uw_1799_200_006731145_1 | 93,626 | 31,812 | 0.3398 |
25 Oct 2010 23:02:59 | 1106244 | 11944808 | famous_v2uw_1799_200_006731145_1 | 84,266 | 28,974 | 0.3438 |
25 Oct 2010 23:02:59 | 1106244 | 11944808 | famous_v2uw_1799_200_006731145_1 | 74,906 | 26,120 | 0.3487 |
25 Oct 2010 23:02:59 | 1106244 | 11944808 | famous_v2uw_1799_200_006731145_1 | 65,546 | 23,258 | 0.3548 |
25 Oct 2010 23:02:59 | 1106244 | 11944808 | famous_v2uw_1799_200_006731145_1 | 56,186 | 20,409 | 0.3632 |
25 Oct 2010 23:02:59 | 1106244 | 11944808 | famous_v2uw_1799_200_006731145_1 | 46,826 | 17,382 | 0.3712 |
25 Oct 2010 23:02:59 | 1106244 | 11944808 | famous_v2uw_1799_200_006731145_1 | 37,466 | 14,177 | 0.3784 |
25 Oct 2010 03:14:39 | 1106244 | 11944808 | famous_v2uw_1799_200_006731145_1 | 28,106 | 10,987 | 0.3909 |
25 Oct 2010 03:14:39 | 1106244 | 11944808 | famous_v2uw_1799_200_006731145_1 | 18,746 | 7,638 | 0.4074 |
25 Oct 2010 03:14:39 | 1106244 | 11944808 | famous_v2uw_1799_200_006731145_1 | 9,386 | 3,837 | 0.4088 |
©2024 climateprediction.net