|
Name | famous_xnr0_1799_200_007092611_1 |
Workunit | 7295911 |
Created | 23 Dec 2010, 11:51:30 UTC |
Sent | 23 Dec 2010, 11:56:53 UTC |
Report deadline | 24 Mar 2011, 19:24:04 UTC |
Received | 26 Dec 2011, 20:18:38 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 596114 |
Run time | 2 days 9 hours 5 min 46 sec |
CPU time | 2 days 8 hours 8 min 25 sec |
Validate state | Invalid |
Credit | 1,297.11 |
Device peak FLOPS | 2.28 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:44:55 (4324): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1064, iMonCtr=1 Model crash detected, will try to restart... 13:11:03 (5380): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:11:04 (5380): No heartbeat from core client for 30 sec - exiting 13:11:05 (5380): No heartbeat from core client for 30 sec - exiting 13:11:06 (5380): No heartbeat from core client for 30 sec - exiting 13:11:07 (5380): No heartbeat from core client for 30 sec - exiting 17:30:00 (2504): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy Sorry, too many model crashes! :-( 23:40:32 (4460): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
26 Dec 2011 03:59:16 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 393,146 | 199,584 | 0.5077 |
09 Jan 2011 18:25:54 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 383,786 | 194,830 | 0.5077 |
08 Jan 2011 16:19:17 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 374,426 | 189,980 | 0.5074 |
08 Jan 2011 14:42:50 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 365,066 | 185,103 | 0.5070 |
07 Jan 2011 14:13:03 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 355,706 | 180,310 | 0.5069 |
06 Jan 2011 11:56:52 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 346,346 | 175,556 | 0.5069 |
06 Jan 2011 11:56:52 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 336,986 | 170,799 | 0.5068 |
05 Jan 2011 02:44:22 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 327,626 | 166,053 | 0.5068 |
04 Jan 2011 12:59:27 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 318,266 | 161,240 | 0.5066 |
04 Jan 2011 11:28:11 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 308,906 | 156,281 | 0.5059 |
03 Jan 2011 23:28:36 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 299,546 | 151,371 | 0.5053 |
03 Jan 2011 22:07:57 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 290,186 | 146,627 | 0.5053 |
03 Jan 2011 20:45:17 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 280,826 | 141,897 | 0.5053 |
03 Jan 2011 19:27:25 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 271,466 | 137,165 | 0.5053 |
03 Jan 2011 18:35:07 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 262,106 | 132,351 | 0.5050 |
03 Jan 2011 16:44:02 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 252,746 | 127,612 | 0.5049 |
03 Jan 2011 15:21:45 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 243,386 | 122,934 | 0.5051 |
03 Jan 2011 14:02:01 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 234,026 | 118,209 | 0.5051 |
03 Jan 2011 12:38:38 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 224,666 | 113,499 | 0.5052 |
01 Jan 2011 12:04:30 | 596114 | 12443338 | famous_xnr0_1799_200_007092611_1 | 215,306 | 108,768 | 0.5052 |
©2024 climateprediction.net