Name | famous_ufe2_799_200_006652789_6 |
Workunit | 6856161 |
Created | 23 Aug 2010, 18:54:59 UTC |
Sent | 23 Aug 2010, 20:25:03 UTC |
Report deadline | 23 Nov 2010, 3:52:14 UTC |
Received | 30 Sep 2010, 4:55:05 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 955414 |
Run time | 17 days 20 hours 29 min 20 sec |
CPU time | 15 days 23 hours 4 min 53 sec |
Validate state | Invalid |
Credit | 5,435.25 |
Device peak FLOPS | 1.13 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.6.20</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6484, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5752, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4920, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6152, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6624, iMonCtr=1 Model crash detected, will try to restart... 09:32:18 (4468): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exitinCPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2624, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4796, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7124, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4808, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4512, iMonCtr=1 Model crash detected, will try to restart... 12:52:06 (3828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5560, iMonCtr=1 Model crash detected, will try to restart... CCPDN Monitor - Quit request from BOINC... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Sorry, too many model crashes! :-( 23:43:44 (5464): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
29 Sep 2010 22:51:29 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,647,386 | 1,371,779 | 0.8327 |
29 Sep 2010 20:05:46 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,638,026 | 1,363,205 | 0.8322 |
29 Sep 2010 17:42:24 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,628,666 | 1,355,362 | 0.8322 |
29 Sep 2010 15:23:40 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,619,306 | 1,347,527 | 0.8322 |
29 Sep 2010 03:37:27 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,609,946 | 1,339,641 | 0.8321 |
29 Sep 2010 03:18:36 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,600,586 | 1,331,684 | 0.8320 |
28 Sep 2010 23:01:35 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,591,226 | 1,323,768 | 0.8319 |
28 Sep 2010 20:21:28 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,581,866 | 1,315,870 | 0.8318 |
28 Sep 2010 18:01:17 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,572,506 | 1,308,065 | 0.8318 |
28 Sep 2010 15:30:35 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,563,146 | 1,300,236 | 0.8318 |
28 Sep 2010 13:10:31 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,553,786 | 1,292,422 | 0.8318 |
28 Sep 2010 04:02:29 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,544,426 | 1,284,525 | 0.8317 |
28 Sep 2010 03:45:18 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,535,066 | 1,276,611 | 0.8316 |
27 Sep 2010 23:29:47 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,525,706 | 1,268,818 | 0.8316 |
27 Sep 2010 21:13:54 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,516,346 | 1,261,056 | 0.8316 |
27 Sep 2010 18:36:51 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,506,986 | 1,253,185 | 0.8316 |
27 Sep 2010 16:16:51 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,497,626 | 1,245,374 | 0.8316 |
27 Sep 2010 13:57:04 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,488,266 | 1,237,529 | 0.8315 |
27 Sep 2010 03:57:32 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,478,906 | 1,229,579 | 0.8314 |
27 Sep 2010 02:44:48 | 955414 | 11671662 | famous_ufe2_799_200_006652789_6 | 1,469,546 | 1,221,671 | 0.8313 |
©2024 climateprediction.net