Name | famous_ual3_599_200_006646562_5 |
Workunit | 6849934 |
Created | 10 Aug 2010, 4:57:26 UTC |
Sent | 10 Aug 2010, 6:18:28 UTC |
Report deadline | 9 Nov 2010, 13:45:39 UTC |
Received | 12 Aug 2010, 19:28:07 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 975011 |
Run time | 1 days 7 hours 4 min 32 sec |
CPU time | 20 hours 12 min 21 sec |
Validate state | Invalid |
Credit | 339.78 |
Device peak FLOPS | 2.12 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.56</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:12:52 (7200): No heartbeat from core client for 30 sec - exiting 09:12:53 (7200): No heartbeat from core client for 30 sec - exiting 09:12:54 (7200): No heartbeat from core client for 30 sec - exiting 09:12:55 (7200): No heartbeat from core client for 30 sec - exiting 09:12:56 (7200): No heartbeat from core client for 30 sec - exiting 09:12:57 (7200): No heartbeat from core client for 30 sec - exiting 09:12:58 (7200): No heartbeat from core client for 30 sec - exiting 09:12:59 (7200): No heartbeat from core client for 30 sec - exiting 09:13:00 (7200): No heartbeat from core client for 30 sec - exiting 09:13:01 (7200): No heartbeat from core client for 30 sec - exiting 009:15:36 (6440): No heartbeat from core client for 30 sec - exiting 09:18:12 (6440): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 09:24:16 (5652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:26:32 (3032): No heartbeat from c09:28:32 (4180): No heartbeat from core client for 30 sec - exiting 09:29:35 (4180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:29:59 (4180): No heartbeat from core client for 30 sec - exiting 09:37:30 (7920): No heartbeat from core client for 30 sec - eCPDN Monitor - No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6400, selfPID=6400, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 16:35:14 (7840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 22:26:46 (2876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:26:55 (2876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request frforrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_ual3_599_200_006646562\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. 03:46:48 (4484): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4484, iMonCtr=1 Model crash detected, will try to restart... 03:48:19 (4484): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:53:02 (4484): No heartbeat from core client for 30 sec - exiting forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_ual3_599_200_006646562\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5092, iMonCtr=1 Model crash detected, will try to restart... 03:56:46 (5092): No heartbeat frforrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_ual3_599_200_006646562\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5092, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - No 'heartbeat' from BOINC... forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_ual3_599_200_006646562\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2676, iMonCtr=1 Model crash detected, will try to restart... 03:59:55 (2676): No heartbeatforrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_ual3_599_200_006646562\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. rom core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2676, iMonCtr=1 Model crash detected, will try to restart... 04:01:15 (2676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:01forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_ual3_599_200_006646562\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_ual3_599_200_006646562\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=452, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_ual3_599_200_006646562\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=452, iMonCtr=1 Model crash detected, will try to restart... 04forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_ual3_599_200_006646562\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2332, iMonCtr=1 Model crash detected, will try to restart... 04:09:42 (2332): No heartbeat from cforrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_ual3_599_200_006646562\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. e client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2332, iMonCtr=1 Model crash detected, will try to restart... 04:11:01 (2332): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5464, selfPID=5464, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES tmp/pipe_dummy Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES tmp/pipe_dummy Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES tmp/pipe_dummy Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES tmp/pipe_dummy Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES tmp/pipe_dummy Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES tmp/pipe_dummy Sorry, too many model crashes! :-( 12:25:36 (3880): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
12 Aug 2010 16:27:22 | 975011 | 11649161 | famous_ual3_599_200_006646562_5 | 102,986 | 67,260 | 0.6531 |
12 Aug 2010 14:01:34 | 975011 | 11649161 | famous_ual3_599_200_006646562_5 | 93,626 | 61,479 | 0.6566 |
12 Aug 2010 09:48:42 | 975011 | 11649161 | famous_ual3_599_200_006646562_5 | 84,266 | 55,421 | 0.6577 |
11 Aug 2010 05:43:23 | 975011 | 11649161 | famous_ual3_599_200_006646562_5 | 74,906 | 49,413 | 0.6597 |
11 Aug 2010 02:54:03 | 975011 | 11649161 | famous_ual3_599_200_006646562_5 | 65,546 | 43,283 | 0.6603 |
11 Aug 2010 01:45:59 | 975011 | 11649161 | famous_ual3_599_200_006646562_5 | 56,186 | 36,876 | 0.6563 |
10 Aug 2010 20:17:05 | 975011 | 11649161 | famous_ual3_599_200_006646562_5 | 46,826 | 30,575 | 0.6529 |
10 Aug 2010 15:50:56 | 975011 | 11649161 | famous_ual3_599_200_006646562_5 | 37,466 | 24,348 | 0.6499 |
10 Aug 2010 13:35:18 | 975011 | 11649161 | famous_ual3_599_200_006646562_5 | 28,106 | 18,335 | 0.6524 |
10 Aug 2010 11:16:29 | 975011 | 11649161 | famous_ual3_599_200_006646562_5 | 18,746 | 12,236 | 0.6527 |
10 Aug 2010 08:56:51 | 975011 | 11649161 | famous_ual3_599_200_006646562_5 | 9,386 | 6,063 | 0.6460 |
©2024 climateprediction.net