Name | hadam3p_eu_wsja_1981_1_006880334_0 |
Workunit | 7083650 |
Created | 19 Nov 2010, 14:42:27 UTC |
Sent | 13 Mar 2011, 13:50:02 UTC |
Report deadline | 23 Feb 2012, 19:10:02 UTC |
Received | 3 Apr 2011, 21:54:18 UTC |
Server state | Over |
Outcome | No reply |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 302859 |
Run time | 2 days 1 hours 45 min 13 sec |
CPU time | 1 days 13 hours 51 min 57 sec |
Validate state | Invalid |
Credit | 200.64 |
Device peak FLOPS | 1.18 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.08 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 14:57:34 (3220): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 14:57:37 (3220): No heartbeat from core client for 30 sec - exiting 14:57:38 (3220): No heartbeat from core client for 30 sec - exiting 14:57:39 (3220): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:18:56 (2352): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 07:18:57 (2352): No heartbeat from core client for 30 sec - exiting 07:18:58 (2352): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 13:19:48 (3580): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 13:19:49 (3580): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:25:40 (352): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 16:25:42 (352): No heartbeat from core client for 30 sec - exiting 16:25:43 (352): No heartbeat from core client for 30 sec - exiting 16:25:44 (352): No heartbeat from core client for 30 sec - exiting 16:25:46 (352): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 19:53:57 (2384): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 19:53:58 (2384): No heartbeat from core client for 30 sec - exiting 19:53:59 (2384): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 13:08:17 (992): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 13:08:18 (992): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:00:42 (2100): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 17:00:43 (2100): No heartbeat from core client for 30 sec - exiting 17:00:44 (2100): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:09:56 (348): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 19:09:58 (348): No heartbeat from core client for 30 sec - exiting 19:09:59 (348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 23:22:10 (3464): No heartbeat from core client for 30 sec - exiting 23:22:12 (3464): No heartbeat from core client for 30 sec - exiting 23:22:13 (3464): No heartbeat from core client for 30 sec - exiting 23:22:14 (3464): No heartbeat from core client for 30 sec - exiting 23:22:15 (3464): No heartbeat from core client for 30 sec - exiting 23:22:16 (3464): No heartbeat from core client for 30 sec - exiting 23:22:17 (3464): No heartbeat from core client for 30 sec - exiting 23:22:18 (3464): No heartbeat from core client for 30 sec - exiting 23:22:19 (3464): No heartbeat from core client for 30 sec - exiting 23:22:20 (3464): No heartbeat from core client for 30 sec - exiting 23:22:21 (3464): No heartbeat from core client for 30 sec - exiting 23:22:22 (3464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:01:19 (3256): No heartbeat from core client for 30 sec - exiting 01:01:21 (3256): No heartbeat from core client for 30 sec - exiting 01:01:22 (3256): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:52:44 (2776): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 10:52:46 (2776): No heartbeat from core client for 30 sec - exiting 10:52:47 (2776): No heartbeat from core client for 30 sec - exiting 10:52:54 (2776): No heartbeat from core client for 30 sec - exiting 10:53:00 (2776): No heartbeat from core client for 30 sec - exiting 10:53:07 (2776): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:48:13 (2800): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 20:48:14 (2800): No heartbeat from core client for 30 sec - exiting 20:48:15 (2800): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 05:38:40 (2324): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 13:13:32 (3896): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 13:13:34 (3896): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 23:46:38 (2396): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 23:46:41 (2396): No heartbeat from core client for 30 sec - exiting 23:46:43 (2396): No heartbeat from core client for 30 sec - exiting 23:46:44 (2396): No heartbeat from core client for 30 sec - exiting 23:46:45 (2396): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:33:54 (4036): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 09:33:56 (4036): No heartbeat from core client for 30 sec - exiting 09:33:57 (4036): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:54:10 (2380): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 21:54:12 (2380): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Signal 11 received, exiting... 09:42:26 (3800): called boinc_finish Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3460, selfPID=3460, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3460, selfPID=1920, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 09:43:53 (1920): called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_eu_wsja_1981_1_006880334_0_2.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_wsja_1981_1_006880334_0_3.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_wsja_1981_1_006880334_0_4.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_wsja_1981_1_006880334_0_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_wsja_1981_1_006880334_0_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_wsja_1981_1_006880334_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_wsja_1981_1_006880334_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_wsja_1981_1_006880334_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_wsja_1981_1_006880334_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_wsja_1981_1_006880334_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_wsja_1981_1_006880334_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
27 Mar 2011 21:23:19 | 302859 | 12152769 | hadam3p_eu_wsja_1981_1_006880334_0 | 11,631 | 91,899 | 7.9012 |
27 Mar 2011 03:26:32 | 302859 | 12152769 | hadam3p_eu_wsja_1981_1_006880334_0 | 11,618 | 90,405 | 7.7815 |
27 Mar 2011 00:51:14 | 302859 | 12152769 | hadam3p_eu_wsja_1981_1_006880334_0 | 11,616 | 88,948 | 7.6574 |
©2024 climateprediction.net