Name | hadam3p_anz_a1nb_2013_1_009460438_0 |
Workunit | 9542672 |
Created | 14 Jan 2015, 10:44:43 UTC |
Sent | 20 Jan 2015, 17:11:14 UTC |
Report deadline | 2 Jan 2016, 22:31:14 UTC |
Received | 6 Feb 2015, 17:21:13 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1327399 |
Run time | 3 days 4 hours 13 min 43 sec |
CPU time | 2 days 19 hours 57 min 30 sec |
Validate state | Invalid |
Credit | 2,000.18 |
Device peak FLOPS | 3.25 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:05:34 (1784): No heartbeat from core client for 30 sec - exiting 12:05:35 (1784): No heartbeat from core client for 30 sec - exiting 12:05:36 (1784): No heartbeat from core client for 30 sec - exiting 12:05:37 (1784): No heartbeat from core client for 30 sec - exiting 12:05:38 (1784): No heartbeat from core client for 30 sec - exiting 12:05:39 (1784): No heartbeat from core client for 30 sec - exiting 12:05:40 (1784): No heartbeat from core client for 30 sec - exiting 12:05:41 (1784): No heartbeat from core client for 30 sec - exiting 12:05:42 (1784): No heartbeat from core client for 30 sec - exiting 12:05:43 (1784): No heartbeat from core client for 30 sec - exiting 12:05:44 (1784): No heartbeat from core client for 30 sec - exiting 12:05:45 (1784): No heartbeat from core client for 30 sec - exiting 12:05:46 (1784): No heartbeat from core client for 30 sec - exiting 12:05:47 (1784): No heartbeat from core client for 30 sec - exiting 12:05:48 (1784): No heartbeat from core client for 30 sec - exiting 12:05:49 (1784): No heartbeat from core client for 30 sec - exiting 12:05:50 (1784): No heartbeat from core client for 30 sec - exiting 12:05:51 (1784): No heartbeat from core client for 30 sec - exiting 12:05:52 (1784): No heartbeat from core client for 30 sec - exiting 12:05:53 (1784): No heartbeat from core client for 30 sec - exiting 12:05:54 (1784): No heartbeat from core client for 30 sec - exiting 12:05:55 (1784): No heartbeat from core client for 30 sec - exiting 12:05:56 (1784): No heartbeat from core client for 30 sec - exiting 12:05:57 (1784): No heartbeat from core client for 30 sec - exiting 12:05:58 (1784): No heartbeat from core client for 30 sec - 
exiting 12:05:59 (1784): No heartbeat from core client for 30 sec - exiting 12:06:00 (1784): No heartbeat from core client for 30 sec - exiting 12:06:01 (1784): No heartbeat from core client for 30 sec - exiting 12:06:02 (1784): No heartbeat from core client for 30 sec - exiting 12:06:03 (1784): No heartbeat from core client for 30 sec - exiting 12:06:04 (1784): No heartbeat from core client for 30 sec - exiting 12:06:05 (1784): No heartbeat from core client for 30 sec - exiting 12:06:06 (1784): No heartbeat from core client for 30 sec - exiting 12:06:07 (1784): No heartbeat from core client for 30 sec - exiting 12:06:08 (1784): No heartbeat from core client for 30 sec - exiting 12:06:09 (1784): No heartbeat from core client for 30 sec - exiting 12:06:10 (1784): No heartbeat from core client for 30 sec - exiting 12:06:11 (1784): No heartbeat from core client for 30 sec - exiting 12:06:12 (1784): No heartbeat from core client for 30 sec - exiting 12:06:13 (1784): No heartbeat from core client for 30 sec - exiting 12:06:14 (1784): No heartbeat from core client for 30 sec - exiting 12:06:15 (1784): No heartbeat from core client for 30 sec - exiting 12:06:16 (1784): No heartbeat from core client for 30 sec - exiting 12:06:17 (1784): No heartbeat from core client for 30 sec - exiting 12:06:18 (1784): No heartbeat from core client for 30 sec - exiting 12:06:19 (1784): No heartbeat from core client for 30 sec - exiting 12:06:20 (1784): No heartbeat from core client for 30 sec - exiting 12:06:21 (1784): No heartbeat from core client for 30 sec - exiting 12:06:22 (1784): No heartbeat from core client for 30 sec - exiting 12:06:23 (1784): No heartbeat from core client for 30 sec - exiting 12:06:24 (1784): No heartbeat from core client for 30 sec - exiting 12:06:25 (1784): No heartbeat from core client for 30 sec - exiting 12:06:26 (1784): No heartbeat from core client for 30 sec - exiting 12:06:27 (1784): No heartbeat from core client for 30 sec - exiting 12:06:28 (1784): No 
heartbeat from core client for 30 sec - exiting 12:06:29 (1784): No heartbeat from core client for 30 sec - exiting 12:06:30 (1784): No heartbeat from core client for 30 sec - exiting 12:06:31 (1784): No heartbeat from core client for 30 sec - exiting 12:06:32 (1784): No heartbeat from core client for 30 sec - exiting 12:06:33 (1784): No heartbeat from core client for 30 sec - exiting 12:06:34 (1784): No heartbeat from core client for 30 sec - exiting 12:06:35 (1784): No heartbeat from core client for 30 sec - exiting 12:06:36 (1784): No heartbeat from core client for 30 sec - exiting 12:06:37 (1784): No heartbeat from core client for 30 sec - exiting 12:06:38 (1784): No heartbeat from core client for 30 sec - exiting 12:06:39 (1784): No heartbeat from core client for 30 sec - exiting 12:06:40 (1784): No heartbeat from core client for 30 sec - exiting 12:06:41 (1784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 08:02:07 (1456): No heartbeat from core client for 30 sec - exiting 08:02:08 (1456): No heartbeat from core client for 30 sec - exiting 08:02:09 (1456): No heartbeat from core client for 30 sec - exiting 08:02:10 (1456): No heartbeat from core client for 30 sec - exiting 08:02:11 (1456): No heartbeat from core client for 30 sec - exiting 08:02:12 (1456): No heartbeat from core client for 30 sec - exiting 08:02:13 (1456): No heartbeat from core client for 30 sec - exiting 08:02:14 (1456): No heartbeat from core client for 30 sec - exiting 08:02:15 (1456): No heartbeat from core client for 30 sec - exiting 08:02:16 (1456): No heartbeat from core client for 30 sec - exiting 08:02:17 (1456): No heartbeat from core client for 30 sec - exiting 08:02:18 (1456): No heartbeat from core client for 30 sec - exiting 08:02:19 (1456): No heartbeat from core client for 30 sec - exiting 08:02:20 (1456): No heartbeat from core client for 30 sec - exiting 08:02:21 (1456): No heartbeat from core client for 30 sec - exiting 08:02:22 (1456): No heartbeat from core client for 30 sec - exiting 08:02:23 (1456): No heartbeat from core client for 30 sec - exiting 08:02:24 (1456): No heartbeat from core client for 30 sec - exiting 08:02:25 (1456): No heartbeat from core client for 30 sec - exiting 08:02:26 (1456): No heartbeat from core client for 30 sec - exiting 08:02:27 (1456): No heartbeat from core client for 30 sec - exiting 08:02:28 (1456): No heartbeat from core client for 30 sec - exiting 08:02:29 (1456): No heartbeat from core client for 30 sec - exiting 08:02:30 
(1456): No heartbeat from core client for 30 sec - exiting 08:02:31 (1456): No heartbeat from core client for 30 sec - exiting 08:02:32 (1456): No heartbeat from core client for 30 sec - exiting 08:02:33 (1456): No heartbeat from core client for 30 sec - exiting 08:02:34 (1456): No heartbeat from core client for 30 sec - exiting 08:02:35 (1456): No heartbeat from core client for 30 sec - exiting 08:02:36 (1456): No heartbeat from core client for 30 sec - exiting 08:02:37 (1456): No heartbeat from core client for 30 sec - exiting 08:02:38 (1456): No heartbeat from core client for 30 sec - exiting 08:02:39 (1456): No heartbeat from core client for 30 sec - exiting 08:02:40 (1456): No heartbeat from core client for 30 sec - exiting 08:02:41 (1456): No heartbeat from core client for 30 sec - exiting 08:02:42 (1456): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:58:02 (8416): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:58:47 (21496): Can't acquire lockfile (32) - waiting 35s CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 16:12:23 (9136): No heartbeat from core client for 30 sec - exiting 16:12:24 (9136): No heartbeat from core client for 30 sec - exiting 16:12:25 (9136): No heartbeat from core client for 30 sec - exiting 16:12:26 (9136): No heartbeat from core client for 30 sec - exiting 16:12:27 (9136): No heartbeat from core client for 30 sec - exiting 16:12:28 (9136): No heartbeat from core client for 30 sec - exiting 16:12:29 (9136): No heartbeat from core client for 30 sec - exiting 16:12:30 (9136): No heartbeat from core client for 30 sec - exiting 16:12:31 (9136): No heartbeat from core client for 30 sec - exiting 16:12:32 (9136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=12112, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12200, selfPID=7748, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_a1nb_2013_1_009460438_0_5.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_a1nb_2013_1_009460438_0_6.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_a1nb_2013_1_009460438_0_7.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_a1nb_2013_1_009460438_0_8.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_a1nb_2013_1_009460438_0_9.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_a1nb_2013_1_009460438_0_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_a1nb_2013_1_009460438_0_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_a1nb_2013_1_009460438_0_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
06 Feb 2015 17:24:36 | 1327399 | 17788560 | hadam3p_anz_a1nb_2013_1_009460438_0 | 46,379 | 227,637 | 4.9082 |
03 Feb 2015 22:58:28 | 1327399 | 17788560 | hadam3p_anz_a1nb_2013_1_009460438_0 | 34,859 | 171,210 | 4.9115 |
29 Jan 2015 16:46:49 | 1327399 | 17788560 | hadam3p_anz_a1nb_2013_1_009460438_0 | 23,339 | 112,281 | 4.8109 |
24 Jan 2015 19:32:12 | 1327399 | 17788560 | hadam3p_anz_a1nb_2013_1_009460438_0 | 11,819 | 56,930 | 4.8168 |
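
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. That relationship is inferred from the figures in the table, not from any project documentation, so the following is a minimal sketch for checking it against the four trickle rows. The function name `sec_per_timestep` and the tuple layout are illustrative only.

```python
# Trickle rows copied from the table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
trickles = [
    (46379, 227637, 4.9082),
    (34859, 171210, 4.9115),
    (23339, 112281, 4.8109),
    (11819, 56930, 4.8168),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep


# Recompute each average and round to the four decimals shown on the page.
computed = [round(sec_per_timestep(cpu, ts), 4) for ts, cpu, _ in trickles]

for (ts, cpu, reported), value in zip(trickles, computed):
    print(f"timestep {ts}: computed {value:.4f}, reported {reported:.4f}")
```

If the assumption holds, each computed value should agree with the reported column to within rounding.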