Name | hadam3p_pnw_yx6f_1998_1_006898063_0 |
Workunit | 7101379 |
Created | 20 Nov 2010, 13:08:06 UTC |
Sent | 24 Apr 2011, 11:04:05 UTC |
Report deadline | 5 Apr 2012, 16:24:05 UTC |
Received | 18 Jun 2011, 0:16:12 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 859116 |
Run time | 8 days 15 hours 18 min 46 sec |
CPU time | 7 days 20 hours 4 min 8 sec |
Validate state | Workunit error - check skipped |
Credit | 3,003.83 |
Device peak FLOPS | 1.86 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>6.6.36</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3496, selfPID=2700, iMonCtr=1 Model crash detected, will try to restart... 17:53:04 (2368): No heartbeat from core client for 30 sec - exiting 17:53:05 (2368): No heartbeat from core client for 30 sec - exiting 17:53:06 (2368): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4028, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4508, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3772, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=740, selfPID=3932, iMonCtr=1 Model crash detected, will try to restart... C19:35:28 (1752): No heartbeat from core client for 30 sec - exiting 19:35:33 (1752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2364, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 1 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3172, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 
19:04:50 (1364): No heartbeat from core client for 30 sec - exiting 19:04:52 (1364): No heartbeat from core client for 30 sec - exiting 19:04:53 (1364): No heartbeat from core client for 30 sec - exiting 19:04:54 (1364): No heartbeat from core client for 30 sec - exiting 19:04:55 (1364): No heartbeat from core client for 30 sec - exiting 19:04:56 (1364): No heartbeat from core client for 30 sec - exiting 19:04:57 (1364): No heartbeat from core client for 30 sec - exiting 19:04:58 (1364): No heartbeat from core client for 30 sec - exiting 19:04:59 (1364): No heartbeat from core client for 30 sec - exiting 19:05:00 (1364): No heartbeat from core client for 30 sec - exiting 19:05:01 (1364): No heartbeat from core client for 30 sec - exiting 19:05:02 (1364): No heartbeat from core client for 30 sec - exiting 19:05:03 (1364): No heartbeat from core client for 30 sec - exiting 19:05:04 (1364): No heartbeat from core client for 30 sec - exiting 19:05:05 (1364): No heartbeat from core client for 30 sec - exiting 19:05:06 (1364): No heartbeat from core client for 30 sec - exiting 19:05:07 (1364): No heartbeat from core client for 30 sec - exiting 19:05:08 (1364): No heartbeat from core client for 30 sec - exiting 19:05:09 (1364): No heartbeat from core client for 30 sec - exiting 19:05:10 (1364): No heartbeat from core client for 30 sec - exiting 19:05:11 (1364): No heartbeat from core client for 30 sec - exiting 19:05:12 (1364): No heartbeat from core client for 30 sec - exiting 19:05:13 (1364): No heartbeat from core client for 30 sec - exiting 19:05:14 (1364): No heartbeat from core client for 30 sec - exiting 19:05:15 (1364): No heartbeat from core client for 30 sec - exiting 19:05:16 (1364): No heartbeat from core client for 30 sec - exiting 19:05:17 (1364): No heartbeat from core client for 30 sec - exiting 19:05:18 (1364): No heartbeat from core client for 30 sec - exiting 19:05:19 (1364): No heartbeat from core client for 30 sec - exiting 19:05:20 (1364): No 
heartbeat from core client for 30 sec - exiting 19:05:21 (1364): No heartbeat from core client for 30 sec - exiting 19:05:22 (1364): No heartbeat from core client for 30 sec - exiting 19:05:23 (1364): No heartbeat from core client for 30 sec - exiting 19:05:24 (1364): No heartbeat from core client for 30 sec - exiting 19:05:25 (1364): No heartbeat from core client for 30 sec - exiting 19:05:26 (1364): No heartbeat from core client for 30 sec - exiting 19:05:27 (1364): No heartbeat from core client for 30 sec - exiting 19:05:28 (1364): No heartbeat from core client for 30 sec - exiting 19:05:29 (1364): No heartbeat from core client for 30 sec - exiting 19:05:30 (1364): No heartbeat from core client for 30 sec - exiting 19:05:31 (1364): No heartbeat from core client for 30 sec - exiting 19:05:32 (1364): No heartbeat from core client for 30 sec - exiting 19:05:33 (1364): No heartbeat from core client for 30 sec - exiting 19:05:34 (1364): No heartbeat from core client for 30 sec - exiting 19:05:35 (1364): No heartbeat from core client for 30 sec - exiting 19:05:36 (1364): No heartbeat from core client for 30 sec - exiting 19:05:37 (1364): No heartbeat from core client for 30 sec - exiting 19:05:38 (1364): No heartbeat from core client for 30 sec - exiting 19:05:39 (1364): No heartbeat from core client for 30 sec - exiting 19:05:40 (1364): No heartbeat from core client for 30 sec - exiting 19:05:41 (1364): No heartbeat from core client for 30 sec - exiting 19:05:42 (1364): No heartbeat from core client for 30 sec - exiting 19:05:43 (1364): No heartbeat from core client for 30 sec - exiting 19:05:44 (1364): No heartbeat from core client for 30 sec - exiting 19:05:45 (1364): No heartbeat from core client for 30 sec - exiting 19:05:46 (1364): No heartbeat from core client for 30 sec - exiting 19:05:47 (1364): No heartbeat from core client for 30 sec - exiting 19:05:48 (1364): No heartbeat from core client for 30 sec - exiting 19:05:49 (1364): No heartbeat from core client 
for 30 sec - exiting 19:05:50 (1364): No heartbeat from core client for 30 sec - exiting 19:05:51 (1364): No heartbeat from core client for 30 sec - exiting 19:05:52 (1364): No heartbeat from core client for 30 sec - exiting 19:05:53 (1364): No heartbeat from core client for 30 sec - exiting 19:05:54 (1364): No heartbeat from core client for 30 sec - exiting 19:05:55 (1364): No heartbeat from core client for 30 sec - exiting 19:05:56 (1364): No heartbeat from core client for 30 sec - exiting 19:05:57 (1364): No heartbeat from core client for 30 sec - exiting 19:05:58 (1364): No heartbeat from core client for 30 sec - exiting 19:05:59 (1364): No heartbeat from core client for 30 sec - exiting 19:06:00 (1364): No heartbeat from core client for 30 sec - exiting 19:06:01 (1364): No heartbeat from core client for 30 sec - exiting 19:06:02 (1364): No heartbeat from core client for 30 sec - exiting 19:06:03 (1364): No heartbeat from core client for 30 sec - exiting 19:06:04 (1364): No heartbeat from core client for 30 sec - exiting 19:06:05 (1364): No heartbeat from core client for 30 sec - exiting 19:06:06 (1364): No heartbeat from core client for 30 sec - exiting 19:06:07 (1364): No heartbeat from core client for 30 sec - exiting 19:06:08 (1364): No heartbeat from core client for 30 sec - exiting 19:06:09 (1364): No heartbeat from core client for 30 sec - exiting 19:06:10 (1364): No heartbeat from core client for 30 sec - exiting 19:06:11 (1364): No heartbeat from core client for 30 sec - exiting 19:06:12 (1364): No heartbeat from core client for 30 sec - exiting 19:06:13 (1364): No heartbeat from core client for 30 sec - exiting 19:06:14 (1364): No heartbeat from core client for 30 sec - exiting 19:06:15 (1364): No heartbeat from core client for 30 sec - exiting 19:06:16 (1364): No heartbeat from core client for 30 sec - exiting 19:06:17 (1364): No heartbeat from core client for 30 sec - exiting 19:06:18 (1364): No heartbeat from core client for 30 sec - exiting CPDN 
Monitor - No 'heartbeat' from BOINC... 19:06:19 (1364): No heartbeat from core client for 30 sec - exiting 19:06:20 (1364): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1436, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4668, selfPID=2712, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1388, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2960, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=848, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4524, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=488, iMonCtr=2 Model crash detected, will try to restart... 07:56:51 (2516): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4532, selfPID=5796, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3684, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2344, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4584, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4592, selfPID=3976, iMonCtr=1 Model crash detected, will try to restart... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2600, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1812, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2028, selfPID=4088, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3320, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2776, selfPID=2688, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2420, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1428, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3084, selfPID=3716, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4092, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2508, selfPID=3584, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3724, selfPID=3724, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4360, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4128, iMonCtr=2 Model crash detected, will try to restart... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2152, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2368, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2684, selfPID=960, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3528, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3844, selfPID=644, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1512, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5824, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4292, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1292, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2584, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5476, selfPID=4716, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]> |
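The Run time and CPU time fields above are printed as day/hour/minute/second strings, which makes them awkward to compare with the per-trickle CPU times reported in seconds. The following is a minimal sketch, not part of the project's tooling, that converts the two strings to seconds and computes their ratio. The helper name `duration_to_seconds` and the unit table are hypothetical, and the parser assumes only the plural unit spellings that appear on this page.

```python
import re

# Seconds per unit, using the unit spellings that appear on this page (assumption:
# singular forms such as "day" or "hour" are not handled).
UNIT_SECONDS = {"days": 86_400, "hours": 3_600, "min": 60, "sec": 1}


def duration_to_seconds(text: str) -> int:
    """Convert a string like '8 days 15 hours 18 min 46 sec' to whole seconds."""
    total = 0
    for value, unit in re.findall(r"(\d+)\s+(days|hours|min|sec)", text):
        total += int(value) * UNIT_SECONDS[unit]
    return total


# Values copied from the "Run time" and "CPU time" rows.
run_time = duration_to_seconds("8 days 15 hours 18 min 46 sec")
cpu_time = duration_to_seconds("7 days 20 hours 4 min 8 sec")

print(f"run time: {run_time} s")
print(f"CPU time: {cpu_time} s")
print(f"CPU time / run time: {cpu_time / run_time:.3f}")
```

On these figures the CPU time is roughly 91% of the run time.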
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
19 Jun 2011 21:58:12 | 859116 | 12173094 | hadam3p_pnw_yx6f_1998_1_006898063_0 | 138,242 | 675,125 | 4.8836 |
16 Jun 2011 23:55:24 | 859116 | 12173094 | hadam3p_pnw_yx6f_1998_1_006898063_0 | 138,240 | 674,385 | 4.8784 |
12 Jun 2011 13:39:57 | 859116 | 12173094 | hadam3p_pnw_yx6f_1998_1_006898063_0 | 126,720 | 616,110 | 4.8620 |
06 Jun 2011 14:19:15 | 859116 | 12173094 | hadam3p_pnw_yx6f_1998_1_006898063_0 | 115,296 | 559,205 | 4.8502 |
04 Jun 2011 23:31:49 | 859116 | 12173094 | hadam3p_pnw_yx6f_1998_1_006898063_0 | 103,776 | 507,190 | 4.8874 |
02 Jun 2011 22:48:38 | 859116 | 12173094 | hadam3p_pnw_yx6f_1998_1_006898063_0 | 92,256 | 452,566 | 4.9055 |
30 May 2011 15:41:04 | 859116 | 12173094 | hadam3p_pnw_yx6f_1998_1_006898063_0 | 80,736 | 399,418 | 4.9472 |
28 May 2011 12:50:48 | 859116 | 12173094 | hadam3p_pnw_yx6f_1998_1_006898063_0 | 69,216 | 347,706 | 5.0235 |
21 May 2011 21:16:43 | 859116 | 12173094 | hadam3p_pnw_yx6f_1998_1_006898063_0 | 57,696 | 294,658 | 5.1071 |
17 May 2011 22:16:56 | 859116 | 12173094 | hadam3p_pnw_yx6f_1998_1_006898063_0 | 46,176 | 239,540 | 5.1875 |
14 May 2011 13:44:10 | 859116 | 12173094 | hadam3p_pnw_yx6f_1998_1_006898063_0 | 34,656 | 181,139 | 5.2268 |
08 May 2011 17:08:17 | 859116 | 12173094 | hadam3p_pnw_yx6f_1998_1_006898063_0 | 23,136 | 121,419 | 5.2481 |
01 May 2011 19:00:46 | 859116 | 12173094 | hadam3p_pnw_yx6f_1998_1_006898063_0 | 11,616 | 63,324 | 5.4514 |
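The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. That relationship is inferred from the numbers in the table, not stated by the page. A short sketch that checks it against four of the rows above (the function name `seconds_per_timestep` is hypothetical):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
TRICKLES = [
    (138_242, 675_125, 4.8836),
    (138_240, 674_385, 4.8784),
    (126_720, 616_110, 4.8620),
    (11_616, 63_324, 5.4514),
]


def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per cumulative timestep, rounded to 4 places."""
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in TRICKLES:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    print(f"{timestep:>7,} steps: computed {computed:.4f}, reported {reported:.4f}")
```

Because both inputs are cumulative, the column is a running average over the whole task, which would explain why it changes so slowly from one trickle to the next.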
©2024 climateprediction.net