Name | hadam3p_pnw_q5z5_2013_1_010029582_2 |
Workunit | 10027644 |
Created | 25 Jul 2015, 11:24:20 UTC |
Sent | 28 Jul 2015, 18:52:16 UTC |
Report deadline | 10 Jul 2016, 0:12:16 UTC |
Received | 18 Aug 2015, 16:24:40 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1329645 |
Run time | 5 days 22 hours 43 min 49 sec |
CPU time | 8 hours 47 min 49 sec |
Validate state | Invalid |
Credit | 1,758.71 |
Device peak FLOPS | 2.98 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v7.27 windows_intelx86 |
Stderr | <core_client_version>7.4.27</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4408, selfPID=4568, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5896, selfPID=4276, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5976, selfPID=5344, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1420, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=424, selfPID=4776, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5604, selfPID=4048, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4892, selfPID=5076, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3168, selfPID=3848, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5800, selfPID=1596, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5720, selfPID=4128, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6012, selfPID=3280, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5240, selfPID=4312, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4116, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4120, selfPID=3836, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4800, selfPID=840, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 23:06:46 (840): called boinc_finish(0) Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4264, selfPID=4100, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 16:52:22 (4100): called boinc_finish(0) Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5640, selfPID=4780, iMonCtr=1 Model crash detected, will try to restart... CGntrollerl:ba CPDN pr:: CPDN process is not running,bRetVal = 1, chec = 1, checkPID=0, selfPID=2980, iMonCtr=2 r=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5532, selfPID=2564, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5640, selfPID=4688, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
12:21:44 (4688): called boinc_finish(0) </stderr_txt><message> upload failure: <file_xfer_error> <file_name>hadam3p_pnw_q5z5_2013_1_010029582_2_8.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5z5_2013_1_010029582_2_9.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5z5_2013_1_010029582_2_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5z5_2013_1_010029582_2_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5z5_2013_1_010029582_2_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5z5_2013_1_010029582_2_13.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5z5_2013_1_010029582_2_14.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5z5_2013_1_010029582_2_15.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5z5_2013_1_010029582_2_16.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5z5_2013_1_010029582_2_17.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_q5z5_2013_1_010029582_2_18.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
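The Stderr field above repeats two kinds of event: model restarts ("Model crash detected, will try to restart...") and failed uploads (`<file_xfer_error>` entries with code -161). The following is a minimal sketch for tallying them, assuming the stderr text has been copied into a string. The function name `summarize_stderr` and the variable `stderr_excerpt` are hypothetical, and the excerpt is only a short sample of the full log, not the whole field.

```python
import re

# Short excerpt copied from the Stderr field above (assumption: the full
# text would be pasted here instead when analysing the whole log).
stderr_excerpt = (
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
    "checkPID=4408, selfPID=4568, iMonCtr=1 "
    "Model crash detected, will try to restart... "
    "Leaving CPDN_Main::Monitor... "
    "<file_xfer_error> "
    "<file_name>hadam3p_pnw_q5z5_2013_1_010029582_2_8.zip</file_name> "
    "<error_code>-161 (not found)</error_code> </file_xfer_error> "
    "<file_xfer_error> "
    "<file_name>hadam3p_pnw_q5z5_2013_1_010029582_2_9.zip</file_name> "
    "<error_code>-161 (not found)</error_code> </file_xfer_error>"
)


def summarize_stderr(text):
    """Return (restart_count, [(file_name, error_code), ...]) for a stderr string.

    Hypothetical helper for illustration; it relies only on the literal
    phrases and tags visible in the log above.
    """
    # Each restart attempt is announced with this fixed phrase.
    restarts = text.count("Model crash detected")
    # Each failed upload lists a file name followed by a numeric error code.
    pairs = re.findall(
        r"<file_name>([^<]+)</file_name>\s*<error_code>(-?\d+)", text
    )
    return restarts, [(name, int(code)) for name, code in pairs]


restarts, failed_uploads = summarize_stderr(stderr_excerpt)
print(restarts, failed_uploads)
```

Run against the complete Stderr text, the same function would report every restart and every missing upload file in one pass.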
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
12 Aug 2015 12:45:11 | 1329645 | 18738711 | hadam3p_pnw_q5z5_2013_1_010029582_2 | 80,939 | 355,037 | 4.3865 |
11 Aug 2015 12:52:13 | 1329645 | 18738711 | hadam3p_pnw_q5z5_2013_1_010029582_2 | 69,419 | 298,347 | 4.2978 |
09 Aug 2015 22:17:49 | 1329645 | 18738711 | hadam3p_pnw_q5z5_2013_1_010029582_2 | 57,899 | 251,238 | 4.3392 |
05 Aug 2015 21:12:52 | 1329645 | 18738711 | hadam3p_pnw_q5z5_2013_1_010029582_2 | 46,379 | 205,525 | 4.4314 |
04 Aug 2015 09:00:28 | 1329645 | 18738711 | hadam3p_pnw_q5z5_2013_1_010029582_2 | 34,859 | 160,909 | 4.6160 |
30 Jul 2015 20:27:57 | 1329645 | 18738711 | hadam3p_pnw_q5z5_2013_1_010029582_2 | 23,339 | 113,469 | 4.8618 |
29 Jul 2015 14:51:42 | 1329645 | 18738711 | hadam3p_pnw_q5z5_2013_1_010029582_2 | 11,819 | 57,452 | 4.8610 |
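
The Average (sec/TS) column appears to be CPU Time (sec) divided by Timestep. The short check below recomputes it from the Timestep and CPU Time values in the trickle table. It is a sketch under that assumption, and the names `trickles` and `average_sec_per_ts` are hypothetical.

```python
# (timestep, cpu_time_sec, listed_average) copied from the trickle table above.
trickles = [
    (80939, 355037, 4.3865),
    (69419, 298347, 4.2978),
    (57899, 251238, 4.3392),
    (46379, 205525, 4.4314),
    (34859, 160909, 4.6160),
    (23339, 113469, 4.8618),
    (11819, 57452, 4.8610),
]


def average_sec_per_ts(timestep, cpu_time_sec):
    """Cumulative CPU seconds per model timestep (assumed column definition)."""
    return cpu_time_sec / timestep


# Recompute each row and collect the difference from the listed value.
differences = [
    abs(average_sec_per_ts(ts, cpu) - listed) for ts, cpu, listed in trickles
]

for (ts, cpu, listed), diff in zip(trickles, differences):
    print(f"{ts:>6} {cpu:>7} listed={listed:.4f} "
          f"computed={average_sec_per_ts(ts, cpu):.4f}")
```

Each recomputed value agrees with the listed one to four decimal places, which supports reading the column as a cumulative average rather than a per-interval rate.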
©2024 climateprediction.net