Name | hadam3p_eu_l86z_2013_1_008821844_0 |
Workunit | 8967773 |
Created | 8 Jul 2014, 10:16:26 UTC |
Sent | 29 Jul 2014, 3:03:25 UTC |
Report deadline | 11 Jul 2015, 8:23:25 UTC |
Received | 30 Jul 2014, 23:38:43 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1298068 |
Run time | 10 hours 51 min 23 sec |
CPU time | |
Validate state | Invalid |
Credit | 200.38 |
Device peak FLOPS | 3.05 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 i686-apple-darwin |
Stderr | <core_client_version>7.0.65</core_client_version> <![CDATA[ <stderr_txt> 23:23:38 (6388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... SIGSEGV: segmentation violation 00:22:10 (7840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:18:03 (10362): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:16:47 (14715): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:16:26 (19101): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:16:45 (28189): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:42:36 (32791): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:15:34 (45210): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:20:44 (62437): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... forrtl: No space left on device forrtl: severe (38): error during write, unit 6, file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadam3p_eu_l86z_2013_1_008821844/dataout/xaakg.out Image PC Routine Line Source hadrm3p_eu_um_6.0 00365662 Unknown Unknown Unknown hadrm3p_eu_um_6.0 0036412B Unknown Unknown Unknown hadrm3p_eu_um_6.0 003289AD Unknown Unknown Unknown hadrm3p_eu_um_6.0 002DA7E5 Unknown Unknown Unknown hadrm3p_eu_um_6.0 002D9F47 Unknown Unknown Unknown hadrm3p_eu_um_6.0 0031F669 Unknown Unknown Unknown hadrm3p_eu_um_6.0 00001D4F Unknown Unknown Unknown hadrm3p_eu_um_6.0 00020A67 Unknown Unknown Unknown hadrm3p_eu_um_6.0 0002204D Unknown Unknown Unknown hadrm3p_eu_um_6.0 002B9D2C Unknown Unknown Unknown hadrm3p_eu_um_6.0 0000192B Unknown Unknown Unknown hadrm3p_eu_um_6.0 00001859 Unknown Unknown Unknown Unknown 00000003 Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=68742, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=78133, iMonCtr=2 Signal 3 received, exiting... Signal 3 received, exiting... Called boinc_finish Called boinc_finish Signal 3 received, exiting... Signal 3 received, exiting... Called boinc_finish Called boinc_finish Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=86798, selfPID=86799, iMonCtr=1 Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Signal 3 received, exiting... Called boinc_finish Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=95897, iMonCtr=2 Model crash detected, will try to restart... Signal 3 received, exiting... Signal 3 received, exiting... Called boinc_finish Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=99744, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13205, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16979, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17569, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21340, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23284, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=24657, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25915, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=26834, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=26944, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=29304, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=30921, iMonCtr=2 forrtl: No space left on device forrtl: severe (38): error during write, unit 8, file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadam3p_eu_l86z_2013_1_008821844/tmp/xaakg.pipe_dummy Image PC Routine Line Source hadrm3p_eu_um_6.0 00365662 Unknown Unknown Unknown hadrm3p_eu_um_6.0 0036412B Unknown Unknown Unknown hadrm3p_eu_um_6.0 003289AD Unknown Unknown Unknown hadrm3p_eu_um_6.0 002DA7E5 Unknown Unknown Unknown hadrm3p_eu_um_6.0 002D9F47 Unknown Unknown Unknown hadrm3p_eu_um_6.0 0031B0C7 Unknown Unknown Unknown hadrm3p_eu_um_6.0 0031899C Unknown Unknown Unknown hadrm3p_eu_um_6.0 00021A51 Unknown Unknown Unknown hadrm3p_eu_um_6.0 002B9D2C Unknown Unknown Unknown hadrm3p_eu_um_6.0 0000192B Unknown Unknown Unknown hadrm3p_eu_um_6.0 00001859 Unknown Unknown Unknown Unknown 00000003 Unknown Unknown Unknown forrtl: No space left on device forrtl: severe (38): error during write, unit 6, file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadam3p_eu_l86z_2013_1_008821844/dataout/xaakm.out Image PC Routine Line Source hadam3p_eu_um_6.0 00340DC2 Unknown Unknown Unknown hadam3p_eu_um_6.0 0033F88B Unknown Unknown Unknown hadam3p_eu_um_6.0 0030427D Unknown Unknown Unknown hadam3p_eu_um_6.0 002B88E5 Unknown Unknown Unknown hadam3p_eu_um_6.0 002B8047 Unknown Unknown Unknown hadam3p_eu_um_6.0 002F6BD2 Unknown Unknown Unknown hadam3p_eu_um_6.0 002F426C Unknown Unknown Unknown hadam3p_eu_um_6.0 002675CB Unknown Unknown Unknown hadam3p_eu_um_6.0 0021D485 Unknown Unknown Unknown hadam3p_eu_um_6.0 00290029 Unknown Unknown Unknown hadam3p_eu_um_6.0 002992D1 Unknown Unknown Unknown hadam3p_eu_um_6.0 000025DB Unknown Unknown Unknown hadam3p_eu_um_6.0 00002509 Unknown Unknown Unknown Unknown 0000000A Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=32578, selfPID=32575, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_l86z_2013_1_008821844_0_2.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l86z_2013_1_008821844_0_3.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l86z_2013_1_008821844_0_4.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l86z_2013_1_008821844_0_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l86z_2013_1_008821844_0_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l86z_2013_1_008821844_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l86z_2013_1_008821844_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l86z_2013_1_008821844_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l86z_2013_1_008821844_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l86z_2013_1_008821844_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_l86z_2013_1_008821844_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
29 Jul 2014 12:10:44 | 1298068 | 16740494 | hadam3p_eu_l86z_2013_1_008821844_0 | 11,616 | 28,274 | 2.4341 |
©2024 climateprediction.net