Name | hadam3p_eu_na3q_2013_1_008807590_0 |
Workunit | 8953568 |
Created | 7 Jul 2014, 17:06:58 UTC |
Sent | 1 Aug 2014, 16:18:06 UTC |
Report deadline | 14 Jul 2015, 21:38:06 UTC |
Received | 3 Aug 2014, 16:30:54 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 1 (0x00000001) Unknown error code |
Computer ID | 1298068 |
Run time | 10 hours 32 min 29 sec |
CPU time | |
Validate state | Invalid |
Credit | 200.38 |
Device peak FLOPS | 3.05 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 i686-apple-darwin |
Stderr | <core_client_version>7.0.65</core_client_version> <![CDATA[ <message> process exited with code 1 (0x1, -255) </message> <stderr_txt> 17:13:30 (16915): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... SIGSEGV: segmentation violation forrtl: No space left on device forrtl: severe (38): error during write, unit 6, file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadam3p_eu_na3q_2013_1_008807590/dataout/xaakg.out Image PC Routine Line Source hadrm3p_eu_um_6.0 00365662 Unknown Unknown Unknown hadrm3p_eu_um_6.0 0036412B Unknown Unknown Unknown hadrm3p_eu_um_6.0 003289AD Unknown Unknown Unknown hadrm3p_eu_um_6.0 002DA7E5 Unknown Unknown Unknown hadrm3p_eu_um_6.0 002D9F47 Unknown Unknown Unknown hadrm3p_eu_um_6.0 0031F669 Unknown Unknown Unknown hadrm3p_eu_um_6.0 00001D4F Unknown Unknown Unknown hadrm3p_eu_um_6.0 00020A67 Unknown Unknown Unknown hadrm3p_eu_um_6.0 0002204D Unknown Unknown Unknown hadrm3p_eu_um_6.0 002B9D2C Unknown Unknown Unknown hadrm3p_eu_um_6.0 0000192B Unknown Unknown Unknown hadrm3p_eu_um_6.0 00001859 Unknown Unknown Unknown Unknown 00000003 Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19731, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... zip I/O error: No space left on device zip error: Could not create output file (../hadam3p_eu_na3q_2013_1_008807590_0_13.zip) Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=30580, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=31757, iMonCtr=2 Signal 3 received, exiting... Signal 3 received, exiting... Called boinc_finish Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=40631, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=43765, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=44488, selfPID=44484, iMonCtr=1 Model crash detected, will try to restart... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=44756, selfPID=44757, iMonCtr=1 Signal 3 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=46004, selfPID=45999, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=47153, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=47418, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=48654, iMonCtr=2 Model crash detected, will try to restart... Signal 3 received, exiting... Signal 3 received, exiting... Called boinc_finish Called boinc_finish Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Signal 3 received, exiting... Called boinc_finish Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=58419, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=62085, iMonCtr=2 Model crash detected, will try to restart... Signal 3 received, exiting... Signal 3 received, exiting... Called boinc_finish Called boinc_finish Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=69900, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=81137, iMonCtr=2 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=86933, selfPID=86934, iMonCtr=1 Signal 3 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=88050, iMonCtr=2 Model crash detected, will try to restart... Signal 3 received, exiting... Called boinc_finish Regional Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=90042, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=90240, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=92579, iMonCtr=2 Signal 3 received, exiting... Signal 3 received, exiting... Called boinc_finish Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=97043, selfPID=97039, iMonCtr=1 Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Called boinc_finish Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=99004, selfPID=99005, iMonCtr=1 Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Signal 3 received, exiting... Called boinc_finish Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2291, iMonCtr=2 Signal 3 received, exiting... Signal 3 received, exiting... Called boinc_finish Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6820, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10500, selfPID=10494, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11185, selfPID=11186, iMonCtr=1 Signal 3 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11218, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12422, selfPID=12415, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12946, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16213, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Signal 3 received, exiting... Called boinc_finish Regional Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Signal 3 received, exiting... Signal 3 received, exiting... Called boinc_finish Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=18148, selfPID=18129, iMonCtr=1 Model crash detected, will try to restart... Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=20051, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=20650, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23224, iMonCtr=2 Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23416, iMonCtr=2 Model crash detected, will try to restart... Signal 3 received, exiting... Signal 3 received, exiting... Called boinc_finish Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=24792, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25630, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=27847, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=28556, iMonCtr=2 Signal 3 received, exiting... Signal 3 received, exiting... Called boinc_finish Called boinc_finish Signal 3 received, exiting... Signal 3 received, exiting... Called boinc_finish Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=35725, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=38272, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=40090, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=42620, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=52631, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=57864, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=58515, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=64489, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=68495, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=70824, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=72868, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=78085, selfPID=78075, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=78085, selfPID=78085, iMonCtr=2 Model crash detected, will try to restart... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=91964, selfPID=91965, iMonCtr=1 Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Called boinc_finish Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=92052, selfPID=92053, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=92052, selfPID=92052, iMonCtr=1 Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Called boinc_finish Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=92185, selfPID=92186, iMonCtr=1 Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Called boinc_finish Signal 3 received, exiting... Signal 3 received, exiting... Called boinc_finish Called boinc_finish Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=99844, selfPID=99845, iMonCtr=1 Signal 3 received, exiting... Called boinc_finish Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8497, selfPID=8498, iMonCtr=1 Signal 3 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8915, selfPID=8909, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=17415, selfPID=17409, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=19979, selfPID=19973, iMonCtr=1 Model crash detected, will try to restart... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=27335, selfPID=27336, iMonCtr=1 Signal 3 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
02 Aug 2014 01:12:49 | 1298068 | 16725065 | hadam3p_eu_na3q_2013_1_008807590_0 | 11,616 | 27,289 | 2.3493 |
©2024 climateprediction.net