climateprediction.net home page
Task 14388833

Task 14388833

Name hadam3p_eu_adt4_1977_1_007865662_0
Workunit 8020774
Created 10 Apr 2012, 16:48:23 UTC
Sent 10 Apr 2012, 16:48:35 UTC
Report deadline 23 Mar 2013, 22:08:35 UTC
Received 29 Apr 2012, 6:25:10 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1169010
Run time 7 days 6 hours 35 min 45 sec
CPU time 4 days 3 hours 53 min 45 sec
Validate state Invalid
Credit 1,790.21
Device peak FLOPS 2.46 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
17:58:20 (2556): No heartbeat from core client for 30 sec - exiting
17:58:21 (2556): No heartbeat from core client for 30 sec - exiting
17:58:22 (2556): No heartbeat from core client for 30 sec - exiting
17:58:24 (2556): No heartbeat from core client for 30 sec - exiting
17:58:25 (2556): No heartbeat from core client for 30 sec - exiting
17:58:26 (2556): No heartbeat from core client for 30 sec - exiting
17:58:27 (2556): No heartbeat from core client for 30 sec - exiting
17:58:28 (2556): No heartbeat from core client for 30 sec - exiting
17:58:29 (2556): No heartbeat from core client for 30 sec - exiting
17:58:30 (2556): No heartbeat from core client for 30 sec - exiting
17:58:31 (2556): No heartbeat from core client for 30 sec - exiting
17:58:32 (2556): No heartbeat from core client for 30 sec - exiting
17:58:33 (2556): No heartbeat from core client for 30 sec - exiting
17:58:34 (2556): No heartbeat from core client for 30 sec - exiting
17:58:36 (2556): No heartbeat from core client for 30 sec - exiting
17:58:37 (2556): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:33:26 (3832): No heartbeat from core client for 30 sec - exiting
07:33:27 (3832): No heartbeat from core client for 30 sec - exiting
07:33:28 (3832): No heartbeat from core client for 30 sec - exiting
07:33:29 (3832): No heartbeat from core client for 30 sec - exiting
07:33:30 (3832): No heartbeat from core client for 30 sec - exiting
07:33:31 (3832): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6060, iMonCtr=2
Model crash detected, will try to restart...
08:41:03 (4380): No heartbeat from core client for 30 sec - exiting
08:41:04 (4380): No heartbeat from core client for 30 sec - exiting
08:41:05 (4380): No heartbeat from core client for 30 sec - exiting
08:41:07 (4380): No heartbeat from core client for 30 sec - exiting
08:41:08 (4380): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:10:23 (2648): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4644, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4652, selfPID=4388, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4600, iMonCtr=2
Model crash detected, will try to restart...
07:24:48 (4732): No heartbeat from core client for 30 sec - exiting
07:24:50 (4732): No heartbeat from core client for 30 sec - exiting
07:24:51 (4732): No heartbeat from core client for 30 sec - exiting
07:24:52 (4732): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:24:53 (4732): No heartbeat from core client for 30 sec - exiting
07:24:54 (4732): No heartbeat from core client for 30 sec - exiting
07:35:39 (4156): No heartbeat from core client for 30 sec - exiting
07:35:40 (4156): No heartbeat from core client for 30 sec - exiting
07:35:41 (4156): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5444, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5392, iMonCtr=2
Model crash detected, will try to restart...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4336, selfPID=5128, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6104, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6120, selfPID=4416, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5720, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5764, iMonCtr=
2
del crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4716, selfPID=4716, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED.                                                                                                                                                                                                                     tmp/xaakm.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_adt4_1977_1_007865662_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_adt4_1977_1_007865662_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_adt4_1977_1_007865662_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
28 Apr 2012 20:30:21 1169010 14388833 hadam3p_eu_adt4_1977_1_007865662_0 103,776 354,545 3.4164
27 Apr 2012 12:06:31 1169010 14388833 hadam3p_eu_adt4_1977_1_007865662_0 92,256 321,731 3.4874
23 Apr 2012 22:42:59 1169010 14388833 hadam3p_eu_adt4_1977_1_007865662_0 80,736 253,981 3.1458
19 Apr 2012 16:34:13 1169010 14388833 hadam3p_eu_adt4_1977_1_007865662_0 69,216 199,095 2.8764
18 Apr 2012 17:19:35 1169010 14388833 hadam3p_eu_adt4_1977_1_007865662_0 57,696 165,749 2.8728
17 Apr 2012 17:00:46 1169010 14388833 hadam3p_eu_adt4_1977_1_007865662_0 46,176 133,068 2.8818
16 Apr 2012 18:02:51 1169010 14388833 hadam3p_eu_adt4_1977_1_007865662_0 34,656 99,115 2.8600
15 Apr 2012 18:18:39 1169010 14388833 hadam3p_eu_adt4_1977_1_007865662_0 23,136 66,715 2.8836
14 Apr 2012 18:09:33 1169010 14388833 hadam3p_eu_adt4_1977_1_007865662_0 11,616 34,590 2.9778


©2024 climateprediction.net