Task 16774713

Name hadam3p_eu_g5wz_2013_1_008855612_0
Workunit 9001541
Created 8 Jul 2014, 20:31:35 UTC
Sent 9 Jul 2014, 8:24:35 UTC
Report deadline 21 Jun 2015, 13:44:35 UTC
Received 22 Jul 2014, 14:33:36 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1326982
Run time 1 days 1 hours 37 min 13 sec
CPU time 22 hours 16 min 48 sec
Validate state Invalid
Credit 399.11
Device peak FLOPS 1.95 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09 (windows_intelx86)
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=380, selfPID=5392, iMonCtr=1
Model crash detected, will try to restart...
11:17:25 (5044): No heartbeat from core client for 30 sec - exiting
11:17:26 (5044): No heartbeat from core client for 30 sec - exiting
11:17:27 (5044): No heartbeat from core client for 30 sec - exiting
11:17:29 (5044): No heartbeat from core client for 30 sec - exiting
11:17:30 (5044): No heartbeat from core client for 30 sec - exiting
11:17:31 (5044): No heartbeat from core client for 30 sec - exiting
11:17:32 (5044): No heartbeat from core client for 30 sec - exiting
11:17:33 (5044): No heartbeat from core client for 30 sec - exiting
11:17:34 (5044): No heartbeat from core client for 30 sec - exiting
11:17:35 (5044): No heartbeat from core client for 30 sec - exiting
11:17:36 (5044): No heartbeat from core client for 30 sec - exiting
11:17:37 (5044): No heartbeat from core client for 30 sec - exiting
11:17:38 (5044): No heartbeat from core client for 30 sec - exiting
11:17:39 (5044): No heartbeat from core client for 30 sec - exiting
11:17:41 (5044): No heartbeat from core client for 30 sec - exiting
11:17:42 (5044): No heartbeat from core client for 30 sec - exiting
11:17:43 (5044): No heartbeat from core client for 30 sec - exiting
11:17:44 (5044): No heartbeat from core client for 30 sec - exiting
11:17:45 (5044): No heartbeat from core client for 30 sec - exiting
11:17:46 (5044): No heartbeat from core client for 30 sec - exiting
11:17:47 (5044): No heartbeat from core client for 30 sec - exiting
11:17:48 (5044): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:34:45 (4988): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:29:06 (6044): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7308, selfPID=7308, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:50:24 (7400): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:36:19 (608): No heartbeat from core client for 30 sec - exiting
05:36:21 (608): No heartbeat from core client for 30 sec - exiting
05:36:22 (608): No heartbeat from core client for 30 sec - exiting
05:36:23 (608): No heartbeat from core client for 30 sec - exiting
05:36:24 (608): No heartbeat from core client for 30 sec - exiting
05:36:25 (608): No heartbeat from core client for 30 sec - exiting
05:36:26 (608): No heartbeat from core client for 30 sec - exiting
05:36:27 (608): No heartbeat from core client for 30 sec - exiting
05:36:28 (608): No heartbeat from core client for 30 sec - exiting
05:36:29 (608): No heartbeat from core client for 30 sec - exiting
05:36:30 (608): No heartbeat from core client for 30 sec - exiting
05:36:32 (608): No heartbeat from core client for 30 sec - exiting
05:36:33 (608): No heartbeat from core client for 30 sec - exiting
05:36:34 (608): No heartbeat from core client for 30 sec - exiting
05:36:35 (608): No heartbeat from core client for 30 sec - exiting
05:36:36 (608): No heartbeat from core client for 30 sec - exiting
05:36:37 (608): No heartbeat from core client for 30 sec - exiting
05:36:38 (608): No heartbeat from core client for 30 sec - exiting
05:36:39 (608): No heartbeat from core client for 30 sec - exiting
05:36:40 (608): No heartbeat from core client for 30 sec - exiting
05:36:41 (608): No heartbeat from core client for 30 sec - exiting
05:36:42 (608): No heartbeat from core client for 30 sec - exiting
05:36:44 (608): No heartbeat from core client for 30 sec - exiting
05:36:45 (608): No heartbeat from core client for 30 sec - exiting
05:36:46 (608): No heartbeat from core client for 30 sec - exiting
05:36:47 (608): No heartbeat from core client for 30 sec - exiting
05:36:48 (608): No heartbeat from core client for 30 sec - exiting
05:36:49 (608): No heartbeat from core client for 30 sec - exiting
05:36:50 (608): No heartbeat from core client for 30 sec - exiting
05:36:51 (608): No heartbeat from core client for 30 sec - exiting
05:36:52 (608): No heartbeat from core client for 30 sec - exiting
05:36:53 (608): No heartbeat from core client for 30 sec - exiting
05:36:54 (608): No heartbeat from core client for 30 sec - exiting
05:36:56 (608): No heartbeat from core client for 30 sec - exiting
05:36:57 (608): No heartbeat from core client for 30 sec - exiting
05:36:58 (608): No heartbeat from core client for 30 sec - exiting
05:36:59 (608): No heartbeat from core client for 30 sec - exiting
05:37:00 (608): No heartbeat from core client for 30 sec - exiting
05:37:01 (608): No heartbeat from core client for 30 sec - exiting
05:37:02 (608): No heartbeat from core client for 30 sec - exiting
05:37:03 (608): No heartbeat from core client for 30 sec - exiting
05:37:04 (608): No heartbeat from core client for 30 sec - exiting
05:37:05 (608): No heartbeat from core client for 30 sec - exiting
05:37:06 (608): No heartbeat from core client for 30 sec - exiting
05:37:08 (608): No heartbeat from core client for 30 sec - exiting
05:37:09 (608): No heartbeat from core client for 30 sec - exiting
05:37:10 (608): No heartbeat from core client for 30 sec - exiting
05:37:11 (608): No heartbeat from core client for 30 sec - exiting
05:37:12 (608): No heartbeat from core client for 30 sec - exiting
05:37:13 (608): No heartbeat from core client for 30 sec - exiting
05:37:14 (608): No heartbeat from core client for 30 sec - exiting
05:37:15 (608): No heartbeat from core client for 30 sec - exiting
05:37:16 (608): No heartbeat from core client for 30 sec - exiting
05:37:17 (608): No heartbeat from core client for 30 sec - exiting
05:37:18 (608): No heartbeat from core client for 30 sec - exiting
05:37:20 (608): No heartbeat from core client for 30 sec - exiting
05:37:21 (608): No heartbeat from core client for 30 sec - exiting
05:37:22 (608): No heartbeat from core client for 30 sec - exiting
05:37:23 (608): No heartbeat from core client for 30 sec - exiting
05:37:24 (608): No heartbeat from core client for 30 sec - exiting
05:37:25 (608): No heartbeat from core client for 30 sec - exiting
05:37:26 (608): No heartbeat from core client for 30 sec - exiting
05:37:27 (608): No heartbeat from core client for 30 sec - exiting
05:37:28 (608): No heartbeat from core client for 30 sec - exiting
05:37:29 (608): No heartbeat from core client for 30 sec - exiting
05:37:30 (608): No heartbeat from core client for 30 sec - exiting
05:37:32 (608): No heartbeat from core client for 30 sec - exiting
05:37:33 (608): No heartbeat from core client for 30 sec - exiting
05:37:34 (608): No heartbeat from core client for 30 sec - exiting
05:37:35 (608): No heartbeat from core client for 30 sec - exiting
05:37:36 (608): No heartbeat from core client for 30 sec - exiting
05:37:37 (608): No heartbeat from core client for 30 sec - exiting
05:37:38 (608): No heartbeat from core client for 30 sec - exiting
05:37:39 (608): No heartbeat from core client for 30 sec - exiting
05:37:40 (608): No heartbeat from core client for 30 sec - exiting
05:37:41 (608): No heartbeat from core client for 30 sec - exiting
05:37:42 (608): No heartbeat from core client for 30 sec - exiting
05:37:44 (608): No heartbeat from core client for 30 sec - exiting
05:37:45 (608): No heartbeat from core client for 30 sec - exiting
05:37:46 (608): No heartbeat from core client for 30 sec - exiting
05:37:47 (608): No heartbeat from core client for 30 sec - exiting
05:37:48 (608): No heartbeat from core client for 30 sec - exiting
05:37:49 (608): No heartbeat from core client for 30 sec - exiting
05:37:50 (608): No heartbeat from core client for 30 sec - exiting
05:37:51 (608): No heartbeat from core client for 30 sec - exiting
05:37:52 (608): No heartbeat from core client for 30 sec - exiting
05:37:53 (608): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5932, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5792, iMonCtr=2
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4124, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4736, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
09:00:56 (4696): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3640, selfPID=6084, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3920, selfPID=3920, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2860, selfPID=3372, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5028, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_g5wz_2013_1_008855612/dataout/atmos_restart.day after 11 attempts
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_g5wz_2013_1_008855612\tmp\xaakg.namelists

Image              PC        Routine            Line        Source             
hadrm3p_eu_um_6.0  0111C52A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  010C4460  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  010C362A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  010A2469  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00FA66EB  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  01042AE2  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  010435AF  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00DE9860  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  01100893  Unknown               Unknown  Unknown
kernel32.dll       7572338A  Unknown               Unknown  Unknown
ntdll.dll          77C89F72  Unknown               Unknown  Unknown
ntdll.dll          77C89F45  Unknown               Unknown  Unknown
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_g5wz_2013_1_008855612\tmp\xaakm.namelists

Image              PC        Routine            Line        Source             
hadam3p_eu_um_6.0  00D3A39A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  00CE2CD0  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  00CE1E9A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  00CC2819  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  00BC2287  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  00C5E7B2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  00C5F2DA  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  009D9BD2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  00D1E638  Unknown               Unknown  Unknown
kernel32.dll       7572338A  Unknown               Unknown  Unknown
ntdll.dll          77C89F72  Unknown               Unknown  Unknown
ntdll.dll          77C89F45  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2060, selfPID=4820, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_g5wz_2013_1_008855612_0_3.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_g5wz_2013_1_008855612_0_4.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_g5wz_2013_1_008855612_0_5.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_g5wz_2013_1_008855612_0_6.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_g5wz_2013_1_008855612_0_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_g5wz_2013_1_008855612_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_g5wz_2013_1_008855612_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_g5wz_2013_1_008855612_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_g5wz_2013_1_008855612_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_g5wz_2013_1_008855612_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
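The stderr output above is dominated by repeated heartbeat-timeout lines, so a per-process tally is easier to read than the raw text. The following is a minimal sketch of such a tally. The regular expression is an assumption based only on the line shape seen in this log ("HH:MM:SS (PID): No heartbeat from core client ..."), and `count_heartbeat_timeouts` is a hypothetical helper name, not part of BOINC or the CPDN monitor.

```python
import re
from collections import Counter

# Assumed line shape, taken from this log only:
#   "11:17:25 (5044): No heartbeat from core client for 30 sec - exiting"
HEARTBEAT_RE = re.compile(
    r"^(\d{2}:\d{2}:\d{2}) \((\d+)\): No heartbeat from core client"
)


def count_heartbeat_timeouts(stderr_text):
    """Return a Counter mapping process ID to its number of heartbeat-timeout lines."""
    counts = Counter()
    for line in stderr_text.splitlines():
        match = HEARTBEAT_RE.match(line.strip())
        if match:
            counts[int(match.group(2))] += 1
    return counts


# A short excerpt copied from the log above; the monitor line is ignored
# because it carries no timestamp or process ID.
sample = """\
11:17:25 (5044): No heartbeat from core client for 30 sec - exiting
11:17:26 (5044): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:34:45 (4988): No heartbeat from core client for 30 sec - exiting
"""

summary = count_heartbeat_timeouts(sample)
for pid, n in sorted(summary.items()):
    print(f"PID {pid}: {n} heartbeat timeout line(s)")
```

Pasting the full contents of the stderr section into `sample` would give the tally for every process ID in this task.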
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
18 Jul 2014 08:33:12 | 1326982 | 16774713 | hadam3p_eu_g5wz_2013_1_008855612_0 | 23,136 | 71,250 | 3.0796
11 Jul 2014 20:13:12 | 1326982 | 16774713 | hadam3p_eu_g5wz_2013_1_008855612_0 | 11,616 | 35,094 | 3.0212
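
The Average (sec/TS) column in the trickle rows above appears to be CPU time divided by the timestep count. The sketch below checks that reading against the two rows. The formula is inferred from the column names rather than from project documentation, and `seconds_per_timestep` is a hypothetical helper name.

```python
# (timestep, cpu_time_sec, reported_average) copied from the two trickle rows
trickles = [
    (23136, 71250, 3.0796),
    (11616, 35094, 3.0212),
]


def seconds_per_timestep(cpu_time_sec, timestep):
    """Average CPU seconds spent per model timestep (assumed definition)."""
    return cpu_time_sec / timestep


for timestep, cpu_sec, reported in trickles:
    computed = seconds_per_timestep(cpu_sec, timestep)
    print(f"{timestep} timesteps: {computed:.4f} sec/TS (reported {reported})")
```

Both rows agree with the reported values to four decimal places, which supports the assumed definition for this task.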

