climateprediction.net home page
Task 15197317

Task 15197317

Name hadam3p_eu_2tei_1962_1_008155673_2
Workunit 8310797
Created 28 Aug 2012, 19:14:28 UTC
Sent 28 Aug 2012, 19:33:07 UTC
Report deadline 11 Aug 2013, 0:53:07 UTC
Received 25 Nov 2012, 14:34:50 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1080754
Run time 7 days 3 hours 36 min 39 sec
CPU time 4 days 6 hours 4 min 36 sec
Validate state Invalid
Credit 1,194.02
Device peak FLOPS 1.25 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4560, selfPID=4560, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4204, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5520, selfPID=4924, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4876, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5464, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5836, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7732, selfPID=5492, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5560, selfPID=5280, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4956, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1080, selfPID=5668, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4356, selfPID=5296, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1264, selfPID=5016, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5356, selfPID=5408, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GCPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6064, selfPID=5312, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=232, selfPID=4884, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5036, selfPID=5100, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
15:43:01 (4896): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:43:02 (4896): No heartbeat from core client for 30 sec - exiting
15:43:03 (4896): No heartbeat from core client for 30 sec - exiting
15:43:04 (4896): No heartbeat from core client for 30 sec - exiting
13:42:47 (4844): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:42:48 (4844): No heartbeat from core client for 30 sec - exiting
13:42:49 (4844): No heartbeat from core client for 30 sec - exiting
13:42:50 (4844): No heartbeat from core client for 30 sec - exiting
13:42:51 (4844): No heartbeat from core client for 30 sec - exiting
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4136, selfPID=4136, iMonCtr=2
13:42:52 (4844): No heartbeat from core client for 30 sec - exiting
13:42:53 (4844): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4648, selfPID=5492, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
13:03:53 (2852): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:03:54 (2852): No heartbeat from core client for 30 sec - exiting
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5204, selfPID=1660, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5584, selfPID=5356, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4604, selfPID=5176, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5556, selfPID=4776, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4572, selfPID=5212, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
17:13:08 (5204): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6108, selfPID=5140, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_2tei_1962_1_008155673\tmp\xaakg.namelists
Image              PC        Routine            Line        Source             
hadrm3p_eu_um_6.0  003DC52A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00384460  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  0038362A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00362469  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  002666EB  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00302AE2  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  003035AF  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  000A9860  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  003C0893  Unknown               Unknown  Unknown
KERNEL32.dll       75DBED6C  Unknown               Unknown  Unknown
ntdll.dll          770F377B  Unknown               Unknown  Unknown
ntdll.dll          770F374E  Unknown               Unknown  Unknown
rrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_2tei_1962_1_008155673\tmp\xaakm.namelists
Image              PC        Routine            Line        Source             
hadam3p_eu_um_6.0  016EA39A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01692CD0  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01691E9A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01672819  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01572287  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0160E7B2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0160F2DA  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01389BD2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  016CE638  Unknown               Unknown  Unknown
KERNEL32.dll       75DBED6C  Unknown               Unknown  Unknown
ntdll.dll          770F377B  Unknown               Unknown  Unknown
ntdll.dll          770F374E  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4224, selfPID=5064, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_2tei_1962_1_008155673_2_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_2tei_1962_1_008155673_2_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_2tei_1962_1_008155673_2_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_2tei_1962_1_008155673_2_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_2tei_1962_1_008155673_2_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_2tei_1962_1_008155673_2_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
22 Nov 2012 15:23:58 1080754 15197317 hadam3p_eu_2tei_1962_1_008155673_2 69,216 323,675 4.6763
10 Nov 2012 19:39:08 1080754 15197317 hadam3p_eu_2tei_1962_1_008155673_2 57,696 269,883 4.6777
01 Nov 2012 09:41:42 1080754 15197317 hadam3p_eu_2tei_1962_1_008155673_2 46,176 216,136 4.6807
21 Oct 2012 06:34:43 1080754 15197317 hadam3p_eu_2tei_1962_1_008155673_2 34,656 162,816 4.6981
05 Oct 2012 10:22:50 1080754 15197317 hadam3p_eu_2tei_1962_1_008155673_2 23,136 108,862 4.7053
26 Sep 2012 18:29:07 1080754 15197317 hadam3p_eu_2tei_1962_1_008155673_2 11,616 55,120 4.7452


©2024 climateprediction.net