climateprediction.net home page
Task 13323617

Task 13323617

Name hadam3p_saf_2cpd_1981_1_007434514_1
Workunit 7632017
Created 1 Sep 2011, 16:51:11 UTC
Sent 1 Sep 2011, 16:51:29 UTC
Report deadline 13 Aug 2012, 22:11:29 UTC
Received 25 Oct 2011, 18:21:36 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1120776
Run time 6 days 6 hours 58 min 3 sec
CPU time 3 days 10 hours 38 min 55 sec
Validate state Invalid
Credit 1,870.33
Device peak FLOPS 3.19 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.09
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3440, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1392, selfPID=5088, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3908, selfPID=1856, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3264, selfPID=5504, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4212, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4204, selfPID=5504, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5928, selfPID=4280, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4140, selfPID=5632, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4976, selfPID=5744, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2072, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1208, selfPID=5720, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1288, selfPID=5620, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4916, selfPID=5144, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3608, selfPID=5608, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1708, selfPID=5316, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1976, selfPID=5744, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3668, selfPID=5144, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1656, selfPID=5512, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2992, selfPID=5404, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5408, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5432, selfPID=5732, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6084, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5012, selfPID=1412, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3656, selfPID=5356, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5708, selfPID=5928, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4860, selfPID=5648, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3668, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3700, selfPID=4620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3400, selfPID=5544, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5944, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4056, selfPID=5760, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=348, selfPID=5860, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=964, selfPID=3240, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4348, selfPID=5540, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5880, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5932, selfPID=5692, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5796, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5572, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4292, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4084, selfPID=1200, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_saf_2cpd_1981_1_007434514\tmp\xaakg.namelists

Image              PC        Routine            Line        Source             
hadrm3p_saf_um_6.  0113C52A  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  010E4460  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  010E362A  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  010C2469  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  00FC66EB  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  01062AE2  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  010635AF  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  00E09860  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  01120893  Unknown               Unknown  Unknown
kernel32.dll       7586ED6C  Unknown               Unknown  Unknown
ntdll.dll          771137F5  Unknown               Unknown  Unknown
ntdll.dll          771137C8  Unknown               Unknown  Unknown
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_saf_2cpd_1981_1_007434514\tmp\xaakm.namelists

Image              PC        Routine            Line        Source             
hadam3p_saf_um_6.  00E9A39A  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00E42CD0  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00E41E9A  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00E22819  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00D22287  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00DBE7B2  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00DBF2DA  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00B39BD2  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00E7E638  Unknown               Unknown  Unknown
kernel32.dll       7586ED6C  Unknown               Unknown  Unknown
ntdll.dll          771137F5  Unknown               Unknown  Unknown
ntdll.dll          771137C8  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3080, selfPID=5464, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_saf_2cpd_1981_1_007434514_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_2cpd_1981_1_007434514_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
31 Oct 2011 14:55:45 1120776 13323617 hadam3p_saf_2cpd_1981_1_007434514_1 115,296 272,298 2.3617
31 Oct 2011 14:55:45 1120776 13323617 hadam3p_saf_2cpd_1981_1_007434514_1 103,776 245,018 2.3610
21 Sep 2011 22:46:19 1120776 13323617 hadam3p_saf_2cpd_1981_1_007434514_1 92,256 221,831 2.4045
20 Sep 2011 19:16:21 1120776 13323617 hadam3p_saf_2cpd_1981_1_007434514_1 80,736 195,067 2.4161
16 Sep 2011 22:44:55 1120776 13323617 hadam3p_saf_2cpd_1981_1_007434514_1 69,216 167,920 2.4260
15 Sep 2011 21:57:57 1120776 13323617 hadam3p_saf_2cpd_1981_1_007434514_1 57,696 140,456 2.4344
13 Sep 2011 17:42:53 1120776 13323617 hadam3p_saf_2cpd_1981_1_007434514_1 46,176 113,078 2.4488
09 Sep 2011 20:31:01 1120776 13323617 hadam3p_saf_2cpd_1981_1_007434514_1 34,656 85,297 2.4612
07 Sep 2011 22:03:01 1120776 13323617 hadam3p_saf_2cpd_1981_1_007434514_1 23,136 56,966 2.4622
06 Sep 2011 18:01:04 1120776 13323617 hadam3p_saf_2cpd_1981_1_007434514_1 11,624 28,967 2.4920
02 Sep 2011 20:55:27 1120776 13323617 hadam3p_saf_2cpd_1981_1_007434514_1 11,616 28,555 2.4582


©2024 climateprediction.net