Task 13093419

Name hadcm3n_y973_1900_40_007344777_0
Workunit 7542207
Created 6 Jul 2011, 13:26:44 UTC
Sent 22 Jul 2011, 13:11:28 UTC
Report deadline 21 Oct 2011, 20:38:39 UTC
Received 7 Aug 2011, 18:27:30 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 852771
Run time 9 days 3 hours 34 min 52 sec
CPU time 5 days 22 hours 12 min 13 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 2.07 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3680, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3740, iMonCtr=1
Model crash detected, will try to restart...
16:46:48 (3656): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:46:50 (3656): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2372, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2000, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3136, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3716, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3352, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2372, iMonCtr=1
Model crash detected, will try to restart...
17:14:52 (1592): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1508, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3952, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3952, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3952, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3952, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3952, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3952, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadcm3n_y973_1900_40_007344777_0_2.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadcm3n_y973_1900_40_007344777_0_3.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadcm3n_y973_1900_40_007344777_0_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
06 Aug 2011 08:24:58  852771   13093419   hadcm3n_y973_1900_40_007344777_0  259,200   477,923         1.8438
04 Aug 2011 11:37:02  852771   13093419   hadcm3n_y973_1900_40_007344777_0  233,280   429,627         1.8417
03 Aug 2011 09:34:21  852771   13093419   hadcm3n_y973_1900_40_007344777_0  207,360   381,199         1.8383
01 Aug 2011 16:55:40  852771   13093419   hadcm3n_y973_1900_40_007344777_0  181,440   333,240         1.8366
31 Jul 2011 10:24:24  852771   13093419   hadcm3n_y973_1900_40_007344777_0  155,520   283,919         1.8256
28 Jul 2011 11:12:07  852771   13093419   hadcm3n_y973_1900_40_007344777_0  129,600   236,761         1.8269
26 Jul 2011 20:12:02  852771   13093419   hadcm3n_y973_1900_40_007344777_0  103,680   185,786         1.7919
25 Jul 2011 23:10:47  852771   13093419   hadcm3n_y973_1900_40_007344777_0  77,760    138,778         1.7847
25 Jul 2011 22:23:22  852771   13093419   hadcm3n_y973_1900_40_007344777_0  51,840    92,714          1.7885
25 Jul 2011 21:01:26  852771   13093419   hadcm3n_y973_1900_40_007344777_0  25,920    46,781          1.8048
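
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The sketch below checks that against three rows copied from the table; the function name `avg_sec_per_timestep` is hypothetical and is not part of BOINC or the CPDN software.

```python
# Rows copied from the trickle table: (timestep, cumulative CPU seconds, reported average).
trickles = [
    (259200, 477923, 1.8438),
    (129600, 236761, 1.8269),
    (25920, 46781, 1.8048),
]


def avg_sec_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: this is how the page derives its Average (sec/TS) column.
    """
    return round(cpu_seconds / timestep, 4)


for timestep, cpu_seconds, reported in trickles:
    computed = avg_sec_per_timestep(cpu_seconds, timestep)
    print(f"{timestep:>7} timesteps: computed {computed:.4f}, reported {reported:.4f}")
```

On these rows the computed values agree with the reported ones, which is consistent with the column being a running average over the whole task rather than a per-trickle rate.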


©2024 climateprediction.net