climateprediction.net home page
Task 10889761

Task 10889761

Name hadam3p_n8ag_1975_2_006159234_3
Workunit 6425291
Created 7 Mar 2010, 17:53:37 UTC
Sent 4 Jun 2010, 1:02:36 UTC
Report deadline 17 May 2011, 6:22:36 UTC
Received 8 Jun 2010, 1:03:00 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1054113
Run time 3 days 10 hours 22 min 23 sec
CPU time 3 days 10 hours 22 min 23 sec
Validate state Invalid
Credit 1,829.52
Device peak FLOPS 2.69 GFLOPS
Application version UK Met Office HADAM3P v6.14
i686-pc-linux-gnu
Stderr
<core_client_version>6.4.5</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15292, iMonCtr=1
Model crash detected, will try to restart...
 (15292): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (15292): No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13587, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13587, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13587, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13587, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13587, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13587, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13587, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13587, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13587, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13587, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13587, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13587, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13587, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13587, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13587, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13587, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13587, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13587, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
 (13587): called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
07 Jun 2010 18:21:07 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 63,360 286,302 4.5187
07 Jun 2010 12:30:32 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 60,480 268,084 4.4326
07 Jun 2010 08:20:46 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 57,600 254,056 4.4107
07 Jun 2010 04:16:43 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 54,720 240,003 4.3860
07 Jun 2010 00:30:46 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 51,840 226,831 4.3756
06 Jun 2010 20:48:01 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 48,960 214,075 4.3724
06 Jun 2010 17:10:59 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 46,080 201,460 4.3720
06 Jun 2010 13:32:55 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 43,200 188,861 4.3718
06 Jun 2010 09:56:09 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 40,320 176,366 4.3742
06 Jun 2010 06:19:02 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 37,440 163,748 4.3736
06 Jun 2010 03:05:47 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 34,560 151,212 4.3753
05 Jun 2010 23:05:47 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 31,680 138,687 4.3777
05 Jun 2010 19:30:15 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 28,800 125,980 4.3743
05 Jun 2010 15:51:14 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 25,920 113,315 4.3717
05 Jun 2010 12:13:40 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 23,040 100,718 4.3714
05 Jun 2010 08:37:44 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 20,160 88,017 4.3659
05 Jun 2010 05:00:15 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 17,280 75,400 4.3634
05 Jun 2010 01:25:58 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 14,400 62,823 4.3627
04 Jun 2010 21:45:26 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 11,520 50,129 4.3515
04 Jun 2010 18:10:41 1054113 10889761 hadam3p_n8ag_1975_2_006159234_3 8,640 37,554 4.3465


©2024 climateprediction.net