Task 15450006

Name hadcm3n_zjms_1880_40_008249732_1
Workunit 8404856
Created 22 Nov 2012, 0:58:52 UTC
Sent 22 Nov 2012, 0:58:57 UTC
Report deadline 21 Feb 2013, 8:26:08 UTC
Received 15 Dec 2012, 17:58:46 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1140527
Run time 21 days 20 hours 54 min 48 sec
CPU time 17 days 9 hours 54 min 48 sec
Validate state Invalid
Credit 9,642.24
Device peak FLOPS 2.89 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:02:21 (24159): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:02:22 (24159): No heartbeat from core client for 30 sec - exiting
20:02:23 (24159): No heartbeat from core client for 30 sec - exiting
20:02:24 (24159): No heartbeat from core client for 30 sec - exiting
20:02:25 (24159): No heartbeat from core client for 30 sec - exiting
20:02:26 (24159): No heartbeat from core client for 30 sec - exiting
20:02:27 (24159): No heartbeat from core client for 30 sec - exiting
20:02:28 (24159): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
03:22:55 (22763): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:37:54 (5876): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:45:08 (6974): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:46:00 (7473): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:46:01 (7473): No heartbeat from core client for 30 sec - exiting
03:46:02 (7473): No heartbeat from core client for 30 sec - exiting
03:46:03 (7473): No heartbeat from core client for 30 sec - exiting
03:46:04 (7473): No heartbeat from core client for 30 sec - exiting
03:46:05 (7473): No heartbeat from core client for 30 sec - exiting
03:46:06 (7473): No heartbeat from core client for 30 sec - exiting
03:46:07 (7473): No heartbeat from core client for 30 sec - exiting
04:11:13 (7583): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:32:52 (9318): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:32:53 (9318): No heartbeat from core client for 30 sec - exiting
07:32:54 (9318): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
07:54:39 (22823): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:54:40 (22823): No heartbeat from core client for 30 sec - exiting
08:08:21 (24272): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:37:42 (25355): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:38:25 (27250): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:38:26 (27250): No heartbeat from core client for 30 sec - exiting
08:38:27 (27250): No heartbeat from core client for 30 sec - exiting
08:38:28 (27250): No heartbeat from core client for 30 sec - exiting
08:38:29 (27250): No heartbeat from core client for 30 sec - exiting
08:38:30 (27250): No heartbeat from core client for 30 sec - exiting
08:38:31 (27250): No heartbeat from core client for 30 sec - exiting
08:38:32 (27250): No heartbeat from core client for 30 sec - exiting
08:52:01 (27426): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:52:02 (27426): No heartbeat from core client for 30 sec - exiting
08:52:03 (27426): No heartbeat from core client for 30 sec - exiting
08:52:04 (27426): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
09:17:12 (28353): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:37:58 (29996): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:03:33 (31398): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:06:34 (767): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:13:51 (1103): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:13:52 (1103): No heartbeat from core client for 30 sec - exiting
10:13:53 (1103): No heartbeat from core client for 30 sec - exiting
10:57:08 (1690): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:59:13 (4765): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:59:14 (4765): No heartbeat from core client for 30 sec - exiting
SIGABRT: abort called
Stack trace (10 frames):
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf774b400]
[0xf774b430]
/lib32/libc.so.6(gsignal+0x51)[0xf75b3951]
/lib32/libc.so.6(abort+0x182)[0xf75b6d82]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe6)[0xf759fbd6]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=22258, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7726400]
[0xf7726430]
/lib32/libc.so.6(gsignal+0x51)[0xf758e951]
/lib32/libc.so.6(abort+0x182)[0xf7591d82]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe6)[0xf757abd6]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=22258, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf772e400]
[0xf772e430]
/lib32/libc.so.6(gsignal+0x51)[0xf7596951]
/lib32/libc.so.6(abort+0x182)[0xf7599d82]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe6)[0xf7582bd6]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=22258, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf773f400]
[0xf773f430]
/lib32/libc.so.6(gsignal+0x51)[0xf75a7951]
/lib32/libc.so.6(abort+0x182)[0xf75aad82]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe6)[0xf7593bd6]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=22258, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf76f3400]
[0xf76f3430]
/lib32/libc.so.6(gsignal+0x51)[0xf755b951]
/lib32/libc.so.6(abort+0x182)[0xf755ed82]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe6)[0xf7547bd6]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=22258, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (10 frames):
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7777400]
[0xf7777430]
/lib32/libc.so.6(gsignal+0x51)[0xf75df951]
/lib32/libc.so.6(abort+0x182)[0xf75e2d82]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xe6)[0xf75cbbd6]
/igel2/Vol1/BOINC/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x804cb11]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=22258, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
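Note on the exit code: the three numbers in "process exited with code 22 (0x16, -234)" all describe the same status. The sketch below reproduces them in Python. The helper name describe_exit_code is hypothetical, and the reading of the third number as (-256 | code) is an assumption that happens to match the value logged here. It is not taken from this page.

```python
def describe_exit_code(code: int) -> str:
    """Format an exit code as decimal, hex, and (-256 | code).

    Assumption: the third number in the log line is the exit code
    with all bits above the low byte set, i.e. -256 | code.
    """
    return f"{code} (0x{code:x}, {-256 | code})"


def padded_hex(code: int) -> str:
    """Eight-digit hex form, as in the 'Exit status' field of the page."""
    return f"0x{code:08x}"


print(describe_exit_code(22))
print(padded_hex(22))
```

For code 22 this yields the same decimal, hex, and negative values that appear in the task summary and in the message element above.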
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
15 Dec 2012 14:40:38 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 803,520 1,503,386 1.8710
15 Dec 2012 00:16:46 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 777,600 1,456,388 1.8729
14 Dec 2012 14:47:39 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 751,680 1,407,637 1.8727
14 Dec 2012 14:47:39 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 725,760 1,359,576 1.8733
14 Dec 2012 14:47:39 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 699,840 1,311,914 1.8746
14 Dec 2012 14:47:39 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 673,920 1,264,665 1.8766
14 Dec 2012 14:47:39 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 648,000 1,218,015 1.8797
14 Dec 2012 14:47:39 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 622,080 1,171,247 1.8828
14 Dec 2012 14:47:39 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 596,160 1,121,016 1.8804
14 Dec 2012 14:47:39 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 570,240 1,072,903 1.8815
14 Dec 2012 14:47:39 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 544,320 1,021,792 1.8772
14 Dec 2012 14:47:39 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 518,400 972,072 1.8751
14 Dec 2012 14:47:39 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 492,480 925,062 1.8784
08 Dec 2012 01:09:42 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 466,560 877,849 1.8815
07 Dec 2012 11:37:27 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 440,640 829,501 1.8825
06 Dec 2012 22:25:09 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 414,720 781,606 1.8847
06 Dec 2012 09:03:24 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 388,800 734,198 1.8884
05 Dec 2012 11:24:58 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 362,880 685,371 1.8887
04 Dec 2012 12:38:03 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 336,960 636,150 1.8879
03 Dec 2012 14:10:35 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 311,040 586,331 1.8851
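Note on the Average column: the values in the table above are consistent with CPU Time (sec) divided by Timestep, rounded to four decimals. The following sketch checks that on three rows copied from the table. The function names parse_trickle_row and average_sec_per_ts are hypothetical, and the whitespace-based parsing assumes the last three fields of each row are Timestep, CPU Time, and Average.

```python
# Three rows copied verbatim from the trickle table above.
ROWS = """\
15 Dec 2012 14:40:38 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 803,520 1,503,386 1.8710
15 Dec 2012 00:16:46 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 777,600 1,456,388 1.8729
03 Dec 2012 14:10:35 1140527 15450006 hadcm3n_zjms_1880_40_008249732_1 311,040 586,331 1.8851
"""


def parse_trickle_row(line: str) -> tuple[int, int, float]:
    """Return (timestep, cpu_seconds, reported_average) from one row.

    Assumes the last three whitespace-separated fields are the
    timestep, the CPU time in seconds, and the reported average.
    """
    timestep, cpu, avg = line.split()[-3:]
    return int(timestep.replace(",", "")), int(cpu.replace(",", "")), float(avg)


def average_sec_per_ts(timestep: int, cpu_seconds: int) -> float:
    """CPU seconds spent per model timestep."""
    return cpu_seconds / timestep


parsed = [parse_trickle_row(line) for line in ROWS.splitlines()]
for timestep, cpu, reported in parsed:
    computed = average_sec_per_ts(timestep, cpu)
    print(f"{timestep:>8} steps  computed={computed:.4f}  reported={reported:.4f}")
```

Consecutive rows in the table differ by 25,920 timesteps, so the same helper can be applied to differences between rows to estimate the cost of each individual trickle interval.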


©2024 climateprediction.net