Task 15777014

Name hadcm3n_n2sm_1920_40_008366634_0
Workunit 8517493
Created 11 May 2013, 6:16:20 UTC
Sent 11 May 2013, 6:21:24 UTC
Report deadline 10 Aug 2013, 13:48:35 UTC
Received 3 Sep 2013, 16:13:09 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1291528
Run time 2 days 7 hours 53 min 39 sec
CPU time 2 days 5 hours 34 min 17 sec
Validate state Invalid
Credit 933.12
Device peak FLOPS 2.00 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (i686-pc-linux-gnu)
Stderr
<core_client_version>7.0.27</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
00:12:18 (14049): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:55:34 (14894): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:59:35 (15079): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:59:36 (15079): No heartbeat from core client for 30 sec - exiting
05:59:37 (15079): No heartbeat from core client for 30 sec - exiting
06:03:30 (15198): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:03:31 (15198): No heartbeat from core client for 30 sec - exiting
06:03:32 (15198): No heartbeat from core client for 30 sec - exiting
06:07:42 (15311): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:06:18 (15384): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:09:58 (16020): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:09:59 (16020): No heartbeat from core client for 30 sec - exiting
18:10:00 (16020): No heartbeat from core client for 30 sec - exiting
18:10:01 (16020): No heartbeat from core client for 30 sec - exiting
18:10:02 (16020): No heartbeat from core client for 30 sec - exiting
18:10:03 (16020): No heartbeat from core client for 30 sec - exiting
18:13:38 (16096): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:13:39 (16096): No heartbeat from core client for 30 sec - exiting
18:13:40 (16096): No heartbeat from core client for 30 sec - exiting
18:13:41 (16096): No heartbeat from core client for 30 sec - exiting
18:13:42 (16096): No heartbeat from core client for 30 sec - exiting
18:13:43 (16096): No heartbeat from core client for 30 sec - exiting
18:13:44 (16096): No heartbeat from core client for 30 sec - exiting
18:13:45 (16096): No heartbeat from core client for 30 sec - exiting
18:17:24 (16176): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:17:25 (16176): No heartbeat from core client for 30 sec - exiting
18:17:26 (16176): No heartbeat from core client for 30 sec - exiting
18:17:27 (16176): No heartbeat from core client for 30 sec - exiting
18:17:28 (16176): No heartbeat from core client for 30 sec - exiting
18:17:29 (16176): No heartbeat from core client for 30 sec - exiting
18:17:30 (16176): No heartbeat from core client for 30 sec - exiting
18:17:31 (16176): No heartbeat from core client for 30 sec - exiting
18:17:32 (16176): No heartbeat from core client for 30 sec - exiting
18:17:33 (16176): No heartbeat from core client for 30 sec - exiting
18:17:34 (16176): No heartbeat from core client for 30 sec - exiting
18:17:35 (16176): No heartbeat from core client for 30 sec - exiting
18:17:36 (16176): No heartbeat from core client for 30 sec - exiting
18:17:37 (16176): No heartbeat from core client for 30 sec - exiting
18:17:38 (16176): No heartbeat from core client for 30 sec - exiting
18:17:39 (16176): No heartbeat from core client for 30 sec - exiting
18:17:40 (16176): No heartbeat from core client for 30 sec - exiting
18:17:41 (16176): No heartbeat from core client for 30 sec - exiting
18:17:42 (16176): No heartbeat from core client for 30 sec - exiting
18:21:14 (16259): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:21:15 (16259): No heartbeat from core client for 30 sec - exiting
18:21:16 (16259): No heartbeat from core client for 30 sec - exiting
18:21:17 (16259): No heartbeat from core client for 30 sec - exiting
18:21:18 (16259): No heartbeat from core client for 30 sec - exiting
18:21:19 (16259): No heartbeat from core client for 30 sec - exiting
18:21:20 (16259): No heartbeat from core client for 30 sec - exiting
18:21:21 (16259): No heartbeat from core client for 30 sec - exiting
18:21:22 (16259): No heartbeat from core client for 30 sec - exiting
18:21:23 (16259): No heartbeat from core client for 30 sec - exiting
18:21:24 (16259): No heartbeat from core client for 30 sec - exiting
18:21:25 (16259): No heartbeat from core client for 30 sec - exiting
18:21:26 (16259): No heartbeat from core client for 30 sec - exiting
18:25:12 (16339): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:25:13 (16339): No heartbeat from core client for 30 sec - exiting
18:29:06 (16419): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:29:07 (16419): No heartbeat from core client for 30 sec - exiting
18:29:08 (16419): No heartbeat from core client for 30 sec - exiting
18:29:09 (16419): No heartbeat from core client for 30 sec - exiting
18:29:10 (16419): No heartbeat from core client for 30 sec - exiting
18:32:55 (16503): No heartbeat from core client for 30 sec - exiting
18:32:56 (16503): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:36:46 (16593): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:36:47 (16593): No heartbeat from core client for 30 sec - exiting
18:36:48 (16593): No heartbeat from core client for 30 sec - exiting
18:36:49 (16593): No heartbeat from core client for 30 sec - exiting
18:36:50 (16593): No heartbeat from core client for 30 sec - exiting
18:36:51 (16593): No heartbeat from core client for 30 sec - exiting
22:19:56 (16669): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:19:57 (16669): No heartbeat from core client for 30 sec - exiting
22:19:58 (16669): No heartbeat from core client for 30 sec - exiting
22:19:59 (16669): No heartbeat from core client for 30 sec - exiting
22:20:00 (16669): No heartbeat from core client for 30 sec - exiting
22:23:21 (16846): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:23:22 (16846): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
15:48:56 (3918): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:08:34 (4318): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:08:35 (4318): No heartbeat from core client for 30 sec - exiting
16:08:36 (4318): No heartbeat from core client for 30 sec - exiting
16:08:37 (4318): No heartbeat from core client for 30 sec - exiting
16:23:11 (4525): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:23:12 (4525): No heartbeat from core client for 30 sec - exiting
16:23:13 (4525): No heartbeat from core client for 30 sec - exiting
16:23:14 (4525): No heartbeat from core client for 30 sec - exiting
16:39:50 (4849): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:39:51 (4849): No heartbeat from core client for 30 sec - exiting
16:42:10 (4953): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:42:11 (4953): No heartbeat from core client for 30 sec - exiting
16:42:12 (4953): No heartbeat from core client for 30 sec - exiting
16:42:13 (4953): No heartbeat from core client for 30 sec - exiting
16:42:14 (4953): No heartbeat from core client for 30 sec - exiting
16:42:15 (4953): No heartbeat from core client for 30 sec - exiting
16:42:16 (4953): No heartbeat from core client for 30 sec - exiting
16:42:17 (4953): No heartbeat from core client for 30 sec - exiting
16:42:18 (4953): No heartbeat from core client for 30 sec - exiting
16:42:19 (4953): No heartbeat from core client for 30 sec - exiting
16:42:20 (4953): No heartbeat from core client for 30 sec - exiting
16:42:21 (4953): No heartbeat from core client for 30 sec - exiting
16:42:22 (4953): No heartbeat from core client for 30 sec - exiting
16:42:23 (4953): No heartbeat from core client for 30 sec - exiting
16:42:24 (4953): No heartbeat from core client for 30 sec - exiting
16:42:25 (4953): No heartbeat from core client for 30 sec - exiting
16:42:26 (4953): No heartbeat from core client for 30 sec - exiting
16:44:56 (5063): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:44:57 (5063): No heartbeat from core client for 30 sec - exiting
16:50:59 (5134): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:58:08 (5248): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:58:09 (5248): No heartbeat from core client for 30 sec - exiting
16:58:10 (5248): No heartbeat from core client for 30 sec - exiting
16:58:11 (5248): No heartbeat from core client for 30 sec - exiting
16:58:12 (5248): No heartbeat from core client for 30 sec - exiting
16:58:13 (5248): No heartbeat from core client for 30 sec - exiting
16:58:14 (5248): No heartbeat from core client for 30 sec - exiting
16:58:15 (5248): No heartbeat from core client for 30 sec - exiting
16:58:16 (5248): No heartbeat from core client for 30 sec - exiting
17:00:46 (5341): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:00:47 (5341): No heartbeat from core client for 30 sec - exiting
17:05:19 (5642): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:05:20 (5642): No heartbeat from core client for 30 sec - exiting
17:05:21 (5642): No heartbeat from core client for 30 sec - exiting
17:05:22 (5642): No heartbeat from core client for 30 sec - exiting
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77a6400]
[0xf77a6430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75c31df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75c6825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75ae4d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5705, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7790400]
[0xf7790430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75ad1df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75b0825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75984d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5705, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf76e8400]
[0xf76e8430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75051df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7508825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf74f04d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5705, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7757400]
[0xf7757430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75741df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7577825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf755f4d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5705, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7761400]
[0xf7761430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf757e1df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7581825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75694d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5705, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77af400]
[0xf77af430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75cc1df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75cf825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75b74d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5705, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
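The message line in the stderr above reports the exit status in three forms: 22, 0x16 and -234. The first two are the same value in decimal and hexadecimal. The page does not explain the third; one reading, offered here as an assumption, is that it is the same status with every bit above the low byte set, which equals 22 - 256. The helper name below is hypothetical and only illustrates the arithmetic.

```python
def exit_status_forms(status: int) -> tuple[int, str, int]:
    """Return the status as decimal, hex, and with the bits above the low byte set.

    Assumption: the third form is (-1 << 8) | status, i.e. status - 256
    for a status in the range 0..255. The page itself does not say this.
    """
    return status, hex(status), (-1 << 8) | status


print(exit_status_forms(22))  # (22, '0x16', -234)
```

Under that assumption the three numbers in the log are consistent with one another and all describe exit status 22.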
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
03 Sep 2013 16:13:34  1291528  15777014   hadcm3n_n2sm_1920_40_008366634_0  77,760    192,419         2.4745
12 May 2013 23:49:11  1281428  15777014   hadcm3n_n2sm_1920_40_008366634_0  51,840    135,167         2.6074
12 May 2013 06:45:22  1281428  15777014   hadcm3n_n2sm_1920_40_008366634_0  25,920    77,375          2.9851
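
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The sketch below recomputes it from the three trickle rows above. The function name is hypothetical, and the rounding rule is an assumption inferred from the figures shown.

```python
def seconds_per_timestep(cpu_seconds: int, timesteps: int) -> float:
    """Average CPU seconds per model timestep, rounded to 4 decimals (assumed)."""
    return round(cpu_seconds / timesteps, 4)


# (timestep, CPU time in seconds) copied from the trickle table
trickles = [(77_760, 192_419), (51_840, 135_167), (25_920, 77_375)]

for timesteps, cpu_seconds in trickles:
    print(timesteps, cpu_seconds, seconds_per_timestep(cpu_seconds, timesteps))
```

The recomputed values match the table (2.4745, 2.6074, 2.9851), so the average falls as the run progresses.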

