Name | hadcm3n_4lqa_1980_40_008389933_0 |
Workunit | 8540792 |
Created | 4 Jun 2013, 1:59:00 UTC |
Sent | 4 Jun 2013, 23:42:28 UTC |
Report deadline | 4 Sep 2013, 7:09:39 UTC |
Received | 6 Jun 2013, 10:27:48 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1282401 |
Run time | 1 days 6 hours 19 min 53 sec |
CPU time | 1 days 5 hours 28 min 12 sec |
Validate state | Invalid |
Credit | 311.04 |
Device peak FLOPS | 1.99 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 01:46:39 (8512): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:50:06 (9129): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 03:34:08 (9255): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:37:51 (10201): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:26:07 (10314): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:30:01 (11298): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:37:50 (11401): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:46:12 (11553): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:46:13 (11553): No heartbeat from core client for 30 sec - exiting 05:46:14 (11553): No heartbeat from core client for 30 sec - exiting 09:24:26 (11730): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:28:25 (13891): No heartbeat from core client for 30 sec - exiting 09:28:26 (13891): No heartbeat from core client for 30 sec - exiting 09:28:27 (13891): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:32:13 (14016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:32:14 (14016): No heartbeat from core client for 30 sec - exiting 11:28:44 (14125): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
11:32:30 (15168): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:32:31 (15168): No heartbeat from core client for 30 sec - exiting 11:32:32 (15168): No heartbeat from core client for 30 sec - exiting 11:32:33 (15168): No heartbeat from core client for 30 sec - exiting 11:32:34 (15168): No heartbeat from core client for 30 sec - exiting 11:32:35 (15168): No heartbeat from core client for 30 sec - exiting 11:32:36 (15168): No heartbeat from core client for 30 sec - exiting 11:32:37 (15168): No heartbeat from core client for 30 sec - exiting 11:32:38 (15168): No heartbeat from core client for 30 sec - exiting 11:32:39 (15168): No heartbeat from core client for 30 sec - exiting 14:35:29 (15286): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:35:30 (15286): No heartbeat from core client for 30 sec - exiting 14:35:31 (15286): No heartbeat from core client for 30 sec - exiting 15:30:11 (16894): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:34:28 (17461): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
15:34:29 (17461): No heartbeat from core client for 30 sec - exiting 15:34:30 (17461): No heartbeat from core client for 30 sec - exiting 15:34:31 (17461): No heartbeat from core client for 30 sec - exiting 15:34:32 (17461): No heartbeat from core client for 30 sec - exiting 15:34:33 (17461): No heartbeat from core client for 30 sec - exiting 15:34:34 (17461): No heartbeat from core client for 30 sec - exiting 15:34:35 (17461): No heartbeat from core client for 30 sec - exiting 15:34:36 (17461): No heartbeat from core client for 30 sec - exiting 15:34:37 (17461): No heartbeat from core client for 30 sec - exiting 15:34:38 (17461): No heartbeat from core client for 30 sec - exiting 15:34:39 (17461): No heartbeat from core client for 30 sec - exiting 15:34:40 (17461): No heartbeat from core client for 30 sec - exiting 15:34:41 (17461): No heartbeat from core client for 30 sec - exiting 15:38:38 (17574): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:38:39 (17574): No heartbeat from core client for 30 sec - exiting 15:38:40 (17574): No heartbeat from core client for 30 sec - exiting 15:38:41 (17574): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 17:16:13 (17729): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:24:24 (18636): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:28:43 (18822): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:28:44 (18822): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 17:33:22 (18968): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:32:47 (19084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
19:32:48 (19084): No heartbeat from core client for 30 sec - exiting 19:32:49 (19084): No heartbeat from core client for 30 sec - exiting 19:32:50 (19084): No heartbeat from core client for 30 sec - exiting 19:32:51 (19084): No heartbeat from core client for 30 sec - exiting 19:32:52 (19084): No heartbeat from core client for 30 sec - exiting 19:32:53 (19084): No heartbeat from core client for 30 sec - exiting 19:32:54 (19084): No heartbeat from core client for 30 sec - exiting 19:32:55 (19084): No heartbeat from core client for 30 sec - exiting 20:28:18 (20177): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:32:34 (20740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:33:06 (20740): No heartbeat from core client for 30 sec - exiting 20:37:33 (20890): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:37:34 (20890): No heartbeat from core client for 30 sec - exiting 20:37:35 (20890): No heartbeat from core client for 30 sec - exiting 20:37:36 (20890): No heartbeat from core client for 30 sec - exiting 20:37:37 (20890): No heartbeat from core client for 30 sec - exiting 20:37:38 (20890): No heartbeat from core client for 30 sec - exiting 20:41:56 (21054): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:04:19 (21207): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:04:20 (21207): No heartbeat from core client for 30 sec - exiting 22:24:59 (21961): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:28:42 (22229): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
22:28:43 (22229): No heartbeat from core client for 30 sec - exiting 22:28:44 (22229): No heartbeat from core client for 30 sec - exiting 22:28:45 (22229): No heartbeat from core client for 30 sec - exiting 22:28:46 (22229): No heartbeat from core client for 30 sec - exiting 22:28:47 (22229): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 22:32:27 (22321): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:36:08 (22445): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:40:27 (23058): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:45:12 (23168): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:44:34 (23310): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:26:30 (23905): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:30:05 (25310): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:25:50 (25431): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:30:06 (28736): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:42:57 (28848): No heartbeat from core client for 30 sec - exiting 10:42:58 (28848): No heartbeat from core client for 30 sec - exiting 10:42:59 (28848): No heartbeat from core client for 30 sec - exiting 10:43:00 (28848): No heartbeat from core client for 30 sec - exiting 10:43:01 (28848): No heartbeat from core client for 30 sec - exiting 10:43:02 (28848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:47:16 (29569): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
10:51:15 (29685): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:55:34 (29812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:55:35 (29812): No heartbeat from core client for 30 sec - exiting 10:55:36 (29812): No heartbeat from core client for 30 sec - exiting 10:55:37 (29812): No heartbeat from core client for 30 sec - exiting 10:55:38 (29812): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77bf400] [0xf77bf425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75dc1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75df825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75c74d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=29950, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf774b400] [0xf774b425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75681df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf756b825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75534d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=29950, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf776e400] [0xf776e425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf758b1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf758e825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75764d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=29950, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77ad400] [0xf77ad425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75ca1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75cd825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75b54d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=29950, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf778f400] [0xf778f425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75ac1df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75af825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75974d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=29950, iMonCtr=1 Model crash detected, will try to restart... 
SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] [0xf77bb400] [0xf77bb425] /lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75d81df] /lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75db825] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75c34d3] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=29950, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
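
The stderr output above repeats three kinds of message: heartbeat timeouts, failed renames of `atmos_restart.hold`, and `SIGABRT` aborts followed by a restart attempt. A minimal sketch for tallying them is shown below. It assumes the stderr text has already been extracted into a string. The pattern names and the `tally_events` helper are hypothetical and are not part of BOINC or the CPDN application.

```python
import re
from collections import Counter

# Message fragments copied from the stderr text above.
# The dictionary keys are illustrative labels, not official event names.
PATTERNS = {
    "heartbeat_timeout": re.compile(r"No heartbeat from core client for 30 sec"),
    "model_abort": re.compile(r"SIGABRT: abort called"),
    "restart_rename_failed": re.compile(r"Restart file rename failed"),
}


def tally_events(stderr_text: str) -> Counter:
    """Count how often each known message fragment occurs in the text."""
    counts = Counter()
    for name, pattern in PATTERNS.items():
        counts[name] = len(pattern.findall(stderr_text))
    return counts


# Abbreviated excerpt of the log above, used only to exercise the function.
sample = (
    "10:55:37 (29812): No heartbeat from core client for 30 sec - exiting "
    "10:55:38 (29812): No heartbeat from core client for 30 sec - exiting "
    "SIGABRT: abort called Stack trace (9 frames): "
    "Model crash detected, will try to restart... "
)

counts = tally_events(sample)
print(dict(counts))
```

Running the same function over the full stderr field would give a per-message count for the whole task, which can make it easier to see whether the heartbeat timeouts or the aborts dominate.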
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
05 Jun 2013 19:22:39 | 1282401 | 15827188 | hadcm3n_4lqa_1980_40_008389933_0 | 25,920 | 61,109 | 2.3576 |
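
The Average (sec/TS) column appears to be the CPU time divided by the timestep count. The short check below reproduces the value in the trickle row under that assumption. The helper names `parse_count` and `seconds_per_timestep` are hypothetical and are used only for illustration.

```python
def parse_count(field: str) -> int:
    """Convert a table field such as '61,109' to an integer."""
    return int(field.replace(",", ""))


def seconds_per_timestep(cpu_seconds: int, timesteps: int) -> float:
    """Average CPU seconds spent per model timestep."""
    if timesteps <= 0:
        raise ValueError("timesteps must be positive")
    return cpu_seconds / timesteps


# Values taken from the trickle row above.
cpu_seconds = parse_count("61,109")
timesteps = parse_count("25,920")

average = seconds_per_timestep(cpu_seconds, timesteps)
print(f"{average:.4f}")  # 2.3576
```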
©2024 climateprediction.net