Name | hadcm3n_820w_1980_40_008460563_1 |
Workunit | 8611419 |
Created | 10 Nov 2013, 9:14:14 UTC |
Sent | 10 Nov 2013, 9:14:21 UTC |
Report deadline | 9 Feb 2014, 16:41:32 UTC |
Received | 15 Nov 2013, 12:21:15 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1266912 |
Run time | 1 days 4 hours 55 min 8 sec |
CPU time | 1 days 2 hours 52 min 56 sec |
Validate state | Invalid |
Credit | 622.08 |
Device peak FLOPS | 2.98 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>6.12.43</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 16:47:20 (16695): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:47:21 (16695): No heartbeat from core client for 30 sec - exiting 16:47:22 (16695): No heartbeat from core client for 30 sec - exiting 16:47:23 (16695): No heartbeat from core client for 30 sec - exiting 16:47:24 (16695): No heartbeat from core client for 30 sec - exiting 16:47:25 (16695): No heartbeat from core client for 30 sec - exiting 16:47:26 (16695): No heartbeat from core client for 30 sec - exiting 16:47:27 (16695): No heartbeat from core client for 30 sec - exiting 16:47:28 (16695): No heartbeat from core client for 30 sec - exiting 16:47:29 (16695): No heartbeat from core client for 30 sec - exiting 16:47:30 (16695): No heartbeat from core client for 30 sec - exiting 16:47:31 (16695): No heartbeat from core client for 30 sec - exiting 16:47:32 (16695): No heartbeat from core client for 30 sec - exiting 16:52:28 (17078): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:52:29 (17078): No heartbeat from core client for 30 sec - exiting 16:52:30 (17078): No heartbeat from core client for 30 sec - exiting 17:03:22 (17154): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:03:23 (17154): No heartbeat from core client for 30 sec - exiting 17:14:17 (17362): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:14:18 (17362): No heartbeat from core client for 30 sec - exiting 17:14:19 (17362): No heartbeat from core client for 30 sec - exiting 17:14:20 (17362): No heartbeat from core client for 30 sec - exiting 17:14:21 (17362): No heartbeat from core client for 30 sec - exiting 17:14:22 (17362): No heartbeat from core client for 30 sec - exiting 17:30:01 (17544): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:30:29 (17825): No heartbeat from core client for 30 sec - exiting 18:30:33 (17825): No heartbeat from core client for 30 sec - exiting 18:30:34 (17825): No heartbeat from core client for 30 sec - exiting 18:30:35 (17825): No heartbeat from core client for 30 sec - exiting 18:30:36 (17825): No heartbeat from core client for 30 sec - exiting 18:30:37 (17825): No heartbeat from core client for 30 sec - exiting 18:30:38 (17825): No heartbeat from core client for 30 sec - exiting 18:30:39 (17825): No heartbeat from core client for 30 sec - exiting 18:30:40 (17825): No heartbeat from core client for 30 sec - exiting 18:30:41 (17825): No heartbeat from core client for 30 sec - exiting 18:30:42 (17825): No heartbeat from core client for 30 sec - exiting 18:30:43 (17825): No heartbeat from core client for 30 sec - exiting 18:30:44 (17825): No heartbeat from core client for 30 sec - exiting 18:30:45 (17825): No heartbeat from core client for 30 sec - exiting 18:30:46 (17825): No heartbeat from core client for 30 sec - exiting 18:30:47 (17825): No heartbeat from core client for 30 sec - exiting 18:30:48 (17825): No heartbeat from core client for 30 sec - exiting 18:30:49 (17825): No heartbeat from core client for 30 sec - exiting 18:30:50 (17825): No heartbeat from core client for 30 sec - exiting 18:30:51 (17825): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:33:12 (18815): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:11:51 (18856): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:11:52 (18856): No heartbeat from core client for 30 sec - exiting 19:11:53 (18856): No heartbeat from core client for 30 sec - exiting 19:33:14 (19521): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:33:15 (19521): No heartbeat from core client for 30 sec - exiting 19:33:16 (19521): No heartbeat from core client for 30 sec - exiting 19:33:17 (19521): No heartbeat from core client for 30 sec - exiting 20:02:20 (19876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 00:05:04 (29074): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:16:17 (13045): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:58:18 (13482): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 03:04:36 (19582): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:18:18 (19912): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:46:03 (20399): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:46:04 (20399): No heartbeat from core client for 30 sec - exiting 03:46:05 (20399): No heartbeat from core client for 30 sec - exiting 03:54:13 (21540): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:54:14 (21540): No heartbeat from core client for 30 sec - exiting 03:54:15 (21540): No heartbeat from core client for 30 sec - exiting 03:54:16 (21540): No heartbeat from core client for 30 sec - exiting 03:54:17 (21540): No heartbeat from core client for 30 sec - exiting 03:54:18 (21540): No heartbeat from core client for 30 sec - exiting 03:54:19 (21540): No heartbeat from core client for 30 sec - exiting 03:54:20 (21540): No heartbeat from core client for 30 sec - exiting 03:54:21 (21540): No heartbeat from core client for 30 sec - exiting 03:54:22 (21540): No heartbeat from core client for 30 sec - exiting 04:02:35 (21800): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:00:10 (22133): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:08:02 (24273): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:08:03 (24273): No heartbeat from core client for 30 sec - exiting 05:08:04 (24273): No heartbeat from core client for 30 sec - exiting 05:40:15 (24525): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:15:35 (25731): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:15:36 (25731): No heartbeat from core client for 30 sec - exiting 06:15:37 (25731): No heartbeat from core client for 30 sec - exiting 06:15:38 (25731): No heartbeat from core client for 30 sec - exiting 06:15:39 (25731): No heartbeat from core client for 30 sec - exiting 06:15:40 (25731): No heartbeat from core client for 30 sec - exiting 06:15:41 (25731): No heartbeat from core client for 30 sec - exiting 06:15:42 (25731): No heartbeat from core client for 30 sec - exiting 06:15:43 (25731): No heartbeat from core client for 30 sec - exiting 06:15:44 (25731): No heartbeat from core client for 30 sec - exiting 06:15:45 (25731): No heartbeat from core client for 30 sec - exiting 06:15:46 (25731): No heartbeat from core client for 30 sec - exiting 06:15:47 (25731): No heartbeat from core client for 30 sec - exiting 06:15:48 (25731): No heartbeat from core client for 30 sec - exiting 06:15:49 (25731): No heartbeat from core client for 30 sec - exiting 06:15:50 (25731): No heartbeat from core client for 30 sec - exiting 06:15:51 (25731): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 10:02:38 (3082): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:41:27 (3156): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:01:29 (22795): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:01:30 (22795): No heartbeat from core client for 30 sec - exiting 19:01:31 (22795): No heartbeat from core client for 30 sec - exiting 19:01:32 (22795): No heartbeat from core client for 30 sec - exiting 19:01:33 (22795): No heartbeat from core client for 30 sec - exiting 19:01:34 (22795): No heartbeat from core client for 30 sec - exiting 19:01:35 (22795): No heartbeat from core client for 30 sec - exiting 19:01:36 (22795): No heartbeat from core client for 30 sec - exiting 19:01:37 (22795): No heartbeat from core client for 30 sec - exiting 19:01:38 (22795): No heartbeat from core client for 30 sec - exiting 19:01:39 (22795): No heartbeat from core client for 30 sec - exiting 19:01:40 (22795): No heartbeat from core client for 30 sec - exiting 19:01:41 (22795): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 21:19:01 (25487): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:19:02 (25487): No heartbeat from core client for 30 sec - exiting 21:19:03 (25487): No heartbeat from core client for 30 sec - exiting 21:19:04 (25487): No heartbeat from core client for 30 sec - exiting 21:19:05 (25487): No heartbeat from core client for 30 sec - exiting 21:19:06 (25487): No heartbeat from core client for 30 sec - exiting 21:19:07 (25487): No heartbeat from core client for 30 sec - exiting 21:19:08 (25487): No heartbeat from core client for 30 sec - exiting 21:19:09 (25487): No heartbeat from core client for 30 sec - exiting 21:19:10 (25487): No heartbeat from core client for 30 sec - exiting 21:19:11 (25487): No heartbeat from core client for 30 sec - exiting 21:19:12 (25487): No heartbeat from core client for 30 sec - exiting 21:19:13 (25487): No heartbeat from core client for 30 sec - exiting 21:19:14 (25487): No heartbeat from core client for 30 sec - exiting 21:19:15 (25487): No heartbeat from core client for 30 sec - exiting 21:19:16 (25487): No heartbeat from core client for 30 sec - exiting 21:19:17 (25487): No heartbeat from core client for 30 sec - exiting 21:19:18 (25487): No heartbeat from core client for 30 sec - exiting 21:19:19 (25487): No heartbeat from core client for 30 sec - exiting 21:19:20 (25487): No heartbeat from core client for 30 sec - exiting 21:43:49 (25819): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:31:13 (26161): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:46:34 (11557): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:12:42 (11808): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:33:45 (12255): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:06:31 (12662): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 11:06:57 (13979): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:06:58 (13979): No heartbeat from core client for 30 sec - exiting 11:42:30 (14453): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 11:46:13 (15554): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 23:32:40 (9486): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:32:41 (9486): No heartbeat from core client for 30 sec - exiting 23:32:42 (9486): No heartbeat from core client for 30 sec - exiting 23:32:43 (9486): No heartbeat from core client for 30 sec - exiting 23:32:44 (9486): No heartbeat from core client for 30 sec - exiting 23:32:45 (9486): No heartbeat from core client for 30 sec - exiting 23:32:46 (9486): No heartbeat from core client for 30 sec - exiting 23:32:47 (9486): No heartbeat from core client for 30 sec - exiting 23:32:48 (9486): No heartbeat from core client for 30 sec - exiting 23:32:49 (9486): No heartbeat from core client for 30 sec - exiting 23:46:23 (11266): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:46:24 (11266): No heartbeat from core client for 30 sec - exiting 00:43:11 (11744): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:59:19 (14555): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:59:20 (14555): No heartbeat from core client for 30 sec - exiting 01:26:39 (15243): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:26:53 (16274): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:27:47 (18579): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 09:32:10 (2715): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:32:11 (2715): No heartbeat from core client for 30 sec - exiting 10:07:42 (2782): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:07:43 (2782): No heartbeat from core client for 30 sec - exiting 10:07:44 (2782): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 13:22:07 (11536): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:22:08 (11536): No heartbeat from core client for 30 sec - exiting 13:22:09 (11536): No heartbeat from core client for 30 sec - exiting 13:22:10 (11536): No heartbeat from core client for 30 sec - exiting 13:22:11 (11536): No heartbeat from core client for 30 sec - exiting 13:22:12 (11536): No heartbeat from core client for 30 sec - exiting 13:24:11 (11959): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:24:15 (11959): No heartbeat from core client for 30 sec - exiting 13:24:16 (11959): No heartbeat from core client for 30 sec - exiting SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7737400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7737430] /lib/libc.so.6(gsignal+0x4f)[0xf757131f] /lib/libc.so.6(abort+0x143)[0xf7572c03] /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf755c3d5] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12010, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7759400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7759430] /lib/libc.so.6(gsignal+0x4f)[0xf759331f] /lib/libc.so.6(abort+0x143)[0xf7594c03] /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf757e3d5] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12010, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf776d400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf776d430] /lib/libc.so.6(gsignal+0x4f)[0xf75a731f] /lib/libc.so.6(abort+0x143)[0xf75a8c03] /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf75923d5] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12010, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7757400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7757430] /lib/libc.so.6(gsignal+0x4f)[0xf759131f] /lib/libc.so.6(abort+0x143)[0xf7592c03] /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf757c3d5] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12010, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf776f400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf776f430] /lib/libc.so.6(gsignal+0x4f)[0xf75a931f] /lib/libc.so.6(abort+0x143)[0xf75aac03] /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf75943d5] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12010, iMonCtr=1 Model crash detected, will try to restart... SIGABRT: abort called Stack trace (9 frames): /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f] linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7772400] linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7772430] /lib/libc.so.6(gsignal+0x4f)[0xf75ac31f] /lib/libc.so.6(abort+0x143)[0xf75adc03] /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3] /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395] /var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8] /lib/libc.so.6(__libc_start_main+0xf5)[0xf75973d5] Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12010, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
15 Nov 2013 09:45:59 | 1266912 | 16079210 | hadcm3n_820w_1980_40_008460563_1 | 51,840 | 95,456 | 1.8414 |
13 Nov 2013 02:04:48 | 1266912 | 16079210 | hadcm3n_820w_1980_40_008460563_1 | 25,920 | 45,973 | 1.7736 |
©2024 climateprediction.net