climateprediction.net home page
Task 16291628

Task 16291628

Name hadcm3n_ofw3_1900_40_008475718_3
Workunit 8626557
Created 18 Feb 2014, 12:01:06 UTC
Sent 18 Feb 2014, 12:09:40 UTC
Report deadline 20 May 2014, 19:36:51 UTC
Received 17 Mar 2014, 22:42:42 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1313937
Run time 18 days 0 hours 5 min 24 sec
CPU time 17 days 16 hours 56 min 29 sec
Validate state Invalid
Credit 12,441.60
Device peak FLOPS 2.62 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
Stderr
<core_client_version>7.0.27</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
21:03:17 (30947): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:08:28 (31985): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:23:45 (32007): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:55:01 (32061): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:46:35 (32251): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
06:23:49 (32753): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:12:14 (317): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:15:07 (482): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:16:18 (718): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:22:47 (2367): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:26:51 (2391): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:55:29 (2415): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:06:10 (10447): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:18:28 (10715): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:40:36 (11049): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:54:05 (10729): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:59:30 (13052): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:16:39 (13226): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:27:57 (13682): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:31:21 (13951): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:25:42 (15503): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 63 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 65 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
CPDN Monitor - Quit request from BOINC...
14:01:37 (4379): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:57:51 (26978): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:59:04 (4768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:01:32 (4822): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:09:37 (4917): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:10:52 (5152): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:10:53 (5152): No heartbeat from core client for 30 sec - exiting
21:10:54 (5152): No heartbeat from core client for 30 sec - exiting
21:10:55 (5152): No heartbeat from core client for 30 sec - exiting
21:10:56 (5152): No heartbeat from core client for 30 sec - exiting
21:10:57 (5152): No heartbeat from core client for 30 sec - exiting
21:10:58 (5152): No heartbeat from core client for 30 sec - exiting
21:10:59 (5152): No heartbeat from core client for 30 sec - exiting
21:11:00 (5152): No heartbeat from core client for 30 sec - exiting
21:11:01 (5152): No heartbeat from core client for 30 sec - exiting
21:11:02 (5152): No heartbeat from core client for 30 sec - exiting
21:11:03 (5152): No heartbeat from core client for 30 sec - exiting
21:11:04 (5152): No heartbeat from core client for 30 sec - exiting
21:11:05 (5152): No heartbeat from core client for 30 sec - exiting
21:11:06 (5152): No heartbeat from core client for 30 sec - exiting
21:11:07 (5152): No heartbeat from core client for 30 sec - exiting
21:11:08 (5152): No heartbeat from core client for 30 sec - exiting
21:11:09 (5152): No heartbeat from core client for 30 sec - exiting
21:11:10 (5152): No heartbeat from core client for 30 sec - exiting
21:14:08 (5262): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:17:45 (5371): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:26:31 (5459): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:27:17 (5688): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:10:18 (5759): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:24:29 (8201): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/pipe_dummy                                                                  2048    

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/pipe_dummy                                                                  2048    

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/pipe_dummy                                                                  2048    

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/pipe_dummy                                                                  2048    

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/pipe_dummy                                                                  2048    
20:59:06 (8625): No heartbeat from core client for 30 sec - exiting
20:59:07 (8625): No heartbeat from core client for 30 sec - exiting
20:59:08 (8625): No heartbeat from core client for 30 sec - exiting
20:59:09 (8625): No heartbeat from core client for 30 sec - exiting
20:59:10 (8625): No heartbeat from core client for 30 sec - exiting
20:59:11 (8625): No heartbeat from core client for 30 sec - exiting
20:59:12 (8625): No heartbeat from core client for 30 sec - exiting
20:59:13 (8625): No heartbeat from core client for 30 sec - exiting
20:59:14 (8625): No heartbeat from core client for 30 sec - exiting
20:59:15 (8625): No heartbeat from core client for 30 sec - exiting
20:59:16 (8625): No heartbeat from core client for 30 sec - exiting
20:59:17 (8625): No heartbeat from core client for 30 sec - exiting
20:59:18 (8625): No heartbeat from core client for 30 sec - exiting
20:59:19 (8625): No heartbeat from core client for 30 sec - exiting
20:59:20 (8625): No heartbeat from core client for 30 sec - exiting
20:59:21 (8625): No heartbeat from core client for 30 sec - exiting
20:59:22 (8625): No heartbeat from core client for 30 sec - exiting
20:59:23 (8625): No heartbeat from core client for 30 sec - exiting
20:59:24 (8625): No heartbeat from core client for 30 sec - exiting
20:59:25 (8625): No heartbeat from core client for 30 sec - exiting
20:59:26 (8625): No heartbeat from core client for 30 sec - exiting
20:59:27 (8625): No heartbeat from core client for 30 sec - exiting
20:59:28 (8625): No heartbeat from core client for 30 sec - exiting
20:59:29 (8625): No heartbeat from core client for 30 sec - exiting
20:59:30 (8625): No heartbeat from core client for 30 sec - exiting
20:59:31 (8625): No heartbeat from core client for 30 sec - exiting
20:59:32 (8625): No heartbeat from core client for 30 sec - exiting
20:59:33 (8625): No heartbeat from core client for 30 sec - exiting
20:59:34 (8625): No heartbeat from core client for 30 sec - exiting
20:59:35 (8625): No heartbeat from core client for 30 sec - exiting
20:59:36 (8625): No heartbeat from core client for 30 sec - exiting
20:59:37 (8625): No heartbeat from core client for 30 sec - exiting
20:59:38 (8625): No heartbeat from core client for 30 sec - exiting
20:59:39 (8625): No heartbeat from core client for 30 sec - exiting
20:59:40 (8625): No heartbeat from core client for 30 sec - exiting
20:59:41 (8625): No heartbeat from core client for 30 sec - exiting
20:59:42 (8625): No heartbeat from core client for 30 sec - exiting
20:59:43 (8625): No heartbeat from core client for 30 sec - exiting
20:59:44 (8625): No heartbeat from core client for 30 sec - exiting
20:59:45 (8625): No heartbeat from core client for 30 sec - exiting
21:00:53 (8625): No heartbeat from core client for 30 sec - exiting
21:01:09 (8625): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:25:07 (16581): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:27:04 (507): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
*** glibc detected *** ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu: double free or corruption (out): 0x09b601c8 ***
======= Backtrace: =========
/lib/i386-linux-gnu/i686/cmov/libc.so.6(+0x70f01)[0xf7589f01]
/lib/i386-linux-gnu/i686/cmov/libc.so.6(+0x72768)[0xf758b768]
/lib/i386-linux-gnu/i686/cmov/libc.so.6(cfree+0x6d)[0xf758e8ad]
/usr/lib/i386-linux-gnu/libstdc++.so.6(_ZdlPv+0x1f)[0xf770d4bf]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8057bc4]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x804f232]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8050491]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805112c]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805137a]
/lib/i386-linux-gnu/i686/cmov/libc.so.6(__libc_start_main+0xe6)[0xf752fe46]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51]
======= Memory map: ========
08048000-080e3000 r-xp 00000000 ca:02 778248                             /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu
080e3000-080e4000 rw-p 0009b000 ca:02 778248                             /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu
080e4000-0813b000 rw-p 00000000 00:00 0 
09b0a000-09b70000 rw-p 00000000 00:00 0                                  [heap]
f6f00000-f6f21000 rw-p 00000000 00:00 0 
f6f21000-f7000000 ---p 00000000 00:00 0 
f709b000-f7516000 rw-s 00000000 ca:02 778460                             /var/lib/boinc-client/slots/4/137135
f7516000-f7519000 rw-p 00000000 00:00 0 
f7519000-f7676000 r-xp 00000000 ca:02 393741                             /lib/i386-linux-gnu/i686/cmov/libc-2.13.so
f7676000-f7677000 ---p 0015d000 ca:02 393741                             /lib/i386-linux-gnu/i686/cmov/libc-2.13.so
f7677000-f7679000 r--p 0015d000 ca:02 393741                             /lib/i386-linux-gnu/i686/cmov/libc-2.13.so
f7679000-f767a000 rw-p 0015f000 ca:02 393741                             /lib/i386-linux-gnu/i686/cmov/libc-2.13.so
f767a000-f767d000 rw-p 00000000 00:00 0 
f767d000-f7699000 r-xp 00000000 ca:02 1041545                            /lib/i386-linux-gnu/libgcc_s.so.1
f7699000-f769a000 rw-p 0001b000 ca:02 1041545                            /lib/i386-linux-gnu/libgcc_s.so.1
f769a000-f76be000 r-xp 00000000 ca:02 393724                             /lib/i386-linux-gnu/i686/cmov/libm-2.13.so
f76be000-f76bf000 r--p 00023000 ca:02 393724                             /lib/i386-linux-gnu/i686/cmov/libm-2.13.so
f76bf000-f76c0000 rw-p 00024000 ca:02 393724                             /lib/i386-linux-gnu/i686/cmov/libm-2.13.so
f76c0000-f77a0000 r-xp 00000000 ca:02 336568                             /usr/lib/i386-linux-gnu/libstdc++.so.6.0.17
f77a0000-f77a4000 r--p 000e0000 ca:02 336568                             /usr/lib/i386-linux-gnu/libstdc++.so.6.0.17
f77a4000-f77a5000 rw-p 000e4000 ca:02 336568                             /usr/lib/i386-linux-gnu/libstdc++.so.6.0.17
f77a5000-f77ad000 rw-p 00000000 00:00 0 
f77ad000-f77af000 r-xp 00000000 ca:02 393730                             /lib/i386-linux-gnu/i686/cmov/libdl-2.13.so
f77af000-f77b0000 r--p 00001000 ca:02 393730                             /lib/i386-linux-gnu/i686/cmov/libdl-2.13.so
f77b0000-f77b1000 rw-p 00002000 ca:02 393730                             /lib/i386-linux-gnu/i686/cmov/libdl-2.13.so
f77b1000-f77c6000 r-xp 00000000 ca:02 393740                             /lib/i386-linux-gnu/i686/cmov/libpthread-2.13.so
f77c6000-f77c7000 r--p 00014000 ca:02 393740                             /lib/i386-linux-gnu/i686/cmov/libpthread-2.13.so
f77c7000-f77c8000 rw-p 00015000 ca:02 393740                             /lib/i386-linux-gnu/i686/cmov/libpthread-2.13.so
f77c8000-f77ca000 rw-p 00000000 00:00 0 
f77cd000-f77ce000 rw-p 00000000 00:00 0 
f77ce000-f77cf000 ---p 00000000 00:00 0 
f77cf000-f77d2000 rw-p 00000000 00:00 0 
f77d2000-f77d4000 rw-s 00000000 ca:02 778457                             /var/lib/boinc-client/slots/4/boinc_mmap_file
f77d4000-f77d6000 rw-p 00000000 00:00 0 
f77d6000-f77d7000 r-xp 00000000 00:00 0                                  [vdso]
f77d7000-f77f3000 r-xp 00000000 ca:02 24589                              /lib/i386-linux-gnu/ld-2.13.so
f77f3000-f77f4000 r--p 0001b000 ca:02 24589                              /lib/i386-linux-gnu/ld-2.13.so
f77f4000-f77f5000 rw-p 0001c000 ca:02 24589                              /lib/i386-linux-gnu/ld-2.13.so
ff985000-ff9f5000 rw-p 00000000 00:00 0                                  [stack]
SIGABRT: abort called
Stack trace (17 frames):
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x80b80df]
[0xf77d6400]
[0xf77d6430]
/lib/i386-linux-gnu/i686/cmov/libc.so.6(gsignal+0x51)[0xf7543941]
/lib/i386-linux-gnu/i686/cmov/libc.so.6(abort+0x182)[0xf7546d72]
/lib/i386-linux-gnu/i686/cmov/libc.so.6(+0x66e15)[0xf757fe15]
/lib/i386-linux-gnu/i686/cmov/libc.so.6(+0x70f01)[0xf7589f01]
/lib/i386-linux-gnu/i686/cmov/libc.so.6(+0x72768)[0xf758b768]
/lib/i386-linux-gnu/i686/cmov/libc.so.6(cfree+0x6d)[0xf758e8ad]
/usr/lib/i386-linux-gnu/libstdc++.so.6(_ZdlPv+0x1f)[0xf770d4bf]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8057bc4]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x804f232]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8050491]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805112c]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805137a]
/lib/i386-linux-gnu/i686/cmov/libc.so.6(__libc_start_main+0xe6)[0xf752fe46]
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51]

Exiting...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
17 Mar 2014 22:42:31 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 1,036,800 1,529,788 1.4755
17 Mar 2014 13:11:03 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 1,010,880 1,495,522 1.4794
17 Mar 2014 03:44:42 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 984,960 1,461,515 1.4838
16 Mar 2014 18:16:24 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 959,040 1,427,404 1.4884
16 Mar 2014 08:47:02 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 933,120 1,393,386 1.4933
15 Mar 2014 23:20:32 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 907,200 1,359,425 1.4985
15 Mar 2014 13:51:57 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 881,280 1,325,382 1.5039
15 Mar 2014 01:24:29 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 855,360 1,290,692 1.5089
14 Mar 2014 15:38:20 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 829,440 1,255,480 1.5136
14 Mar 2014 05:53:26 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 803,520 1,220,321 1.5187
13 Mar 2014 20:01:20 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 777,600 1,185,109 1.5241
13 Mar 2014 10:13:19 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 751,680 1,149,842 1.5297
13 Mar 2014 00:51:19 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 725,760 1,113,742 1.5346
12 Mar 2014 13:57:23 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 699,840 1,077,064 1.5390
12 Mar 2014 03:46:41 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 673,920 1,041,941 1.5461
11 Mar 2014 16:55:36 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 648,000 1,003,939 1.5493
11 Mar 2014 05:54:02 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 622,080 974,662 1.5668
10 Mar 2014 17:02:54 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 596,160 937,635 1.5728
10 Mar 2014 03:59:07 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 570,240 890,710 1.5620
09 Mar 2014 14:06:57 1313937 16291628 hadcm3n_ofw3_1900_40_008475718_3 544,320 844,289 1.5511


©2024 climateprediction.net