Task 19418072

Name hadcm3n_xa6b_198012_480_374_010412528_0
Workunit 10412528
Created 21 Mar 2016, 15:11:50 UTC
Sent 21 Mar 2016, 19:52:19 UTC
Report deadline 4 Mar 2017, 1:12:19 UTC
Received 2 Apr 2016, 5:30:35 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 615938
Run time 7 days 17 hours 49 min 4 sec
CPU time 6 days 23 hours 2 min 56 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 3.79 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin
Stderr
<core_client_version>7.6.22</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:50:11 (746): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
hadcm3n_6.07_i686-apple-darwin(772,0xa4fab240) malloc: *** error for object 0x101e600: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
SIGILL: illegal instruction
05:18:35 (772): No heartbeat from core client for 30 sec - exiting
07:47:29 (772): No heartbeat from core client for 30 sec - exiting
07:47:30 (772): No heartbeat from core client for 30 sec - exiting
09:12:38 (772): No heartbeat from core client for 30 sec - exiting
09:12:39 (772): No heartbeat from core client for 30 sec - exiting
09:52:16 (772): No heartbeat from core client for 30 sec - exiting
09:52:17 (772): No heartbeat from core client for 30 sec - exiting
09:52:18 (772): No heartbeat from core client for 30 sec - exiting
09:52:19 (772): No heartbeat from core client for 30 sec - exiting
09:52:20 (772): No heartbeat from core client for 30 sec - exiting
09:52:21 (772): No heartbeat from core client for 30 sec - exiting
09:52:22 (772): No heartbeat from core client for 30 sec - exiting
09:52:23 (772): No heartbeat from core client for 30 sec - exiting
09:52:24 (772): No heartbeat from core client for 30 sec - exiting
09:52:25 (772): No heartbeat from core client for 30 sec - exiting
09:52:26 (772): No heartbeat from core client for 30 sec - exiting
09:52:27 (772): No heartbeat from core client for 30 sec - exiting
09:52:28 (772): No heartbeat from core client for 30 sec - exiting
09:52:29 (772): No heartbeat from core client for 30 sec - exiting
09:52:30 (772): No heartbeat from core client for 30 sec - exiting
09:52:31 (772): No heartbeat from core client for 30 sec - exiting
09:52:32 (772): No heartbeat from core client for 30 sec - exiting
09:52:33 (772): No heartbeat from core client for 30 sec - exiting
09:52:34 (772): No heartbeat from core client for 30 sec - exiting
09:52:35 (772): No heartbeat from core client for 30 sec - exiting
09:52:36 (772): No heartbeat from core client for 30 sec - exiting
09:52:37 (772): No heartbeat from core client for 30 sec - exiting
09:52:38 (772): No heartbeat from core client for 30 sec - exiting
09:52:39 (772): No heartbeat from core client for 30 sec - exiting
09:52:41 (772): No heartbeat from core client for 30 sec - exiting
09:52:42 (772): No heartbeat from core client for 30 sec - exiting
09:52:43 (772): No heartbeat from core client for 30 sec - exiting
09:52:44 (772): No heartbeat from core client for 30 sec - exiting
09:52:45 (772): No heartbeat from core client for 30 sec - exiting
09:52:46 (772): No heartbeat from core client for 30 sec - exiting
09:52:47 (772): No heartbeat from core client for 30 sec - exiting
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=773, selfPID=773, iMonCtr=1
hadcm3n_6.07_i686-apple-darwin(846,0xa4f57240) malloc: *** error for object 0x831c00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
SIGILL: illegal instruction

Crashed executable name: hadcm3n_6.07_i686-apple-darwin
built using BOINC library version 6.13.0
Machine type Intel x86-64h Haswell (32-bit executable)
System version: Macintosh OS 10.12 build 16A161
Tue Mar 29 10:23:04 2016

13:48:36 (846): No heartbeat from core client for 30 sec - exiting
13:48:37 (846): No heartbeat from core client for 30 sec - exiting
13:48:38 (846): No heartbeat from core client for 30 sec - exiting
13:48:39 (846): No heartbeat from core client for 30 sec - exiting
13:48:40 (846): No heartbeat from core client for 30 sec - exiting
13:48:41 (846): No heartbeat from core client for 30 sec - exiting
13:48:42 (846): No heartbeat from core client for 30 sec - exiting
13:48:43 (846): No heartbeat from core client for 30 sec - exiting
13:48:44 (846): No heartbeat from core client for 30 sec - exiting
13:48:45 (846): No heartbeat from core client for 30 sec - exiting
13:48:46 (846): No heartbeat from core client for 30 sec - exiting
13:48:48 (846): No heartbeat from core client for 30 sec - exiting
13:48:49 (846): No heartbeat from core client for 30 sec - exiting
13:48:50 (846): No heartbeat from core client for 30 sec - exiting
13:48:51 (846): No heartbeat from core client for 30 sec - exiting
13:48:52 (846): No heartbeat from core client for 30 sec - exiting
13:48:53 (846): No heartbeat from core client for 30 sec - exiting
13:48:54 (846): No heartbeat from core client for 30 sec - exiting
13:48:55 (846): No heartbeat from core client for 30 sec - exiting
13:48:56 (846): No heartbeat from core client for 30 sec - exiting
13:48:57 (846): No heartbeat from core client for 30 sec - exiting
13:48:58 (846): No heartbeat from core client for 30 sec - exiting
13:48:59 (846): No heartbeat from core client for 30 sec - exiting
13:49:00 (846): No heartbeat from core client for 30 sec - exiting
13:49:01 (846): No heartbeat from core client for 30 sec - exiting
13:49:02 (846): No heartbeat from core client for 30 sec - exiting
13:49:03 (846): No heartbeat from core client for 30 sec - exiting
13:49:04 (846): No heartbeat from core client for 30 sec - exiting
13:49:05 (846): No heartbeat from core client for 30 sec - exiting
hadcm3n_6.07_i686-apple-darwin(1196,0xa4f5a240) malloc: *** error for object 0x835400: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
SIGILL: illegal instruction
13:46:36 (1196): No heartbeat from core client for 30 sec - exiting
13:46:37 (1196): No heartbeat from core client for 30 sec - exiting
13:46:38 (1196): No heartbeat from core client for 30 sec - exiting
13:46:39 (1196): No heartbeat from core client for 30 sec - exiting
13:46:40 (1196): No heartbeat from core client for 30 sec - exiting
13:46:41 (1196): No heartbeat from core client for 30 sec - exiting
13:46:42 (1196): No heartbeat from core client for 30 sec - exiting
13:46:43 (1196): No heartbeat from core client for 30 sec - exiting
13:46:44 (1196): No heartbeat from core client for 30 sec - exiting
13:46:45 (1196): No heartbeat from core client for 30 sec - exiting
13:46:46 (1196): No heartbeat from core client for 30 sec - exiting
13:46:48 (1196): No heartbeat from core client for 30 sec - exiting
13:46:49 (1196): No heartbeat from core client for 30 sec - exiting
13:46:50 (1196): No heartbeat from core client for 30 sec - exiting
13:46:51 (1196): No heartbeat from core client for 30 sec - exiting
13:46:52 (1196): No heartbeat from core client for 30 sec - exiting
13:46:53 (1196): No heartbeat from core client for 30 sec - exiting
13:46:54 (1196): No heartbeat from core client for 30 sec - exiting
13:46:55 (1196): No heartbeat from core client for 30 sec - exiting
13:46:56 (1196): No heartbeat from core client for 30 sec - exiting
13:46:57 (1196): No heartbeat from core client for 30 sec - exiting
13:46:58 (1196): No heartbeat from core client for 30 sec - exiting
13:46:59 (1196): No heartbeat from core client for 30 sec - exiting
13:47:00 (1196): No heartbeat from core client for 30 sec - exiting
13:47:01 (1196): No heartbeat from core client for 30 sec - exiting
13:47:02 (1196): No heartbeat from core client for 30 sec - exiting
13:47:03 (1196): No heartbeat from core client for 30 sec - exiting
13:47:04 (1196): No heartbeat from core client for 30 sec - exiting
13:47:05 (1196): No heartbeat from core client for 30 sec - exiting
hadcm3n_6.07_i686-apple-darwin(3739,0xa4fac240) malloc: *** error for object 0x1036800: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
SIGILL: illegal instruction

Crashed executable name: hadcm3n_6.07_i686-apple-darwin
built using BOINC library version 6.13.0
Machine type Intel x86-64h Haswell (32-bit executable)
System version: Macintosh OS 10.12 build 16A164
Fri Apr  1 21:29:17 2016

Thread 0 Crashed:

Thread 1:

atos cannot load symbols for the file hadcm3n_6.07_i686-apple-darwin for architecture i386.
Thread 0 crashed with X86 Thread State (32-bit):
  eax: 0x00000000 ebx: 0x00000000 ecx: 0x00000000 edx: 0x00000000
  edi: 0x00000000 esi: 0x00000000 ebp: 0xbff8fd28 esp: 0x00000000
   ss: 0x00000000 efl: 0x00000000 eip: 0x0007697e  cs: 0x00000000
   ds: 0x00000000  es: 0x00000000  fs: 0x00000000  gs: 0x00000000

Binary Images Description:
    0x1000 -    0x93fff /Volumes/Totoro/BOINC Data/slots/4/../../projects/climateprediction.net/hadcm3n_6.07_i686-apple-darwin
  0x127000 -   0x177fff /usr/lib/libstdc++.6.dylib
  0x1d9000 -   0x1d9fff /usr/lib/system/liblaunch.dylib
  0x1e0000 -   0x1e2fff /usr/lib/system/libsystem_sandbox.dylib
  0x1e9000 -   0x20ffff /usr/lib/system/libxpc.dylib
0x9e166000 - 0x9e167fff /usr/lib/libSystem.B.dylib
0x9e250000 - 0x9e250fff /usr/lib/libauto.dylib
0x9e2ec000 - 0x9e341fff /usr/lib/libc++.1.dylib
0x9e342000 - 0x9e365fff /usr/lib/libc++abi.dylib
0x9ec56000 - 0x9f014fff /usr/lib/libobjc.A.dylib
0x9f3ff000 - 0x9f40dfff /usr/lib/libz.1.dylib
0x9f41c000 - 0x9f420fff /usr/lib/system/libcache.dylib
0x9f421000 - 0x9f42bfff /usr/lib/system/libcommonCrypto.dylib
0x9f42c000 - 0x9f431fff /usr/lib/system/libcompiler_rt.dylib
0x9f432000 - 0x9f43afff /usr/lib/system/libcopyfile.dylib
0x9f43b000 - 0x9f4a5fff /usr/lib/system/libcorecrypto.dylib
0x9f4a6000 - 0x9f4d9fff /usr/lib/system/libdispatch.dylib
0x9f4da000 - 0x9f4dffff /usr/lib/system/libdyld.dylib
0x9f4e0000 - 0x9f4e0fff /usr/lib/system/libkeymgr.dylib
0x9f4ee000 - 0x9f4f3fff /usr/lib/system/libmacho.dylib
0x9f4f4000 - 0x9f4f6fff /usr/lib/system/libquarantine.dylib
0x9f4f7000 - 0x9f4f8fff /usr/lib/system/libremovefile.dylib
0x9f4f9000 - 0x9f510fff /usr/lib/system/libsystem_asl.dylib
0x9f511000 - 0x9f511fff /usr/lib/system/libsystem_blocks.dylib
0x9f512000 - 0x9f59ffff /usr/lib/system/libsystem_c.dylib
0x9f5a0000 - 0x9f5a3fff /usr/lib/system/libsystem_configuration.dylib
0x9f5a4000 - 0x9f5a6fff /usr/lib/system/libsystem_coreservices.dylib
0x9f5a7000 - 0x9f5befff /usr/lib/system/libsystem_coretls.dylib
0x9f5bf000 - 0x9f5c5fff /usr/lib/system/libsystem_dnssd.dylib
0x9f5c6000 - 0x9f5edfff /usr/lib/system/libsystem_info.dylib
0x9f5ee000 - 0x9f60efff /usr/lib/system/libsystem_kernel.dylib
0x9f60f000 - 0x9f65afff /usr/lib/system/libsystem_m.dylib
0x9f65b000 - 0x9f673fff /usr/lib/system/libsystem_malloc.dylib
0x9f674000 - 0x9f6c6fff /usr/lib/system/libsystem_network.dylib
0x9f6c7000 - 0x9f6d0fff /usr/lib/system/libsystem_networkextension.dylib
0x9f6d1000 - 0x9f6d9fff /usr/lib/system/libsystem_notify.dylib
0x9f6da000 - 0x9f6e0fff /usr/lib/system/libsystem_platform.dylib
0x9f6e1000 - 0x9f6e9fff /usr/lib/system/libsystem_pthread.dylib
0x9f6ed000 - 0x9f6eefff /usr/lib/system/libsystem_secinit.dylib
0x9f6ef000 - 0x9f6f6fff /usr/lib/system/libsystem_symptoms.dylib
0x9f6f7000 - 0x9f713fff /usr/lib/system/libsystem_trace.dylib
0x9f714000 - 0x9f714fff /usr/lib/system/libunc.dylib
0x9f715000 - 0x9f71bfff /usr/lib/system/libunwind.dylib


Exiting...

</stderr_txt>
]]>
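
The message line in the stderr above gives the exit status three ways: 193, 0xc1 and -63. A minimal sketch of how those relate, assuming the third figure is simply the low byte read as a signed 8-bit value (the helper name `as_signed_byte` is made up for this illustration and is not part of BOINC):

```python
def as_signed_byte(code: int) -> int:
    """Interpret the low 8 bits of an exit code as a signed byte (assumed reading)."""
    low = code & 0xFF
    return low - 256 if low > 127 else low


exit_status = 193  # value taken from the "Exit status" field of this task

# Show the three representations side by side.
print(exit_status, hex(exit_status), as_signed_byte(exit_status))
```

Under that assumption the three numbers in the message describe the same status, the one the page labels EXIT_SIGNAL.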
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                              Timestep  CPU Time (sec)  Average (sec/TS)
29 Mar 2016 07:59:47  615938   19418072   hadcm3n_xa6b_198012_480_374_010412528_0  259,200   578,472         2.2318
28 Mar 2016 15:42:25  615938   19418072   hadcm3n_xa6b_198012_480_374_010412528_0  233,280   526,617         2.2574
28 Mar 2016 07:33:34  615938   19418072   hadcm3n_xa6b_198012_480_374_010412528_0  207,360   499,745         2.4100
27 Mar 2016 14:31:57  615938   19418072   hadcm3n_xa6b_198012_480_374_010412528_0  181,440   444,321         2.4489
26 Mar 2016 18:26:34  615938   19418072   hadcm3n_xa6b_198012_480_374_010412528_0  155,520   378,775         2.4355
25 Mar 2016 22:18:23  615938   19418072   hadcm3n_xa6b_198012_480_374_010412528_0  129,600   313,860         2.4218
25 Mar 2016 01:54:54  615938   19418072   hadcm3n_xa6b_198012_480_374_010412528_0  103,680   247,966         2.3916
24 Mar 2016 03:57:25  615938   19418072   hadcm3n_xa6b_198012_480_374_010412528_0  77,760    179,916         2.3137
23 Mar 2016 08:04:19  615938   19418072   hadcm3n_xa6b_198012_480_374_010412528_0  51,840    117,325         2.2632
22 Mar 2016 16:02:15  615938   19418072   hadcm3n_xa6b_198012_480_374_010412528_0  25,920    63,586          2.4532
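
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the timestep count. A small sketch that rechecks this against the ten rows, with the figures copied from the table (the names `trickles` and `sec_per_timestep` are illustrative only and do not come from the project):

```python
# (timestep, cumulative CPU seconds, reported average sec/TS), newest first,
# copied from the trickle table for result 19418072.
trickles = [
    (259200, 578472, 2.2318),
    (233280, 526617, 2.2574),
    (207360, 499745, 2.4100),
    (181440, 444321, 2.4489),
    (155520, 378775, 2.4355),
    (129600, 313860, 2.4218),
    (103680, 247966, 2.3916),
    (77760, 179916, 2.3137),
    (51840, 117325, 2.2632),
    (25920, 63586, 2.4532),
]


def sec_per_timestep(timestep: int, cpu_seconds: int) -> float:
    """Average CPU seconds per model timestep, rounded to four decimals like the table."""
    return round(cpu_seconds / timestep, 4)


# Print the recomputed value next to the reported one for each trickle.
for timestep, cpu_seconds, reported in trickles:
    print(timestep, sec_per_timestep(timestep, cpu_seconds), reported)
```

Each recomputed value agrees with the reported column to four decimal places, and the timestep counts rise in steps of 25,920 between consecutive trickles.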

