Task 12755521

Name	hadcm3n_o61v_1900_40_007203174_2
Workunit	7401454
Created	29 Mar 2011, 21:48:56 UTC
Sent	29 Mar 2011, 21:51:16 UTC
Report deadline	29 Jun 2011, 5:18:27 UTC
Received	20 Apr 2011, 15:31:32 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	852771
Run time	11 days 2 hours 48 min 14 sec
CPU time	5 days 17 hours 40 min 24 sec
Validate state	Invalid
Credit	3,110.40
Device peak FLOPS	2.02 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.18</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> 16:18:45 (5468): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:18:46 (5468): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3192, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2944, iMonCtr=1 Model crash detected, will try to restart... 22:30:45 (3788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:30:47 (3788): No heartbeat from core client for 30 sec - exiting 22:30:48 (3788): No heartbeat from core client for 30 sec - exiting 22:30:49 (3788): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 21:13:24 (5580): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3956, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3212, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 15:11:05 (2492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:11:06 (2492): No heartbeat from core client for 30 sec - exiting 15:11:07 (2492): No heartbeat from core client for 30 sec - exiting 15:11:08 (2492): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1476, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2224, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2400, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=676, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2164, iMonCtr=1 Model crash detected, will try to restart... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x00417B59 read attempt to address 0x80004156 Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77268801 read attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o61v_1900_40_007203174/dataout/shmem_restart.day Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
20 Apr 2011 15:37:26	852771	12755521	hadcm3n_o61v_1900_40_007203174_2	259,200	472,558	1.8231
20 Apr 2011 15:37:26	852771	12755521	hadcm3n_o61v_1900_40_007203174_2	233,280	424,009	1.8176
20 Apr 2011 15:37:25	852771	12755521	hadcm3n_o61v_1900_40_007203174_2	207,360	378,475	1.8252
13 Apr 2011 13:02:36	852771	12755521	hadcm3n_o61v_1900_40_007203174_2	181,440	332,881	1.8347
11 Apr 2011 00:45:52	852771	12755521	hadcm3n_o61v_1900_40_007203174_2	155,520	284,246	1.8277
09 Apr 2011 22:58:19	852771	12755521	hadcm3n_o61v_1900_40_007203174_2	129,600	236,251	1.8229
08 Apr 2011 11:13:57	852771	12755521	hadcm3n_o61v_1900_40_007203174_2	103,680	188,595	1.8190
07 Apr 2011 11:52:19	852771	12755521	hadcm3n_o61v_1900_40_007203174_2	77,760	141,776	1.8233
05 Apr 2011 20:53:55	852771	12755521	hadcm3n_o61v_1900_40_007203174_2	51,840	95,604	1.8442
31 Mar 2011 08:13:21	852771	12755521	hadcm3n_o61v_1900_40_007203174_2	25,920	48,236	1.8610