Task 12325619

Name hadam3p_saf_24x4_1999_1_007038464_0
Workunit 7241780
Created 25 Nov 2010, 11:03:26 UTC
Sent 10 Dec 2010, 9:41:58 UTC
Report deadline 22 Nov 2011, 15:01:58 UTC
Received 15 Dec 2010, 9:15:20 UTC
Server state In progress
Outcome ---
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1077553
Run time 4 days 11 hours 55 min 3 sec
CPU time 4 days 4 hours 38 min
Validate state Invalid
Credit 1,683.45
Device peak FLOPS 1.52 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.08
windows_intelx86
Stderr
<core_client_version>6.10.56</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3772, selfPID=2988, iMonCtr=1
Model crash detected, will try to restart...
22:12:55 (2336): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
08:56:03 (3172): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:56:04 (3172): No heartbeat from core client for 30 sec - exiting
08:56:05 (3172): No heartbeat from core client for 30 sec - exiting
08:56:06 (3172): No heartbeat from core client for 30 sec - exiting
08:56:07 (3172): No heartbeat from core client for 30 sec - exiting
08:56:08 (3172): No heartbeat from core client for 30 sec - exiting
08:56:09 (3172): No heartbeat from core client for 30 sec - exiting
08:56:10 (3172): No heartbeat from core client for 30 sec - exiting
08:56:11 (3172): No heartbeat from core client for 30 sec - exiting
08:56:12 (3172): No heartbeat from core client for 30 sec - exiting
08:56:13 (3172): No heartbeat from core client for 30 sec - exiting
08:56:14 (3172): No heartbeat from core client for 30 sec - exiting
08:56:15 (3172): No heartbeat from core client for 30 sec - exiting
08:56:16 (3172): No heartbeat from core client for 30 sec - exiting
08:56:17 (3172): No heartbeat from core client for 30 sec - exiting
08:56:18 (3172): No heartbeat from core client for 30 sec - exiting
08:56:19 (3172): No heartbeat from core client for 30 sec - exiting
08:56:20 (3172): No heartbeat from core client for 30 sec - exiting
08:56:21 (3172): No heartbeat from core client for 30 sec - exiting
08:56:22 (3172): No heartbeat from core client for 30 sec - exiting
08:56:23 (3172): No heartbeat from core client for 30 sec - exiting
08:56:24 (3172): No heartbeat from core client for 30 sec - exiting
08:56:25 (3172): No heartbeat from core client for 30 sec - exiting
08:56:26 (3172): No heartbeat from core client for 30 sec - exiting
08:56:27 (3172): No heartbeat from core client for 30 sec - exiting
08:56:28 (3172): No heartbeat from core client for 30 sec - exiting
08:56:29 (3172): No heartbeat from core client for 30 sec - exiting
08:56:30 (3172): No heartbeat from core client for 30 sec - exiting
08:56:31 (3172): No heartbeat from core client for 30 sec - exiting
08:56:32 (3172): No heartbeat from core client for 30 sec - exiting
08:56:33 (3172): No heartbeat from core client for 30 sec - exiting
08:56:34 (3172): No heartbeat from core client for 30 sec - exiting
08:56:35 (3172): No heartbeat from core client for 30 sec - exiting
08:56:36 (3172): No heartbeat from core client for 30 sec - exiting
08:56:37 (3172): No heartbeat from core client for 30 sec - exiting
08:56:38 (3172): No heartbeat from core client for 30 sec - exiting
08:56:39 (3172): No heartbeat from core client for 30 sec - exiting
08:56:40 (3172): No heartbeat from core client for 30 sec - exiting
08:56:41 (3172): No heartbeat from core client for 30 sec - exiting
08:56:42 (3172): No heartbeat from core client for 30 sec - exiting
08:56:43 (3172): No heartbeat from core client for 30 sec - exiting
08:56:44 (3172): No heartbeat from core client for 30 sec - exiting
08:56:45 (3172): No heartbeat from core client for 30 sec - exiting
08:56:46 (3172): No heartbeat from core client for 30 sec - exiting
08:56:47 (3172): No heartbeat from core client for 30 sec - exiting
08:56:48 (3172): No heartbeat from core client for 30 sec - exiting
08:56:49 (3172): No heartbeat from core client for 30 sec - exiting
08:56:50 (3172): No heartbeat from core client for 30 sec - exiting
08:56:51 (3172): No heartbeat from core client for 30 sec - exiting
08:56:52 (3172): No heartbeat from core client for 30 sec - exiting
08:56:53 (3172): No heartbeat from core client for 30 sec - exiting
08:56:54 (3172): No heartbeat from core client for 30 sec - exiting
08:56:55 (3172): No heartbeat from core client for 30 sec - exiting
08:56:56 (3172): No heartbeat from core client for 30 sec - exiting
08:56:57 (3172): No heartbeat from core client for 30 sec - exiting
08:56:58 (3172): No heartbeat from core client for 30 sec - exiting
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5004, selfPID=0, iMonCtr=1
No Process Handle
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5004, selfPID=5004, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
12:14:49 (4808): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:46:25 (360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:43:36 (2492): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:07:48 (2448): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
04:28:21 (1968): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:31:21 (4316): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:31:22 (4316): No heartbeat from core client for 30 sec - exiting
04:31:23 (4316): No heartbeat from core client for 30 sec - exiting
04:31:24 (4316): No heartbeat from core client for 30 sec - exiting
04:31:25 (4316): No heartbeat from core client for 30 sec - exiting
04:31:26 (4316): No heartbeat from core client for 30 sec - exiting
04:31:27 (4316): No heartbeat from core client for 30 sec - exiting
04:31:28 (4316): No heartbeat from core client for 30 sec - exiting
04:31:29 (4316): No heartbeat from core client for 30 sec - exiting

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakm.pipe_dummy 2048
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4900, selfPID=4900, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4900, selfPID=3696, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
08:34:35 (3696): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_saf_24x4_1999_1_007038464_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_24x4_1999_1_007038464_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_24x4_1999_1_007038464_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
15 Dec 2010 10:07:33  1077553  12325619   hadam3p_saf_24x4_1999_1_007038464_0  103,776   356,310         3.4335
14 Dec 2010 14:23:31  1077553  12325619   hadam3p_saf_24x4_1999_1_007038464_0  92,256    316,684         3.4327
14 Dec 2010 10:15:33  1077553  12325619   hadam3p_saf_24x4_1999_1_007038464_0  80,736    277,301         3.4347
13 Dec 2010 15:35:23  1077553  12325619   hadam3p_saf_24x4_1999_1_007038464_0  69,222    238,434         3.4445
13 Dec 2010 15:08:20  1077553  12325619   hadam3p_saf_24x4_1999_1_007038464_0  69,216    237,854         3.4364
13 Dec 2010 11:45:44  1077553  12325619   hadam3p_saf_24x4_1999_1_007038464_0  57,696    199,209         3.4527
12 Dec 2010 16:36:37  1077553  12325619   hadam3p_saf_24x4_1999_1_007038464_0  46,176    159,635         3.4571
12 Dec 2010 09:41:43  1077553  12325619   hadam3p_saf_24x4_1999_1_007038464_0  34,656    119,557         3.4498
11 Dec 2010 17:36:56  1077553  12325619   hadam3p_saf_24x4_1999_1_007038464_0  23,136    79,308          3.4279
11 Dec 2010 08:51:38  1077553  12325619   hadam3p_saf_24x4_1999_1_007038464_0  11,616    40,467          3.4837
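
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The page does not state this; it is an assumption drawn from the column headers and the values. A minimal Python sketch that checks the relationship against three rows copied from the trickle table (the function name `average_sec_per_timestep` is hypothetical, not part of BOINC or CPDN):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
# Assumption: Average (sec/TS) = CPU Time (sec) / Timestep.
trickles = [
    (103776, 356310, 3.4335),
    (92256, 316684, 3.4327),
    (11616, 40467, 3.4837),
]


def average_sec_per_timestep(cpu_time_sec, timestep):
    """Return cumulative CPU seconds spent per model timestep."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in trickles:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # Print the recomputed value next to the value reported on the page.
    print(f"{timestep:>7}  computed={computed:.4f}  reported={reported}")
```

For these three rows the recomputed values agree with the reported column to four decimal places.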