climateprediction.net home page
Task 14104805

Task 14104805

Name hadcm3n_o4ul_1980_40_007753575_0
Workunit 7908684
Created 17 Feb 2012, 11:17:51 UTC
Sent 17 Feb 2012, 11:17:55 UTC
Report deadline 18 May 2012, 18:45:06 UTC
Received 13 Mar 2012, 14:40:30 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -187 (0xFFFFFF45) ERR_RESULT_UPLOAD
Computer ID 1105487
Run time 8 days 11 hours 36 min 59 sec
CPU time 7 days 18 hours 53 min 33 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 3.04 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
upload failure
</message>
<stderr_txt>
17:29:20 (3832): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:29:21 (3832): No heartbeat from core client for 30 sec - exiting
17:29:22 (3832): No heartbeat from core client for 30 sec - exiting
17:29:23 (3832): No heartbeat from core client for 30 sec - exiting
17:29:24 (3832): No heartbeat from core client for 30 sec - exiting
17:29:25 (3832): No heartbeat from core client for 30 sec - exiting
17:29:26 (3832): No heartbeat from core client for 30 sec - exiting
17:29:27 (3832): No heartbeat from core client for 30 sec - exiting
17:29:28 (3832): No heartbeat from core client for 30 sec - exiting
17:29:29 (3832): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
22:46:27 (4200): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:46:29 (4200): No heartbeat from core client for 30 sec - exiting
22:46:30 (4200): No heartbeat from core client for 30 sec - exiting
22:46:31 (4200): No heartbeat from core client for 30 sec - exiting
22:46:32 (4200): No heartbeat from core client for 30 sec - exiting
22:46:33 (4200): No heartbeat from core client for 30 sec - exiting
22:46:34 (4200): No heartbeat from core client for 30 sec - exiting
22:46:35 (4200): No heartbeat from core client for 30 sec - exiting
22:46:36 (4200): No heartbeat from core client for 30 sec - exiting
22:46:37 (4200): No heartbeat from core client for 30 sec - exiting
22:46:38 (4200): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
15:01:10 (4380): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:01:11 (4380): No heartbeat from core client for 30 sec - exiting
15:01:12 (4380): No heartbeat from core client for 30 sec - exiting
15:01:13 (4380): No heartbeat from core client for 30 sec - exiting
15:01:14 (4380): No heartbeat from core client for 30 sec - exiting
15:01:15 (4380): No heartbeat from core client for 30 sec - exiting
15:01:16 (4380): No heartbeat from core client for 30 sec - exiting
15:01:17 (4380): No heartbeat from core client for 30 sec - exiting
15:01:18 (4380): No heartbeat from core client for 30 sec - exiting
15:01:19 (4380): No heartbeat from core client for 30 sec - exiting
15:01:20 (4380): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4216, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4216, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4216, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4216, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5116, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5116, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
23:48:19 (4168): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:48:20 (4168): No heartbeat from core client for 30 sec - exiting
23:48:21 (4168): No heartbeat from core client for 30 sec - exiting
23:48:22 (4168): No heartbeat from core client for 30 sec - exiting
23:48:23 (4168): No heartbeat from core client for 30 sec - exiting
23:48:25 (4168): No heartbeat from core client for 30 sec - exiting
23:48:26 (4168): No heartbeat from core client for 30 sec - exiting
23:48:27 (4168): No heartbeat from core client for 30 sec - exiting
23:48:28 (4168): No heartbeat from core client for 30 sec - exiting
23:48:29 (4168): No heartbeat from core client for 30 sec - exiting
23:48:30 (4168): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
23:12:41 (4188): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:12:43 (4188): No heartbeat from core client for 30 sec - exiting
23:12:44 (4188): No heartbeat from core client for 30 sec - exiting
23:12:45 (4188): No heartbeat from core client for 30 sec - exiting
23:12:46 (4188): No heartbeat from core client for 30 sec - exiting
23:12:47 (4188): No heartbeat from core client for 30 sec - exiting
23:12:48 (4188): No heartbeat from core client for 30 sec - exiting
23:12:49 (4188): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
14:11:25 (6044): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:11:26 (6044): No heartbeat from core client for 30 sec - exiting
14:11:27 (6044): No heartbeat from core client for 30 sec - exiting
14:11:28 (6044): No heartbeat from core client for 30 sec - exiting
14:11:29 (6044): No heartbeat from core client for 30 sec - exiting
14:11:30 (6044): No heartbeat from core client for 30 sec - exiting
14:11:31 (6044): No heartbeat from core client for 30 sec - exiting
14:11:32 (6044): No heartbeat from core client for 30 sec - exiting
14:11:33 (6044): No heartbeat from core client for 30 sec - exiting
14:11:34 (6044): No heartbeat from core client for 30 sec - exiting
14:11:35 (6044): No heartbeat from core client for 30 sec - exiting
15:04:12 (2520): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Abort request from BOINC...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
13 Mar 2012 14:40:56 1105487 14104805 hadcm3n_o4ul_1980_40_007753575_0 259,200 672,779 2.5956
12 Mar 2012 17:04:14 1105487 14104805 hadcm3n_o4ul_1980_40_007753575_0 233,280 626,397 2.6852
11 Mar 2012 20:57:19 1105487 14104805 hadcm3n_o4ul_1980_40_007753575_0 207,360 580,447 2.7992
10 Mar 2012 22:08:00 1105487 14104805 hadcm3n_o4ul_1980_40_007753575_0 181,440 534,497 2.9459
10 Mar 2012 00:07:41 1105487 14104805 hadcm3n_o4ul_1980_40_007753575_0 155,520 488,796 3.1430
09 Mar 2012 10:22:52 1105487 14104805 hadcm3n_o4ul_1980_40_007753575_0 129,600 442,828 3.4169
03 Mar 2012 20:32:18 1105487 14104805 hadcm3n_o4ul_1980_40_007753575_0 103,680 183,756 1.7723
02 Mar 2012 22:22:21 1105487 14104805 hadcm3n_o4ul_1980_40_007753575_0 77,760 139,423 1.7930
02 Mar 2012 09:17:08 1105487 14104805 hadcm3n_o4ul_1980_40_007753575_0 51,840 93,157 1.7970
01 Mar 2012 12:35:35 1105487 14104805 hadcm3n_o4ul_1980_40_007753575_0 25,920 46,672 1.8006


©2024 climateprediction.net