climateprediction.net home page
Task 10989643

Name: hadsm3dhet2_jmzp_006592791_6
Workunit: 6796164
Created: 15 Mar 2010, 11:57:57 UTC
Sent: 14 Oct 2010, 15:17:43 UTC
Report deadline: 26 Sep 2011, 20:37:43 UTC
Received: 21 Apr 2011, 21:20:58 UTC
Server state: Over
Outcome: Computation error
Client state: Compute error
Exit status: -177 (0xFFFFFF4F) ERR_RSC_LIMIT_EXCEEDED
Computer ID: 1045226
Run time: 123 days 7 hours 20 min 55 sec
CPU time: 116 days 20 hours 39 min 10 sec
Validate state: Invalid
Credit: 4,366.71
Device peak FLOPS: 2.32 GFLOPS
Application version: UK Met Office HadSM3 Slab Model v6.07 windows_intelx86
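
The exit status above is listed both as a signed decimal (-177) and as a hex value (0xFFFFFF4F). A minimal sketch, assuming the hex form is simply the 32-bit two's-complement view of the signed value, shows how the two relate. The helper names `to_u32_hex` and `from_u32_hex` are hypothetical and are not part of BOINC.

```python
def to_u32_hex(status: int) -> str:
    """Format a signed exit status as its 32-bit two's-complement hex form."""
    # Masking with 0xFFFFFFFF maps negative values into the range 0..2**32-1.
    return f"0x{status & 0xFFFFFFFF:08X}"


def from_u32_hex(text: str) -> int:
    """Recover the signed 32-bit value from a hex string such as '0xFFFFFF4F'."""
    value = int(text, 16)
    # Values with the top bit set represent negative numbers.
    return value - (1 << 32) if value >= (1 << 31) else value


if __name__ == "__main__":
    print(to_u32_hex(-177))            # 0xFFFFFF4F
    print(from_u32_hex("0xFFFFFF4F"))  # -177
```

Both directions agree with the values shown in the Exit status field.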
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
Maximum elapsed time exceeded
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
MainError:	02:29:11 AM	No files match the supplied pattern.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6064, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5784, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1732, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6136, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7784, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5820, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5632, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5716, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5196, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5544, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3948, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9772, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8220, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5680, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4580, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5528, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Abort request from BOINC...
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                   Timestep  CPU Time (sec)  Average (sec/TS)
20 Apr 2011 15:27:28  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  216,040   9,823,638       20.6688
09 Apr 2011 17:05:27  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  205,238   9,203,923       19.8153
30 Mar 2011 10:34:16  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  194,436   8,611,515       18.9813
12 Mar 2011 23:06:25  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  183,634   8,013,323       18.0936
08 Mar 2011 10:04:32  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  172,832   7,441,227       17.2219
26 Feb 2011 08:24:41  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  162,030   6,854,741       16.2713
19 Feb 2011 14:33:50  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  151,228   6,265,670       15.2644
09 Feb 2011 00:31:16  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  140,426   5,676,856       14.2037
27 Jan 2011 11:46:13  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  129,624   5,088,404       13.0850
17 Jan 2011 14:31:25  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  118,822   4,502,811       11.9100
04 Jan 2011 18:43:26  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  108,020   3,916,565       10.6641
12 Dec 2010 23:24:49  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  97,218    3,330,777       9.3439
05 Dec 2010 08:57:11  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  86,416    2,743,132       7.9358
27 Nov 2010 15:17:36  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  75,614    2,157,891       6.4441
20 Nov 2010 06:27:59  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  64,812    1,572,741       4.8532
01 Nov 2010 10:42:04  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  54,010    983,025         3.1381
21 Oct 2010 21:16:43  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  43,208    473,452         1.5654
21 Oct 2010 16:24:24  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  32,406    456,918         1.5666
21 Oct 2010 11:54:03  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  21,604    440,764         1.5694
21 Oct 2010 07:16:50  1045226  10989643   hadsm3dhet2_jmzp_006592791_6  10,802    424,412         1.5716
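
The Timestep and CPU Time columns in the trickle table are cumulative, so differencing consecutive rows gives the CPU cost of each interval. The sketch below does this for the five earliest trickles, copied from the table in chronological order. The function name `interval_rates` is hypothetical, and this calculation is an assumption about how to read the data: it does not attempt to reproduce the server's own "Average (sec/TS)" column.

```python
# (timestep, cumulative CPU seconds) for the five earliest trickles, oldest first.
rows = [
    (10802, 424412),
    (21604, 440764),
    (32406, 456918),
    (43208, 473452),
    (54010, 983025),
]


def interval_rates(rows):
    """Return CPU seconds per timestep for each pair of consecutive trickles."""
    rates = []
    for (ts0, cpu0), (ts1, cpu1) in zip(rows, rows[1:]):
        rates.append((cpu1 - cpu0) / (ts1 - ts0))
    return rates


if __name__ == "__main__":
    for rate in interval_rates(rows):
        print(f"{rate:.2f} CPU sec per timestep")
```

In these rows the first three intervals cost roughly 1.5 CPU seconds per timestep, while the interval ending at timestep 54,010 costs about 47.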

