Task 11035537

Name hadsm3dhet2_jqj6_006597380_9
Workunit 6800753
Created 15 Mar 2010, 12:03:50 UTC
Sent 27 Sep 2010, 23:23:12 UTC
Report deadline 10 Sep 2011, 4:43:12 UTC
Received 10 Oct 2010, 9:14:31 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -2 (0xFFFFFFFE) Unknown error code
Computer ID 960025
Run time 8 days 14 hours 11 min 4 sec
CPU time 8 days 12 hours 13 min 31 sec
Validate state Invalid
Credit 3,969.74
Device peak FLOPS 2.39 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07 (windows_intelx86)
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
 - exit code -2 (0xfffffffe)
</message>
<stderr_txt>
CCPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7248, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7248, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6680, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5424, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3952, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6772, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7472, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6356, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7048, iMonCtr=1
Model crash detected, will try to restart...
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
MainError:	09:55:13 PM	No files match the supplied pattern.
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
BUFFOUT: C I/O Error - Return code = 32

Model crashed: 
Could not launch model process. Last Error=8
called boinc_finish

</stderr_txt>
]]>
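The exit status is shown above in two forms, -2 and 0xFFFFFFFE. These are the same 32-bit value read as a signed and as an unsigned integer. A minimal sketch of the conversion follows; the helper names are hypothetical and are not part of BOINC or the CPDN application.

```python
# Hypothetical helpers: convert between the signed and unsigned
# 32-bit readings of a process exit code.

def to_unsigned32(code: int) -> int:
    """Reinterpret a signed 32-bit exit code as unsigned."""
    return code & 0xFFFFFFFF

def to_signed32(value: int) -> int:
    """Reinterpret an unsigned 32-bit value as signed (two's complement)."""
    value &= 0xFFFFFFFF
    return value - 0x1_0000_0000 if value & 0x8000_0000 else value

if __name__ == "__main__":
    print(f"0x{to_unsigned32(-2):08X}")
    print(to_signed32(0xFFFFFFFE))
```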
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
10 Oct 2010 04:17:28 960025 11035537 hadsm3dhet2_jqj6_006597380_9 172,832 717,468 1.6605
10 Oct 2010 04:17:28 960025 11035537 hadsm3dhet2_jqj6_006597380_9 162,030 699,553 1.6605
10 Oct 2010 04:17:28 960025 11035537 hadsm3dhet2_jqj6_006597380_9 151,228 681,705 1.6608
10 Oct 2010 04:17:28 960025 11035537 hadsm3dhet2_jqj6_006597380_9 140,426 664,013 1.6614
10 Oct 2010 04:17:28 960025 11035537 hadsm3dhet2_jqj6_006597380_9 129,624 646,433 1.6623
10 Oct 2010 04:17:28 960025 11035537 hadsm3dhet2_jqj6_006597380_9 118,822 628,889 1.6634
10 Oct 2010 04:17:28 960025 11035537 hadsm3dhet2_jqj6_006597380_9 108,020 611,096 1.6639
10 Oct 2010 04:17:28 960025 11035537 hadsm3dhet2_jqj6_006597380_9 97,218 593,181 1.6641
10 Oct 2010 04:17:28 960025 11035537 hadsm3dhet2_jqj6_006597380_9 86,416 575,307 1.6644
10 Oct 2010 04:17:28 960025 11035537 hadsm3dhet2_jqj6_006597380_9 75,614 557,373 1.6645
10 Oct 2010 04:17:28 960025 11035537 hadsm3dhet2_jqj6_006597380_9 64,812 539,429 1.6646
10 Oct 2010 04:17:28 960025 11035537 hadsm3dhet2_jqj6_006597380_9 54,010 521,481 1.6647
10 Oct 2010 04:17:28 960025 11035537 hadsm3dhet2_jqj6_006597380_9 43,208 503,527 1.6648
10 Oct 2010 04:17:27 960025 11035537 hadsm3dhet2_jqj6_006597380_9 32,406 485,630 1.6651
06 Oct 2010 09:03:40 960025 11035537 hadsm3dhet2_jqj6_006597380_9 21,604 467,657 1.6651
06 Oct 2010 04:28:17 960025 11035537 hadsm3dhet2_jqj6_006597380_9 10,802 449,834 1.6657
05 Oct 2010 21:57:27 960025 11035537 hadsm3dhet2_jqj6_006597380_9 259,248 431,923 1.6661
05 Oct 2010 16:13:49 960025 11035537 hadsm3dhet2_jqj6_006597380_9 248,446 414,009 1.6664
05 Oct 2010 10:36:39 960025 11035537 hadsm3dhet2_jqj6_006597380_9 237,644 396,081 1.6667
05 Oct 2010 05:18:34 960025 11035537 hadsm3dhet2_jqj6_006597380_9 226,842 378,176 1.6671
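
The Average (sec/TS) column in the rows above is consistent with CPU time divided by the cumulative number of timesteps, provided the Timestep counter is assumed to restart after reaching 259,248. That restart point is inferred from the rows themselves and is not stated on the page. A rough sketch of the check, with a hypothetical function name and constant:

```python
# Assumption inferred from the table: the Timestep counter restarts
# after 259,248 steps, so later rows belong to a second phase.
PHASE_LENGTH = 259_248

def average_sec_per_timestep(cpu_time_sec: float, timestep: int,
                             completed_phases: int = 0) -> float:
    """CPU seconds per timestep over all timesteps completed so far."""
    cumulative = completed_phases * PHASE_LENGTH + timestep
    return cpu_time_sec / cumulative

# (timestep, CPU time in sec, completed phases, reported average)
rows = [
    (259_248, 431_923, 0, 1.6661),
    (10_802, 449_834, 1, 1.6657),
    (172_832, 717_468, 1, 1.6605),
]

if __name__ == "__main__":
    for timestep, cpu, phases, reported in rows:
        computed = average_sec_per_timestep(cpu, timestep, phases)
        print(f"{timestep:>7}  computed {computed:.4f}  reported {reported}")
```

Under the same assumption, the most recent trickle corresponds to 259,248 + 172,832 = 432,080 cumulative timesteps.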