climateprediction.net home page
Task 16045760

Task 16045760

Name hadcm3n_ofdj_1900_40_008475050_0
Workunit 8625889
Created 27 Sep 2013, 10:34:09 UTC
Sent 27 Sep 2013, 18:59:33 UTC
Report deadline 28 Dec 2013, 2:26:44 UTC
Received 6 Nov 2013, 21:04:57 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1212841
Run time 9 days 17 hours 19 min 54 sec
CPU time 9 days 14 hours 54 min 25 sec
Validate state Invalid
Credit 9,953.28
Device peak FLOPS 2.92 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk.
 (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5728, iMonCtr=1
Model crash detected, will try to restart...
21:15:05 (3304): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:43:59 (4016): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:51:05 (3460): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4164, iMonCtr=1
Model crash detected, will try to restart...
07:48:56 (2132): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:03:27 (5568): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CSuspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=1
Model crash detected, will try to restart...
18:53:53 (5628): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3088, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3888, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
19:22:28 (4888): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4488, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5888, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4512, iMonCtr=1
Model crash detected, will try to restart...
05:32:34 (5044): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5736, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4676, iMonCtr=1
Model crash detected, will try to restart...
20:00:35 (5844): No heartbeat from core client for 30 sec - exiting
20:00:36 (5844): No heartbeat from core client for 30 sec - exiting
20:00:37 (5844): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
19:23:23 (3712): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:24:41 (2540): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
03 Nov 2013 20:54:53 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 829,440 819,864 0.9885
03 Nov 2013 13:19:13 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 803,520 792,879 0.9868
02 Nov 2013 19:07:50 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 777,600 766,239 0.9854
02 Nov 2013 11:59:22 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 751,680 741,019 0.9858
01 Nov 2013 19:13:01 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 725,760 714,369 0.9843
29 Oct 2013 19:27:59 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 699,840 687,907 0.9829
28 Oct 2013 14:50:41 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 673,920 677,301 1.0050
28 Oct 2013 07:50:14 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 648,000 652,136 1.0064
27 Oct 2013 16:08:39 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 622,080 625,711 1.0058
27 Oct 2013 08:53:21 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 596,160 600,507 1.0073
26 Oct 2013 16:05:08 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 570,240 574,314 1.0071
26 Oct 2013 08:52:11 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 544,320 548,599 1.0079
23 Oct 2013 22:13:13 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 518,400 522,188 1.0073
20 Oct 2013 19:13:25 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 492,480 495,769 1.0067
20 Oct 2013 11:50:48 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 466,560 469,365 1.0060
19 Oct 2013 18:23:07 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 440,640 442,502 1.0042
16 Oct 2013 20:42:23 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 414,720 415,565 1.0020
14 Oct 2013 18:32:24 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 388,800 388,810 1.0000
13 Oct 2013 12:52:00 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 362,880 362,863 1.0000
12 Oct 2013 19:48:39 1212841 16045760 hadcm3n_ofdj_1900_40_008475050_0 336,960 336,597 0.9989


©2024 climateprediction.net