Task 13648577

Name hadcm3n_ydis_1900_40_007518612_3
Workunit 7716087
Created 20 Nov 2011, 10:44:43 UTC
Sent 20 Nov 2011, 10:51:02 UTC
Report deadline 19 Feb 2012, 18:18:13 UTC
Received 4 Mar 2012, 1:57:44 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1163145
Run time 25 days 7 hours 6 min 24 sec
CPU time 25 days 1 hours 44 min 59 sec
Validate state Invalid
Credit 12,441.60
Device peak FLOPS 1.80 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4320, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4772, iMonCtr=1
Model crash detected, will try to restart...
Ocean Restart file copy failed on ydisko.daa35r0
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4232, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
18:40:56 (4144): No heartbeat from core client for 30 sec - exiting
18:40:57 (4144): No heartbeat from core client for 30 sec - exiting
18:40:58 (4144): No heartbeat from core client for 30 sec - exiting
18:40:59 (4144): No heartbeat from core client for 30 sec - exiting
18:41:00 (4144): No heartbeat from core client for 30 sec - exiting
18:41:01 (4144): No heartbeat from core client for 30 sec - exiting
18:41:02 (4144): No heartbeat from core client for 30 sec - exiting
18:41:03 (4144): No heartbeat from core client for 30 sec - exiting
18:41:04 (4144): No heartbeat from core client for 30 sec - exiting
18:41:05 (4144): No heartbeat from core client for 30 sec - exiting
18:41:06 (4144): No heartbeat from core client for 30 sec - exiting
18:41:07 (4144): No heartbeat from core client for 30 sec - exiting
18:41:08 (4144): No heartbeat from core client for 30 sec - exiting
18:41:09 (4144): No heartbeat from core client for 30 sec - exiting
18:41:10 (4144): No heartbeat from core client for 30 sec - exiting
18:41:11 (4144): No heartbeat from core client for 30 sec - exiting
18:41:12 (4144): No heartbeat from core client for 30 sec - exiting
18:41:13 (4144): No heartbeat from core client for 30 sec - exiting
18:41:14 (4144): No heartbeat from core client for 30 sec - exiting
18:41:15 (4144): No heartbeat from core client for 30 sec - exiting
18:41:16 (4144): No heartbeat from core client for 30 sec - exiting
18:41:17 (4144): No heartbeat from core client for 30 sec - exiting
18:41:18 (4144): No heartbeat from core client for 30 sec - exiting
18:41:19 (4144): No heartbeat from core client for 30 sec - exiting
18:41:20 (4144): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4172, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4040, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4068, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Ocean Restart file copy failed on ydisko.dab34e0
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4012, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4012, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Ocean Restart file copy failed on ydisko.dac0cp0
05:59:37 (4324): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/ydisko.pjc6c10
Error converting file to netcdf: dataout/ydisko.pic6c10
Error converting file to netcdf: dataout/ydisko.pfc6c10
Error converting file to netcdf: dataout/ydiska.phc6c10
Error converting file to netcdf: dataout/ydiska.pgc6c10
Error converting file to netcdf: dataout/ydiska.pec6c10
Error converting file to netcdf: dataout/ydiska.pdc6c10
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4396, iMonCtr=1
Model crash detected, will try to restart...
Ocean Restart file copy failed on ydisko.dad04c0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3724, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
04 Mar 2012 01:00:09 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 1,036,800 | 2,166,287 | 2.0894
03 Mar 2012 02:42:25 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 1,010,880 | 2,111,617 | 2.0889
28 Feb 2012 17:38:58 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 984,960 | 2,057,896 | 2.0893
27 Feb 2012 06:03:37 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 959,040 | 2,004,740 | 2.0904
26 Feb 2012 15:20:48 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 933,120 | 1,951,984 | 2.0919
25 Feb 2012 16:54:34 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 907,200 | 1,897,622 | 2.0917
24 Feb 2012 12:38:01 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 881,280 | 1,843,057 | 2.0913
21 Feb 2012 11:25:30 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 855,360 | 1,788,788 | 2.0913
20 Feb 2012 09:45:05 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 829,440 | 1,734,060 | 2.0906
19 Feb 2012 08:53:15 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 803,520 | 1,679,448 | 2.0901
29 Jan 2012 18:08:20 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 777,600 | 1,625,364 | 2.0902
29 Jan 2012 03:16:39 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 751,680 | 1,571,958 | 2.0913
25 Jan 2012 12:09:29 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 725,760 | 1,517,872 | 2.0914
24 Jan 2012 08:48:27 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 699,840 | 1,464,009 | 2.0919
21 Jan 2012 17:12:36 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 673,920 | 1,410,651 | 2.0932
19 Jan 2012 08:11:24 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 648,000 | 1,356,997 | 2.0941
16 Jan 2012 13:26:16 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 622,080 | 1,302,914 | 2.0944
15 Jan 2012 13:06:21 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 596,160 | 1,248,960 | 2.0950
14 Jan 2012 13:03:04 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 570,240 | 1,194,608 | 2.0949
13 Jan 2012 12:58:35 | 1163145 | 13648577 | hadcm3n_ydis_1900_40_007518612_3 | 544,320 | 1,140,088 | 2.0945
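
The Average (sec/TS) figures in the trickle table appear to be the cumulative CPU time divided by the cumulative timestep count. The following is a minimal Python sketch that checks this reading against three rows copied from the table; the function name `seconds_per_timestep` is hypothetical and not part of any CPDN or BOINC tooling.

```python
# Rows copied from the trickle table: (timestep, CPU time in sec, reported average).
ROWS = [
    (1_036_800, 2_166_287, 2.0894),
    (1_010_880, 2_111_617, 2.0889),
    (544_320, 1_140_088, 2.0945),
]


def seconds_per_timestep(cpu_seconds: int, timesteps: int) -> float:
    """Assumed definition of the Average column: CPU seconds per model timestep."""
    return cpu_seconds / timesteps


for timesteps, cpu_seconds, reported in ROWS:
    computed = seconds_per_timestep(cpu_seconds, timesteps)
    # Compare to the page's value, which is shown to four decimal places.
    agrees = abs(computed - reported) < 5e-5
    print(f"{timesteps:>9,} TS: computed {computed:.4f}, reported {reported:.4f}, agrees={agrees}")
```

If the assumption holds, each computed value should match the reported one to four decimal places.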

