Task 11813174

Name famous_vj7b_1599_200_006710653_0
Workunit 6913906
Created 26 Aug 2010, 17:13:20 UTC
Sent 8 Nov 2010, 22:17:01 UTC
Report deadline 8 Feb 2011, 5:44:12 UTC
Received 21 Dec 2010, 17:24:42 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 875054
Run time
CPU time 9 days 16 hours 16 min 55 sec
Validate state Invalid
Credit 2,285.33
Device peak FLOPS 0.89 GFLOPS
Application version UK Met Office FAMOUS v6.11 windows_intelx86
Stderr
<core_client_version>6.4.7</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3544, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
OPEN:  File Open Failed: Permission denied
OPEN:  Unable to Open File dataout/vj7bla#pb000001603c1+ for Read/Write

Model crashed: STWORK  : Error opening output PP file on unit 61                                                                                                                                                                                                               tmp/pipe_dummy                                                                  
OPEN:  File Open Failed: Permission denied
OPEN:  Unable to Open File dataout/vj7bla#pb000001603c1+ for Read/Write

Model crashed: PPCTL   : Error opening preassigned PPfile                                                                                                                                                                                                                      tmp/pipe_dummy                                                                  
CPDN Monitor - Quit request from BOINC...
15:00:35 (4396): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:19:01 (5192): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
19:19:49 (5896): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:29:20 (4140): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2708, iMonCtr=1
Model crash detected, will try to restart...
14:45:27 (6016): No heartbeat from core client for 30 sec - exiting
14:45:28 (6016): No heartbeat from core client for 30 sec - exiting
14:45:29 (6016): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:45:30 (6016): No heartbeat from core client for 30 sec - exiting
14:45:31 (6016): No heartbeat from core client for 30 sec - exiting
14:45:32 (6016): No heartbeat from core client for 30 sec - exiting
14:45:33 (6016): No heartbeat from core client for 30 sec - exiting
14:45:34 (6016): No heartbeat from core client for 30 sec - exiting
14:45:35 (6016): No heartbeat from core client for 30 sec - exiting
14:45:36 (6016): No heartbeat from core client for 30 sec - exiting
14:45:38 (6016): No heartbeat from core client for 30 sec - exiting
14:45:39 (6016): No heartbeat from core client for 30 sec - exiting
15:16:25 (7520): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:16:29 (7520): No heartbeat from core client for 30 sec - exiting
15:16:30 (7520): No heartbeat from core client for 30 sec - exiting
15:16:31 (7520): No heartbeat from core client for 30 sec - exiting
15:16:32 (7520): No heartbeat from core client for 30 sec - exiting
15:16:33 (7520): No heartbeat from core client for 30 sec - exiting
15:16:34 (7520): No heartbeat from core client for 30 sec - exiting
15:16:38 (7520): No heartbeat from core client for 30 sec - exiting
15:16:39 (7520): No heartbeat from core client for 30 sec - exiting
12:47:33 (6088): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:47:34 (6088): No heartbeat from core client for 30 sec - exiting
12:47:36 (6088): No heartbeat from core client for 30 sec - exiting
12:47:37 (6088): No heartbeat from core client for 30 sec - exiting

Model crashed: TEMPHIST: Write ERROR on history file for namelistNLIHISTO                                                                                                                                                                                                      tmp/pipe_dummy                                                                  
CPDN Monitor - Quit request from BOINC...
12:36:25 (4384): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:36:26 (4384): No heartbeat from core client for 30 sec - exiting
12:36:27 (4384): No heartbeat from core client for 30 sec - exiting
12:36:28 (4384): No heartbeat from core client for 30 sec - exiting
12:36:30 (4384): No heartbeat from core client for 30 sec - exiting
OPEN:  File Open Failed: Permission denied
OPEN:  Unable to Open File dataout/vj7bla#pb000001616c1+ for Read/Write

Model crashed: STWORK  : Error opening output PP file on unit 61                                                                                                                                                                                                               tmp/pipe_dummy                                                                  
OPEN:  File Open Failed: Permission denied
OPEN:  Unable to Open File dataout/vj7bla#pb000001616c1+ for Read/Write

Model crashed: PPCTL   : Error opening preassigned PPfile                                                                                                                                                                                                                      tmp/pipe_dummy                                                                  
11:04:53 (4868): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8148, iMonCtr=1
Model crash detected, will try to restart...
16:12:10 (6500): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6480, iMonCtr=1
Model crash detected, will try to restart...
16:16:11 (1424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8036, iMonCtr=1
Model crash detected, will try to restart...
16:10:22 (3348): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:36:54 (5580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
C16:31:29 (4316): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
10:08:45 (6872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
09:52:02 (1996): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
17:21:56 (7892): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:21:57 (7892): No heartbeat from core client for 30 sec - exiting
10:14:41 (5948): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
10:09:54 (5180): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: Result too large
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
14:05:34 (2700): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:27:37 (4540): No heartbeat from core client for 30 sec - exiting
17:27:38 (4540): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:36:07 (4132): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
09:53:45 (5200): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
09:44:34 (4568): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:46:22 (4184): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
07:39:59 (4652): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:32:58 (7592): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:10:25 (5108): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5620, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5620, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
22:53:23 (5620): called boinc_finish

</stderr_txt>
]]>
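The stderr log above repeats a small set of message types many times, so a tally by type makes the failure pattern easier to see. The following is a minimal sketch, not part of the CPDN or BOINC tooling: the `PATTERNS` labels and the `tally` function are hypothetical names, and the substrings are copied from the log lines above.

```python
from collections import Counter

# Hypothetical labels mapped to substrings copied from the stderr log above.
PATTERNS = {
    "no_heartbeat": "No heartbeat from core client",
    "quit_request": "Quit request from BOINC",
    "model_crash": "Model crashed:",
    "restart_attempt": "Model crash detected, will try to restart",
    "open_failed": "File Open Failed",
    "buffin_read_failed": "BUFFIN: Read Failed",
}


def tally(stderr_text: str) -> Counter:
    """Count how many lines match each known message type (first match wins)."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for label, needle in PATTERNS.items():
            if needle in line:
                counts[label] += 1
                break
    return counts


# A few lines taken from the log above, used only as sample input.
sample = """CPDN Monitor - Quit request from BOINC...
OPEN:  File Open Failed: Permission denied
Model crashed: STWORK  : Error opening output PP file on unit 61
15:00:35 (4396): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Model crash detected, will try to restart...
"""

for label, count in sorted(tally(sample).items()):
    print(f"{label}: {count}")
```

Lines that match none of the substrings (for example the `CPDN Monitor - No 'heartbeat' from BOINC...` line) are ignored rather than counted.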
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
18 Dec 2010 19:27:22  875054   11813174   famous_vj7b_1599_200_006710653_0  692,666   824,282         1.1900
18 Dec 2010 13:54:25  875054   11813174   famous_vj7b_1599_200_006710653_0  683,306   811,922         1.1882
18 Dec 2010 10:13:04  875054   11813174   famous_vj7b_1599_200_006710653_0  673,946   800,236         1.1874
16 Dec 2010 09:22:25  875054   11813174   famous_vj7b_1599_200_006710653_0  664,586   789,484         1.1879
15 Dec 2010 20:08:16  875054   11813174   famous_vj7b_1599_200_006710653_0  655,226   776,762         1.1855
15 Dec 2010 13:15:49  875054   11813174   famous_vj7b_1599_200_006710653_0  645,866   765,203         1.1848
15 Dec 2010 10:10:46  875054   11813174   famous_vj7b_1599_200_006710653_0  636,506   753,826         1.1843
14 Dec 2010 14:34:25  875054   11813174   famous_vj7b_1599_200_006710653_0  627,146   742,868         1.1845
14 Dec 2010 09:40:02  875054   11813174   famous_vj7b_1599_200_006710653_0  617,786   731,016         1.1833
13 Dec 2010 13:15:11  875054   11813174   famous_vj7b_1599_200_006710653_0  608,426   720,390         1.1840
13 Dec 2010 09:55:55  875054   11813174   famous_vj7b_1599_200_006710653_0  599,066   709,907         1.1850
12 Dec 2010 23:24:54  875054   11813174   famous_vj7b_1599_200_006710653_0  589,706   699,516         1.1862
12 Dec 2010 19:44:18  875054   11813174   famous_vj7b_1599_200_006710653_0  580,346   687,993         1.1855
12 Dec 2010 15:56:07  875054   11813174   famous_vj7b_1599_200_006710653_0  570,986   676,236         1.1843
12 Dec 2010 12:29:59  875054   11813174   famous_vj7b_1599_200_006710653_0  561,626   665,555         1.1851
11 Dec 2010 17:46:13  875054   11813174   famous_vj7b_1599_200_006710653_0  552,266   654,425         1.1850
11 Dec 2010 13:41:21  875054   11813174   famous_vj7b_1599_200_006710653_0  542,906   642,993         1.1844
11 Dec 2010 10:42:44  875054   11813174   famous_vj7b_1599_200_006710653_0  533,546   632,504         1.1855
11 Dec 2010 09:31:53  875054   11813174   famous_vj7b_1599_200_006710653_0  524,186   622,077         1.1867
10 Dec 2010 19:40:33  875054   11813174   famous_vj7b_1599_200_006710653_0  514,826   611,277         1.1873
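The Average (sec/TS) column in the table above appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. That reading is an assumption inferred from the numbers, not something the page states. A short sketch that checks it against three rows, using the hypothetical helper `seconds_per_timestep`:

```python
def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds divided by timesteps, rounded to 4 places.

    Assumption: this is how the 'Average (sec/TS)' column is derived.
    """
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds, reported average) copied from the table above.
rows = [
    (692_666, 824_282, 1.1900),
    (683_306, 811_922, 1.1882),
    (514_826, 611_277, 1.1873),
]

for timestep, cpu_time_sec, reported in rows:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    print(f"{timestep:>8,} timesteps: computed {computed:.4f}, reported {reported:.4f}")
```

The rows also show that consecutive trickles are 9,360 timesteps apart (for example 692,666 minus 683,306).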