climateprediction.net home page
Task 11569695

Task 11569695

Name famous_unv0_1399_200_006663767_2
Workunit 6867139
Created 10 Jun 2010, 15:31:03 UTC
Sent 3 Jul 2010, 18:51:01 UTC
Report deadline 3 Oct 2010, 2:18:12 UTC
Received 17 Jul 2010, 20:03:32 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1085414
Run time 2 days 16 hours 47 min 44 sec
CPU time 2 days 12 hours 31 min 2 sec
Validate state Invalid
Credit 525.07
Device peak FLOPS 0.74 GFLOPS
Application version UK Met Office FAMOUS v6.11
i686-pc-linux-gnu
Stderr
<core_client_version>6.10.17</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8260, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11064, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11064, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11064, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11064, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11064, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11064, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11064, iMonCtr=1
Model crash detected, will try to restart...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
Signal 3 received, exiting...
 (15435): called boinc_finish

BUFFIN: Read Failed: Inappropriate ioctl for device
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8044, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8044, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=17274, selfPID=17274, iMonCtr=1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18877, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18877, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18877, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18877, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18877, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18877, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19724, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19724, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19840, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19874, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19874, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
 (19874): called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
16 Jul 2010 17:38:32 1085414 11569695 famous_unv0_1399_200_006663767_2 159,146 209,958 1.3193
16 Jul 2010 11:53:13 1085414 11569695 famous_unv0_1399_200_006663767_2 149,786 196,930 1.3147
15 Jul 2010 20:42:49 1085414 11569695 famous_unv0_1399_200_006663767_2 140,426 184,736 1.3155
15 Jul 2010 17:14:05 1085414 11569695 famous_unv0_1399_200_006663767_2 131,066 172,977 1.3198
15 Jul 2010 13:42:56 1085414 11569695 famous_unv0_1399_200_006663767_2 121,706 161,046 1.3232
15 Jul 2010 10:17:42 1085414 11569695 famous_unv0_1399_200_006663767_2 112,346 149,320 1.3291
15 Jul 2010 06:48:34 1085414 11569695 famous_unv0_1399_200_006663767_2 102,986 137,561 1.3357
14 Jul 2010 22:16:48 1085414 11569695 famous_unv0_1399_200_006663767_2 93,626 125,021 1.3353
14 Jul 2010 18:04:44 1085414 11569695 famous_unv0_1399_200_006663767_2 84,266 111,499 1.3232
14 Jul 2010 10:16:59 1085414 11569695 famous_unv0_1399_200_006663767_2 74,906 100,022 1.3353
14 Jul 2010 00:47:10 1085414 11569695 famous_unv0_1399_200_006663767_2 65,546 88,877 1.3559
12 Jul 2010 18:06:51 1085414 11569695 famous_unv0_1399_200_006663767_2 56,186 77,651 1.3820
12 Jul 2010 03:25:16 1085414 11569695 famous_unv0_1399_200_006663767_2 46,826 64,294 1.3730
11 Jul 2010 09:52:02 1085414 11569695 famous_unv0_1399_200_006663767_2 37,466 50,537 1.3489
05 Jul 2010 22:20:28 1085414 11569695 famous_unv0_1399_200_006663767_2 28,106 37,622 1.3386
05 Jul 2010 18:24:36 1085414 11569695 famous_unv0_1399_200_006663767_2 18,746 24,450 1.3043
05 Jul 2010 14:49:20 1085414 11569695 famous_unv0_1399_200_006663767_2 9,386 12,234 1.3034


©2024 climateprediction.net