climateprediction.net home page
Task 12975678

Task 12975678

Name hadam3p_eu_2jga_1965_1_007291614_0
Workunit 7488888
Created 14 Jun 2011, 17:14:18 UTC
Sent 14 Jun 2011, 17:14:36 UTC
Report deadline 26 May 2012, 22:34:36 UTC
Received 17 Nov 2011, 0:44:49 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 194 (0x000000C2) EXIT_ABORTED_BY_CLIENT
Computer ID 1009995
Run time 6 days 21 hours 10 min 41 sec
CPU time 4 days 12 hours 20 min 6 sec
Validate state Invalid
Credit 1,392.75
Device peak FLOPS 2.14 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>6.6.36</core_client_version>
<![CDATA[
<message>
Got ack for job that's till active
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5184, selfPID=5184, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3584, selfPID=156, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7380, selfPID=4848, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4604, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5652, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6128, selfPID=1176, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7732, selfPID=7732, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6984, selfPID=4420, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
GCPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=564, selfPID=5916, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Colobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5084, iMonCtr=2

Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
13:16:07 (5780): No heartbeat from core client for 30 sec - exiting
13:16:08 (5780): No heartbeat from core client for 30 sec - exiting
13:16:09 (5780): No heartbeat from core client for 30 sec - exiting
13:16:10 (5780): No heartbeat from core client for 30 sec - exiting
13:16:11 (5780): No heartbeat from core client for 30 sec - exiting
13:16:12 (5780): No heartbeat from core client for 30 sec - exiting
13:16:13 (5780): No heartbeat from core client for 30 sec - exiting
13:16:14 (5780): No heartbeat from core client for 30 sec - exiting
13:16:15 (5780): No heartbeat from core client for 30 sec - exiting
13:16:16 (5780): No heartbeat from core client for 30 sec - exiting
13:16:17 (5780): No heartbeat from core client for 30 sec - exiting
13:16:18 (5780): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:16:19 (5780): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5548, selfPID=5040, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3000, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2200, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2964, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7864, selfPID=6588, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3456, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: C:: CPPN process is notDrN procesesi is not running, exiting, bRetVal = 1D=,1 checkPID=0, s
elfPID=4176, iMonCtr=2
Model crash detected, will try to restart...
11:57:03 (4512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4344, selfPID=4344, iMonCtr=2
22:26:10 (6352): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:33:45 (6596): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6836, selfPID=6836, iMonCtr=2
22:33:46 (6596): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3868, selfPID=3788, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5512, selfPID=5440, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6716, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
16:16:37 (5888): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4900, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
16:53:10 (4608): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7360, selfPID=7368, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6232, selfPID=6232, iMonCtr=2

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 32

Model crashed: 
Leaving CPDN_Main::Monitor...

zip error: Output file write failure (write error on zip file)
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
17 Nov 2011 00:46:20 1009995 12975678 hadam3p_eu_2jga_1965_1_007291614_0 80,736 376,622 4.6649
16 Sep 2011 11:32:52 1009995 12975678 hadam3p_eu_2jga_1965_1_007291614_0 69,216 324,424 4.6871
25 Aug 2011 02:21:49 1009995 12975678 hadam3p_eu_2jga_1965_1_007291614_0 57,696 271,411 4.7042
06 Aug 2011 18:57:36 1009995 12975678 hadam3p_eu_2jga_1965_1_007291614_0 46,176 218,178 4.7249
26 Jul 2011 19:41:44 1009995 12975678 hadam3p_eu_2jga_1965_1_007291614_0 34,656 163,358 4.7137
25 Jul 2011 17:37:49 1009995 12975678 hadam3p_eu_2jga_1965_1_007291614_0 23,136 109,603 4.7373
08 Jul 2011 18:38:20 1009995 12975678 hadam3p_eu_2jga_1965_1_007291614_0 11,616 55,949 4.8165


©2024 climateprediction.net