climateprediction.net home page
Task 19408960

Task 19408960

Name hadam3p_afr50_ezyg_201412_12_371_010408134_0
Workunit 10408134
Created 19 Mar 2016, 22:15:19 UTC
Sent 20 Mar 2016, 19:13:45 UTC
Report deadline 3 Mar 2017, 0:33:45 UTC
Received 6 Jun 2016, 20:25:34 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 14 (0x0000000E) Unknown error code
Computer ID 1393608
Run time 15 days 15 hours 6 min 43 sec
CPU time 1 days 1 hours 58 min 30 sec
Validate state Invalid
Credit 4,200.37
Device peak FLOPS 3.57 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Africa v7.22
windows_intelx86
Stderr
<core_client_version>7.6.22</core_client_version>
<![CDATA[
<message>
Not enough storage is available to complete this operation.
 (0xe) - exit code 14 (0xe)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12832, selfPID=12832, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15352, selfPID=12464, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9248, selfPID=5628, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11112, selfPID=11112, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11112, selfPID=13968, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9724, selfPID=13028, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9724, selfPID=9724, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
20:48:15 (9084): BOINC client no longer exists - exiting
20:48:16 (9084): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7888, selfPID=5336, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
22:46:35 (5336): BOINC client no longer exists - exiting
22:46:35 (5336): timer handler: client dead, exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6464, selfPID=6464, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6472, selfPID=3844, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
23:26:20 (3844): BOINC client no longer exists - exiting
23:26:20 (3844): timer handler: client dead, exiting
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13176, selfPID=11260, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8920, selfPID=1404, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13056, selfPID=8952, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13056, selfPID=13056, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8820, selfPID=8820, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8820, selfPID=8932, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5044, selfPID=5044, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5044, selfPID=11520, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12060, selfPID=12060, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12060, selfPID=7824, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6864, selfPID=10248, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
22:50:01 (10248): BOINC client no longer exists - exiting
22:50:01 (10248): timer handler: client dead, exiting
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8252, selfPID=8636, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6416, selfPID=2556, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8016, selfPID=10256, iMonCtr=1
No Process Handle
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8016, selfPID=8016, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11704, selfPID=11704, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11704, selfPID=10088, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1708, selfPID=1708, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1708, selfPID=11124, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11040, selfPID=2780, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9412, selfPID=2400, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15720, selfPID=15720, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15720, selfPID=10792, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=16116, selfPID=16116, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=16116, selfPID=6588, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6544, selfPID=6544, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6544, selfPID=8328, iMonCtr=1
No Process Handle
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5604, selfPID=5604, iMonCtr=1
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5604, selfPID=304, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11260, selfPID=10908, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7180, selfPID=5436, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
22:29:40 (5436): BOINC client no longer exists - exiting
22:29:40 (5436): timer handler: client dead, exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10632, selfPID=10632, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10632, selfPID=16188, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2852, selfPID=2852, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2852, selfPID=15008, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4276, selfPID=4276, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4276, selfPID=4816, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1012, selfPID=7720, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
23:03:28 (7720): BOINC client no longer exists - exiting
23:03:28 (7720): timer handler: client dead, exiting
CGlobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8596, selfPID=8596, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:29:49 (3384): Can't acquire lockfile (32) - waiting 35s
19:30:24 (3384): Can't acquire lockfile (32) - exiting
19:30:24 (3384): Error: The process cannot access the file because it is being used by another process.

 (0x20)
19:40:43 (5604): Can't acquire lockfile (32) - waiting 35s
19:41:18 (5604): Can't acquire lockfile (32) - exiting
19:41:18 (5604): Error: The process cannot access the file because it is being used by another process.

 (0x20)
19:52:09 (5140): Can't acquire lockfile (32) - waiting 35s
19:52:44 (5140): Can't acquire lockfile (32) - exiting
19:52:44 (5140): Error: The process cannot access the file because it is being used by another process.

 (0x20)
20:02:50 (13808): Can't acquire lockfile (32) - waiting 35s
20:03:25 (13808): Can't acquire lockfile (32) - exiting
20:03:25 (13808): Error: The process cannot access the file because it is being used by another process.

 (0x20)
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/xaakm.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
BUFFOUT: C I/O Error - Return code = 32

Model crashed: 
Leaving CPDN_Main::Monitor...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
08 May 2016 18:00:54 1393608 19408960 hadam3p_afr50_ezyg_201412_12_371_010408134_0 46,379 173,787 3.7471
21 Apr 2016 20:07:38 1393608 19408960 hadam3p_afr50_ezyg_201412_12_371_010408134_0 34,859 119,447 3.4266
27 Mar 2016 20:34:04 1393608 19408960 hadam3p_afr50_ezyg_201412_12_371_010408134_0 23,339 80,469 3.4478
25 Mar 2016 13:21:59 1393608 19408960 hadam3p_afr50_ezyg_201412_12_371_010408134_0 11,819 38,752 3.2788


©2024 climateprediction.net