Task 21508251

Name wah2_safr50_n12n_201612_16_789_011744067_0
Workunit 11744067
Created 4 Mar 2019, 14:09:49 UTC
Sent 6 Mar 2019, 9:27:59 UTC
Report deadline 16 Feb 2020, 14:47:59 UTC
Received 4 May 2019, 7:23:29 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1479061
Run time 5 days 15 hours 42 min 23 sec
CPU time 4 days 16 hours 18 min 54 sec
Validate state Invalid
Credit 6,859.15
Device peak FLOPS 4.24 GFLOPS
Application version Weather At Home 2 (wah2) v8.24 windows_intelx86
Peak working set size 254.09 MB
Peak swap size 220.10 MB
Peak disk usage 126.17 MB
Stderr
<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14596, selfPID=14596, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6848, selfPID=5548, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=16752, selfPID=12032, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11784, selfPID=11784, iMonCtr=1
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11784, selfPID=13896, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13884, selfPID=15504, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10676, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4952, selfPID=4952, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13032, selfPID=13032, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6872, selfPID=6872, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=21020, selfPID=21020, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4828, selfPID=4828, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13072, selfPID=13072, iMonCtr=1
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13072, selfPID=19108, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=20428, selfPID=19560, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4464, selfPID=4464, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13588, selfPID=13588, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6136, selfPID=6136, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11268, selfPID=11268, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7888, selfPID=8208, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8024, selfPID=5696, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=412, selfPID=412, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4300, selfPID=4300, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4280, selfPID=4280, iMonCtr=1
No Process Handle
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10528, selfPID=10528, iMonCtr=1
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10528, selfPID=18372, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14224, selfPID=17292, iMonCtr=1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15540, selfPID=12580, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14868, selfPID=14160, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7560, selfPID=7560, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8536, selfPID=7352, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10288, selfPID=7080, iMonCtr=1
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2872, selfPID=17892, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13636, selfPID=13636, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1452, selfPID=10408, iMonCtr=1
GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16

Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3172, selfPID=14552, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3172, selfPID=3172, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11196, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=16428, selfPID=16428, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9056, selfPID=9056, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13472, selfPID=13472, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13472, selfPID=11476, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14624, selfPID=20024, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5012, selfPID=11208, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5012, selfPID=5012, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15396, selfPID=15396, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7324, selfPID=10772, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10052, selfPID=14240, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12408, selfPID=12408, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12408, selfPID=14304, iMonCtr=1
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8096, selfPID=15680, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1308, selfPID=1308, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14604, selfPID=14604, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14620, selfPID=14620, iMonCtr=1
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=24280, selfPID=21744, iMonCtr=1
Signal 11 received: Segment violation
Signal 11 received: Software termination signal from kill 
Signal 11 received: Abnormal termination triggered by abort call
Signal 11 received, exiting...
11:38:43 (18976): called boinc_finish(193)
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=34508, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=18976, selfPID=15828, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
11:38:47 (15828): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>wah2_safr50_n12n_201612_16_789_011744067_0_r1433314078_10.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n12n_201612_16_789_011744067_0_r1433314078_11.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n12n_201612_16_789_011744067_0_r1433314078_12.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n12n_201612_16_789_011744067_0_r1433314078_13.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n12n_201612_16_789_011744067_0_r1433314078_14.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n12n_201612_16_789_011744067_0_r1433314078_15.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n12n_201612_16_789_011744067_0_r1433314078_16.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n12n_201612_16_789_011744067_0_r1433314078_restart.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
</message>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
30 Apr 2019 11:56:18 | 1479061 | 21508251 | wah2_safr50_n12n_201612_16_789_011744067_0 | 103,979 | 371,436 | 3.5722
18 Apr 2019 09:47:31 | 1479061 | 21508251 | wah2_safr50_n12n_201612_16_789_011744067_0 | 92,459 | 336,806 | 3.6428
12 Apr 2019 17:47:36 | 1479061 | 21508251 | wah2_safr50_n12n_201612_16_789_011744067_0 | 80,939 | 300,320 | 3.7104
05 Apr 2019 08:48:49 | 1479061 | 21508251 | wah2_safr50_n12n_201612_16_789_011744067_0 | 69,419 | 256,802 | 3.6993
02 Apr 2019 07:50:01 | 1479061 | 21508251 | wah2_safr50_n12n_201612_16_789_011744067_0 | 57,899 | 217,755 | 3.7609
29 Mar 2019 07:16:18 | 1479061 | 21508251 | wah2_safr50_n12n_201612_16_789_011744067_0 | 46,379 | 164,754 | 3.5523
24 Mar 2019 14:35:17 | 1479061 | 21508251 | wah2_safr50_n12n_201612_16_789_011744067_0 | 34,859 | 118,768 | 3.4071
09 Mar 2019 11:28:39 | 1479061 | 21508251 | wah2_safr50_n12n_201612_16_789_011744067_0 | 23,339 | 83,769 | 3.5892
07 Mar 2019 01:37:50 | 1479061 | 21508251 | wah2_safr50_n12n_201612_16_789_011744067_0 | 11,819 | 39,163 | 3.3136
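
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the timestep reached. The short Python sketch below checks that reading against three rows copied from the table. The formula is an inference from the figures rather than something the page states, and the helper name seconds_per_timestep is hypothetical.

```python
# Rows copied from the trickle table:
# (timestep, cumulative CPU time in seconds, reported Average in sec/TS).
trickles = [
    (103_979, 371_436, 3.5722),
    (92_459, 336_806, 3.6428),
    (11_819, 39_163, 3.3136),
]


def seconds_per_timestep(timestep, cpu_seconds):
    """Assumed formula: cumulative CPU seconds divided by the timestep count."""
    return cpu_seconds / timestep


for timestep, cpu_seconds, reported in trickles:
    computed = seconds_per_timestep(timestep, cpu_seconds)
    # Compare the computed value with the figure shown on the page.
    print(f"timestep {timestep}: computed {computed:.4f}, reported {reported:.4f}")
```

If the assumption holds, each computed value agrees with the reported one to the four decimal places shown on the page.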

