climateprediction.net home page
Task 21521070

Task 21521070

Name wah2_safr50_n022_201612_24_791_011755080_0
Workunit 11755080
Created 7 Mar 2019, 10:22:38 UTC
Sent 8 Mar 2019, 11:32:37 UTC
Report deadline 18 Feb 2020, 16:52:37 UTC
Received 6 Jun 2019, 12:44:21 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1479732
Run time 12 days 19 hours 48 min 41 sec
CPU time 5 days 10 hours 1 min 10 sec
Validate state Invalid
Credit 6,859.15
Device peak FLOPS 4.51 GFLOPS
Application version Weather At Home 2 (wah2) v8.24
windows_intelx86
Peak working set size 253.25 MB
Peak swap size 220.44 MB
Peak disk usage 842.96 MB
Stderr
<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12320, selfPID=12320, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5020, selfPID=5020, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11252, selfPID=11252, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1992, selfPID=1992, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12952, selfPID=13308, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13000, selfPID=13304, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10548, selfPID=12428, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12932, selfPID=1888, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12828, selfPID=12828, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12828, selfPID=12488, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3952, selfPID=3952, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3952, selfPID=7364, iMonCtr=1
GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16

Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1560, selfPID=1560, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1560, selfPID=10828, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12212, selfPID=12212, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12212, selfPID=9096, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 11 received: Segment violation
Signal 11 received: Software termination signal from kill 
Signal 11 received: Abnormal termination triggered by abort call
Signal 11 received, exiting...
23:23:43 (5856): called boinc_finish(193)
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2324, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5856, selfPID=12264, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
23:23:49 (12264): called boinc_finish(0)
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6728, selfPID=9312, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12312, selfPID=12312, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13748, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11272, selfPID=14192, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12836, selfPID=8964, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3576, selfPID=13212, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 11 received: Segment violation
Signal 11 received: Software termination signal from kill 
Signal 11 received: Abnormal termination triggered by abort call
Signal 11 received, exiting...
14:17:19 (14400): called boinc_finish(193)
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10444, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14400, selfPID=14140, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
14:17:23 (14140): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>wah2_safr50_n022_201612_24_791_011755080_0_r576673158_10.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n022_201612_24_791_011755080_0_r576673158_11.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n022_201612_24_791_011755080_0_r576673158_12.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n022_201612_24_791_011755080_0_r576673158_13.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n022_201612_24_791_011755080_0_r576673158_14.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n022_201612_24_791_011755080_0_r576673158_15.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n022_201612_24_791_011755080_0_r576673158_16.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n022_201612_24_791_011755080_0_r576673158_17.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n022_201612_24_791_011755080_0_r576673158_18.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n022_201612_24_791_011755080_0_r576673158_19.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n022_201612_24_791_011755080_0_r576673158_20.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n022_201612_24_791_011755080_0_r576673158_21.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n022_201612_24_791_011755080_0_r576673158_22.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n022_201612_24_791_011755080_0_r576673158_23.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n022_201612_24_791_011755080_0_r576673158_24.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_safr50_n022_201612_24_791_011755080_0_r576673158_restart.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
21 Apr 2019 14:30:07 1479732 21521070 wah2_safr50_n022_201612_24_791_011755080_0 103,979 469,575 4.5161
20 Apr 2019 13:53:26 1479732 21521070 wah2_safr50_n022_201612_24_791_011755080_0 92,459 419,433 4.5364
14 Apr 2019 12:43:44 1479732 21521070 wah2_safr50_n022_201612_24_791_011755080_0 80,939 364,298 4.5009
07 Apr 2019 14:14:56 1479732 21521070 wah2_safr50_n022_201612_24_791_011755080_0 69,419 308,826 4.4487
05 Apr 2019 09:19:40 1479732 21521070 wah2_safr50_n022_201612_24_791_011755080_0 57,899 247,410 4.2731
24 Mar 2019 10:07:26 1479732 21521070 wah2_safr50_n022_201612_24_791_011755080_0 46,379 192,203 4.1442
19 Mar 2019 13:36:28 1479732 21521070 wah2_safr50_n022_201612_24_791_011755080_0 34,859 137,543 3.9457
17 Mar 2019 01:22:17 1479732 21521070 wah2_safr50_n022_201612_24_791_011755080_0 23,339 80,208 3.4367
11 Mar 2019 22:34:02 1479732 21521070 wah2_safr50_n022_201612_24_791_011755080_0 11,819 39,443 3.3373


©2024 climateprediction.net