climateprediction.net home page
Task 22335524

Task 22335524

Name wah2_nz25_201d_209005_25_995_012220447_1
Workunit 12220447
Created 25 Jul 2023, 1:44:13 UTC
Sent 25 Jul 2023, 1:44:21 UTC
Report deadline 5 Aug 2024, 7:04:21 UTC
Received 28 Aug 2023, 1:08:47 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1541803
Run time 2 days 6 hours 17 min 1 sec
CPU time 2 days 4 hours 27 min 48 sec
Validate state Invalid
Credit 4,579.34
Device peak FLOPS 4.24 GFLOPS
Application version Weather At Home 2 (wah2) v8.24
windows_intelx86
Peak working set size 262.14 MB
Peak swap size 225.29 MB
Peak disk usage 87.46 MB
Stderr
<core_client_version>7.20.2</core_client_version>
<![CDATA[
<stderr_txt>
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=20840, selfPID=21268, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=21660, selfPID=21660, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=17196, selfPID=17140, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=16064, selfPID=2584, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=21012, selfPID=21012, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=17124, selfPID=17124, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=18632, selfPID=9580, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=18632, selfPID=18632, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=18580, selfPID=18580, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12248, selfPID=22484, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9668, selfPID=9668, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14996, selfPID=14996, iMonCtr=1
GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 63 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 64 - Return code = 16

Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4452, selfPID=4452, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4452, selfPID=19312, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13552, selfPID=13608, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13552, selfPID=13552, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=23892, selfPID=16556, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=23892, selfPID=23892, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5612, selfPID=5612, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14164, selfPID=14164, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=24372, selfPID=24372, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14144, selfPID=14144, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14144, selfPID=23668, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=26280, selfPID=25876, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3152, selfPID=20864, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3152, selfPID=3152, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=20236, selfPID=20236, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=20236, selfPID=19588, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10536, selfPID=10536, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=16812, selfPID=25916, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=16812, selfPID=16812, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=26940, selfPID=20292, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15840, selfPID=15840, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15840, selfPID=21912, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5340, selfPID=5340, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5340, selfPID=12452, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=25240, selfPID=25240, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=25240, selfPID=22268, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=22144, selfPID=22144, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12476, selfPID=12476, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11080, selfPID=11080, iMonCtr=1
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9492, selfPID=18732, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_ain::Monitor...
23:15:09 (18732): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_7.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_8.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_9.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_10.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_11.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_12.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_13.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_14.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_15.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_16.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_17.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_18.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_19.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_20.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_21.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_22.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_23.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_24.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_25.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>wah2_nz25_201d_209005_25_995_012220447_1_r1265517372_restart.zip</file_name>
  <error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
26 Aug 2023 13:50:08 1541803 22335524 wah2_nz25_201d_209005_25_995_012220447_1 69,419 187,547 2.7017
20 Aug 2023 16:26:23 1541803 22335524 wah2_nz25_201d_209005_25_995_012220447_1 57,899 154,946 2.6761
17 Aug 2023 03:07:23 1541803 22335524 wah2_nz25_201d_209005_25_995_012220447_1 46,379 122,038 2.6313
12 Aug 2023 01:38:29 1541803 22335524 wah2_nz25_201d_209005_25_995_012220447_1 34,859 88,667 2.5436
12 Aug 2023 04:34:45 1541803 22335524 wah2_nz25_201d_209005_25_995_012220447_1 34,859 89,529 2.5683
10 Aug 2023 00:47:25 1541803 22335524 wah2_nz25_201d_209005_25_995_012220447_1 23,339 57,187 2.4503
03 Aug 2023 01:37:24 1541803 22335524 wah2_nz25_201d_209005_25_995_012220447_1 11,819 28,012 2.3701


©2024 climateprediction.net