Name | wah2_sam50_a0fh_201312_25_881_012034565_0 |
Workunit | 12034565 |
Created | 2 Nov 2020, 12:13:19 UTC |
Sent | 2 Nov 2020, 12:22:35 UTC |
Report deadline | 15 Oct 2021, 17:42:35 UTC |
Received | 24 Nov 2020, 10:52:40 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1510585 |
Run time | 4 days 1 hours 33 min 48 sec |
CPU time | 4 days 0 hours 28 min 42 sec |
Validate state | Invalid |
Credit | 9,138.96 |
Device peak FLOPS | 4.36 GFLOPS |
Application version | Weather At Home 2 (wah2) v8.24 windows_intelx86 |
Peak working set size | 226.76 MB |
Peak swap size | 187.97 MB |
Peak disk usage | 151.04 MB |
Stderr | <core_client_version>7.16.5</core_client_version> <![CDATA[ <stderr_txt> Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=20484, selfPID=20484, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:06:10 (21284): Can't acquire lockfile (32) - waiting 35s 20:06:45 (21284): Can't acquire lockfile (32) - exiting 20:06:46 (21284): Error: The process cannot access the file because it is being used by another process. (0x20) 20:16:51 (21000): Can't acquire lockfile (32) - waiting 35s 20:17:26 (21000): Can't acquire lockfile (32) - exiting 20:17:26 (21000): Error: The process cannot access the file because it is being used by another process. (0x20) 20:27:39 (8572): Can't acquire lockfile (32) - waiting 35s 20:28:14 (8572): Can't acquire lockfile (32) - exiting 20:28:14 (8572): Error: The process cannot access the file because it is being used by another process. (0x20) 20:38:31 (20836): Can't acquire lockfile (32) - waiting 35s 20:39:06 (20836): Can't acquire lockfile (32) - exiting 20:39:06 (20836): Error: The process cannot access the file because it is being used by another process. (0x20) 20:49:14 (21172): Can't acquire lockfile (32) - waiting 35s 20:49:49 (21172): Can't acquire lockfile (32) - exiting 20:49:49 (21172): Error: The process cannot access the file because it is being used by another process. 
(0x20) 21:00:01 (19884): Can't acquire lockfile (32) - waiting 35s 21:00:36 (19884): Can't acquire lockfile (32) - exiting 21:00:36 (19884): Error: The process cannot access the file because it is being used by another process. (0x20) 21:10:57 (20292): Can't acquire lockfile (32) - waiting 35s 21:11:32 (20292): Can't acquire lockfile (32) - exiting 21:11:32 (20292): Error: The process cannot access the file because it is being used by another process. (0x20) 21:22:15 (15544): Can't acquire lockfile (32) - waiting 35s 21:22:50 (15544): Can't acquire lockfile (32) - exiting 21:22:50 (15544): Error: The process cannot access the file because it is being used by another process. (0x20) 22:44:06 (15104): Can't acquire lockfile (32) - waiting 35s 22:44:42 (15104): Can't acquire lockfile (32) - exiting 22:44:42 (15104): Error: The process cannot access the file because it is being used by another process. (0x20) 00:03:41 (8220): Can't acquire lockfile (32) - waiting 35s 00:04:16 (8220): Can't acquire lockfile (32) - exiting 00:04:16 (8220): Error: The process cannot access the file because it is being used by another process. (0x20) 00:19:02 (23732): Can't acquire lockfile (32) - waiting 35s 00:19:37 (23732): Can't acquire lockfile (32) - exiting 00:19:37 (23732): Error: The process cannot access the file because it is being used by another process. (0x20) 00:29:40 (4176): Can't acquire lockfile (32) - waiting 35s 00:30:15 (4176): Can't acquire lockfile (32) - exiting 00:30:15 (4176): Error: The process cannot access the file because it is being used by another process. (0x20) 00:56:14 (15956): Can't acquire lockfile (32) - waiting 35s 00:56:49 (15956): Can't acquire lockfile (32) - exiting 00:56:49 (15956): Error: The process cannot access the file because it is being used by another process. 
(0x20) 06:51:29 (9244): Can't acquire lockfile (32) - waiting 35s 06:52:04 (9244): Can't acquire lockfile (32) - exiting 06:52:04 (9244): Error: The process cannot access the file because it is being used by another process. (0x20) 07:06:13 (4172): Can't acquire lockfile (32) - waiting 35s 07:06:48 (4172): Can't acquire lockfile (32) - exiting 07:06:48 (4172): Error: The process cannot access the file because it is being used by another process. (0x20) 07:48:10 (5400): Can't acquire lockfile (32) - waiting 35s 07:48:45 (5400): Can't acquire lockfile (32) - exiting 07:48:45 (5400): Error: The process cannot access the file because it is being used by another process. (0x20) 08:36:03 (24464): Can't acquire lockfile (32) - waiting 35s 08:36:38 (24464): Can't acquire lockfile (32) - exiting 08:36:38 (24464): Error: The process cannot access the file because it is being used by another process. (0x20) 08:56:32 (24896): Can't acquire lockfile (32) - waiting 35s 08:57:07 (24896): Can't acquire lockfile (32) - exiting 08:57:07 (24896): Error: The process cannot access the file because it is being used by another process. (0x20) 11:29:57 (20448): Can't acquire lockfile (32) - waiting 35s 11:30:32 (20448): Can't acquire lockfile (32) - exiting 11:30:32 (20448): Error: The process cannot access the file because it is being used by another process. (0x20) 12:04:23 (24144): Can't acquire lockfile (32) - waiting 35s 12:04:58 (24144): Can't acquire lockfile (32) - exiting 12:04:58 (24144): Error: The process cannot access the file because it is being used by another process. (0x20) 13:07:52 (19920): Can't acquire lockfile (32) - waiting 35s 13:08:27 (19920): Can't acquire lockfile (32) - exiting 13:08:27 (19920): Error: The process cannot access the file because it is being used by another process. 
(0x20) 13:34:05 (2040): Can't acquire lockfile (32) - waiting 35s 13:34:40 (2040): Can't acquire lockfile (32) - exiting 13:34:40 (2040): Error: The process cannot access the file because it is being used by another process. (0x20) CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=19484, selfPID=19400, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_ain::Monitor... 
cpdnmonitor: cannot open input file Y:\BOINC/projects/climateprediction.net/wah2_sam50_a0fh_201312_25_881_012034565/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file Y:\BOINC/projects/climateprediction.net/wah2_sam50_a0fh_201312_25_881_012034565/dataout/region_restart.day after 11 attempts Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xadae.pipe_dummy Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xacxf.pipe_dummy 2048 Leaving CPDN_ain::Monitor... 04:35:47 (18340): called boinc_finish(0) </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>wah2_sam50_a0fh_201312_25_881_012034565_0_r1078671741_13.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a0fh_201312_25_881_012034565_0_r1078671741_14.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a0fh_201312_25_881_012034565_0_r1078671741_15.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a0fh_201312_25_881_012034565_0_r1078671741_16.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a0fh_201312_25_881_012034565_0_r1078671741_17.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a0fh_201312_25_881_012034565_0_r1078671741_18.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a0fh_201312_25_881_012034565_0_r1078671741_19.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a0fh_201312_25_881_012034565_0_r1078671741_20.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> 
<file_xfer_error> <file_name>wah2_sam50_a0fh_201312_25_881_012034565_0_r1078671741_21.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a0fh_201312_25_881_012034565_0_r1078671741_22.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a0fh_201312_25_881_012034565_0_r1078671741_23.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a0fh_201312_25_881_012034565_0_r1078671741_24.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a0fh_201312_25_881_012034565_0_r1078671741_25.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> <file_xfer_error> <file_name>wah2_sam50_a0fh_201312_25_881_012034565_0_r1078671741_restart.zip</file_name> <error_code>-240 (stat() failed)</error_code> </file_xfer_error> </message> ]]> |
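
The stderr output above repeats one pattern many times: a process waits 35 s for the lockfile, gives up, and reports Windows error 32 (0x20, file in use by another process). A minimal Python sketch for pulling those give-up events out of the flattened log follows. The names `LOCK_RE`, `SAMPLE`, and `failed_lock_attempts` are hypothetical, and the regular expression is only inferred from the entries shown on this page, not from any BOINC specification.

```python
import re

# Pattern inferred from entries such as
# "20:06:45 (21284): Can't acquire lockfile (32) - exiting"
LOCK_RE = re.compile(
    r"(\d{2}:\d{2}:\d{2}) \((\d+)\): Can't acquire lockfile \((\d+)\) - (waiting \d+s|exiting)"
)

# Short excerpt copied from the stderr text above (not the full log).
SAMPLE = (
    "20:06:10 (21284): Can't acquire lockfile (32) - waiting 35s "
    "20:06:45 (21284): Can't acquire lockfile (32) - exiting "
    "20:06:46 (21284): Error: The process cannot access the file because "
    "it is being used by another process. (0x20) "
    "20:16:51 (21000): Can't acquire lockfile (32) - waiting 35s "
    "20:17:26 (21000): Can't acquire lockfile (32) - exiting"
)


def failed_lock_attempts(stderr_text):
    """Return (time, pid) for each attempt that gave up on the lockfile."""
    return [
        (time, int(pid))
        for time, pid, _code, action in LOCK_RE.findall(stderr_text)
        if action == "exiting"
    ]


if __name__ == "__main__":
    for time, pid in failed_lock_attempts(SAMPLE):
        print(f"{time} pid={pid} gave up waiting for the lockfile")
```

Run against the full stderr text, the same function would list every restart attempt that failed this way, which makes the gaps between attempts easier to see than in the raw log.
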
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
23 Nov 2020 23:03:38 | 1510585 | 21960097 | wah2_sam50_a0fh_201312_25_881_012034565_0 | 138,539 | 336,383 | 2.4281 |
20 Nov 2020 14:31:21 | 1510585 | 21960097 | wah2_sam50_a0fh_201312_25_881_012034565_0 | 127,019 | 305,722 | 2.4069 |
16 Nov 2020 16:30:49 | 1510585 | 21960097 | wah2_sam50_a0fh_201312_25_881_012034565_0 | 115,499 | 276,613 | 2.3949 |
15 Nov 2020 21:00:04 | 1510585 | 21960097 | wah2_sam50_a0fh_201312_25_881_012034565_0 | 103,979 | 245,702 | 2.3630 |
14 Nov 2020 17:19:27 | 1510585 | 21960097 | wah2_sam50_a0fh_201312_25_881_012034565_0 | 92,459 | 218,124 | 2.3591 |
14 Nov 2020 02:42:34 | 1510585 | 21960097 | wah2_sam50_a0fh_201312_25_881_012034565_0 | 80,939 | 188,166 | 2.3248 |
12 Nov 2020 13:58:25 | 1510585 | 21960097 | wah2_sam50_a0fh_201312_25_881_012034565_0 | 57,899 | 135,333 | 2.3374 |
12 Nov 2020 05:52:42 | 1510585 | 21960097 | wah2_sam50_a0fh_201312_25_881_012034565_0 | 46,379 | 106,561 | 2.2976 |
12 Nov 2020 02:53:10 | 1510585 | 21960097 | wah2_sam50_a0fh_201312_25_881_012034565_0 | 34,859 | 80,316 | 2.3040 |
12 Nov 2020 02:53:10 | 1510585 | 21960097 | wah2_sam50_a0fh_201312_25_881_012034565_0 | 23,339 | 58,758 | 2.5176 |
11 Nov 2020 08:24:37 | 1510585 | 21960097 | wah2_sam50_a0fh_201312_25_881_012034565_0 | 11,819 | 33,487 | 2.8333 |
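
The Average (sec/TS) column appears to be CPU Time (sec) divided by Timestep, rounded to four decimal places. The page does not state this, so treat it as an assumption. A small Python sketch that checks the relationship against a few rows copied from the table above (the names `TRICKLES`, `sec_per_timestep`, and `mismatches` are hypothetical):

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table.
TRICKLES = [
    (138_539, 336_383, 2.4281),
    (127_019, 305_722, 2.4069),
    (115_499, 276_613, 2.3949),
    (11_819, 33_487, 2.8333),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Average CPU seconds per model timestep, rounded like the table."""
    return round(cpu_time_sec / timestep, 4)


def mismatches(rows, tolerance=1e-4):
    """Return rows whose reported average differs from the recomputed one."""
    return [
        row
        for row in rows
        if abs(sec_per_timestep(row[1], row[0]) - row[2]) > tolerance
    ]


if __name__ == "__main__":
    for timestep, cpu, reported in TRICKLES:
        print(timestep, sec_per_timestep(cpu, timestep), reported)
    print("rows that disagree:", mismatches(TRICKLES))
```

For the rows listed here the recomputed values agree with the reported column, which supports reading the average as cumulative CPU time per timestep rather than a per-trickle rate.
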