Name | hadcm3n_o2in_1940_40_007443119_1 |
Workunit | 7640622 |
Created | 8 Sep 2011, 23:02:51 UTC |
Sent | 8 Sep 2011, 23:07:47 UTC |
Report deadline | 9 Dec 2011, 6:34:58 UTC |
Received | 26 Oct 2011, 11:04:26 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1163145 |
Run time | 6 days 8 hours 17 min 12 sec |
CPU time | 6 days 6 hours 18 min 40 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 1.80 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4596, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4488, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4124, iMonCtr=1 Model crash detected, will try to restart... 01:30:11 (4008): No heartbeat from core client for 30 sec - exiting 01:30:13 (4008): No heartbeat from core client for 30 sec - exiting 01:30:14 (4008): No heartbeat from core client for 30 sec - exiting 01:30:15 (4008): No heartbeat from core client for 30 sec - exiting 01:30:16 (4008): No heartbeat from core client for 30 sec - exiting 01:30:17 (4008): No heartbeat from core client for 30 sec - exiting 01:30:18 (4008): No heartbeat from core client for 30 sec - exiting 01:30:19 (4008): No heartbeat from core client for 30 sec - exiting 01:30:20 (4008): No heartbeat from core client for 30 sec - exiting 01:30:21 (4008): No heartbeat from core client for 30 sec - exiting 01:30:22 (4008): No heartbeat from core client for 30 sec - exiting 01:30:23 (4008): No heartbeat from core client for 30 sec - exiting 01:30:25 (4008): No heartbeat from core client for 30 sec - exiting 01:30:26 (4008): No heartbeat from core client for 30 sec - exiting 01:30:27 (4008): No heartbeat from core client for 30 sec - exiting 01:30:28 (4008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:12:06 (1600): No heartbeat from core client for 30 sec - exiting 19:12:07 (1600): No heartbeat from core client for 30 sec - exiting 19:12:08 (1600): No heartbeat from core client for 30 sec - exiting 19:12:10 (1600): No heartbeat from core client for 30 sec - exiting 19:12:11 (1600): No heartbeat from core client for 30 sec - exiting 19:12:12 (1600): No heartbeat from core client for 30 sec - exiting 19:12:13 (1600): No heartbeat from core client for 30 sec - exiting 19:12:14 (1600): No heartbeat from core client for 30 sec - exiting 19:12:15 (1600): No heartbeat from core client for 30 sec - exiting 19:12:16 (1600): No heartbeat from core client for 30 sec - exiting 19:12:17 (1600): No heartbeat from core client for 30 sec - exiting 19:12:18 (1600): No heartbeat from core client for 30 sec - exiting 19:12:19 (1600): No heartbeat from core client for 30 sec - exiting 19:12:20 (1600): No heartbeat from core client for 30 sec - exiting 19:12:21 (1600): No heartbeat from core client for 30 sec - exiting 19:12:22 (1600): No heartbeat from core client for 30 sec - exiting 19:12:23 (1600): No heartbeat from core client for 30 sec - exiting 19:12:24 (1600): No heartbeat from core client for 30 sec - exiting 19:12:25 (1600): No heartbeat from core client for 30 sec - exiting 19:12:26 (1600): No heartbeat from core client for 30 sec - exiting 19:12:27 (1600): No heartbeat from core client for 30 sec - exiting 19:12:28 (1600): No heartbeat from core client for 30 sec - exiting 19:12:30 (1600): No heartbeat from core client for 30 sec - exiting 19:12:31 (1600): No heartbeat from core client for 30 sec - exiting 19:12:32 (1600): No heartbeat from core client for 30 sec - exiting 19:12:33 (1600): No heartbeat from core client for 30 sec - exiting 19:12:34 (1600): No heartbeat from core client for 30 sec - exiting 19:12:35 (1600): No heartbeat from core client for 30 sec - exiting 19:12:36 (1600): No heartbeat from core client for 30 sec - exiting 19:12:37 (1600): No heartbeat from core client for 30 sec - exiting 19:12:38 (1600): No heartbeat from core client for 30 sec - exiting 19:12:39 (1600): No heartbeat from core client for 30 sec - exiting 19:12:40 (1600): No heartbeat from core client for 30 sec - exiting 19:12:42 (1600): No heartbeat from core client for 30 sec - exiting 19:12:43 (1600): No heartbeat from core client for 30 sec - exiting 19:12:44 (1600): No heartbeat from core client for 30 sec - exiting 19:12:45 (1600): No heartbeat from core client for 30 sec - exiting 19:12:46 (1600): No heartbeat from core client for 30 sec - exiting 19:12:47 (1600): No heartbeat from core client for 30 sec - exiting 19:12:48 (1600): No heartbeat from core client for 30 sec - exiting 19:12:49 (1600): No heartbeat from core client for 30 sec - exiting 19:12:50 (1600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:15:11 (3904): No heartbeat from core client for 30 sec - exiting 13:15:13 (3904): No heartbeat from core client for 30 sec - exiting 13:15:14 (3904): No heartbeat from core client for 30 sec - exiting 13:15:15 (3904): No heartbeat from core client for 30 sec - exiting 13:15:16 (3904): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:46:26 (4200): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4440, iMonCtr=1 Model crash detected, will try to restart... 04:56:26 (4132): No heartbeat from core client for 30 sec - exiting 04:56:27 (4132): No heartbeat from core client for 30 sec - exiting 04:56:28 (4132): No heartbeat from core client for 30 sec - exiting 04:56:29 (4132): No heartbeat from core client for 30 sec - exiting 04:56:30 (4132): No heartbeat from core client for 30 sec - exiting 04:56:31 (4132): No heartbeat from core client for 30 sec - exiting 04:56:32 (4132): No heartbeat from core client for 30 sec - exiting 04:56:34 (4132): No heartbeat from core client for 30 sec - exiting 04:56:35 (4132): No heartbeat from core client for 30 sec - exiting 04:56:36 (4132): No heartbeat from core client for 30 sec - exiting 04:56:37 (4132): No heartbeat from core client for 30 sec - exiting 04:56:38 (4132): No heartbeat from core client for 30 sec - exiting 04:56:39 (4132): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3808, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4168, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4168, iMonCtr=1 Model crash detected, will try to restart... CSignal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
31 Oct 2011 15:56:28 | 1163145 | 13348113 | hadcm3n_o2in_1940_40_007443119_1 | 259,200 | 541,119 | 2.0877 |
31 Oct 2011 15:44:32 | 1163145 | 13348113 | hadcm3n_o2in_1940_40_007443119_1 | 233,280 | 486,390 | 2.0850 |
31 Oct 2011 15:44:32 | 1163145 | 13348113 | hadcm3n_o2in_1940_40_007443119_1 | 207,360 | 431,003 | 2.0785 |
18 Oct 2011 11:25:19 | 1163145 | 13348113 | hadcm3n_o2in_1940_40_007443119_1 | 181,440 | 376,277 | 2.0738 |
16 Oct 2011 08:26:39 | 1163145 | 13348113 | hadcm3n_o2in_1940_40_007443119_1 | 155,520 | 322,234 | 2.0720 |
15 Oct 2011 00:25:11 | 1163145 | 13348113 | hadcm3n_o2in_1940_40_007443119_1 | 129,600 | 267,494 | 2.0640 |
16 Sep 2011 17:38:23 | 1163145 | 13348113 | hadcm3n_o2in_1940_40_007443119_1 | 103,680 | 213,373 | 2.0580 |
16 Sep 2011 01:45:47 | 1163145 | 13348113 | hadcm3n_o2in_1940_40_007443119_1 | 77,760 | 159,426 | 2.0502 |
15 Sep 2011 10:50:38 | 1163145 | 13348113 | hadcm3n_o2in_1940_40_007443119_1 | 51,840 | 106,026 | 2.0453 |
14 Sep 2011 20:05:06 | 1163145 | 13348113 | hadcm3n_o2in_1940_40_007443119_1 | 25,920 | 52,920 | 2.0417 |
©2024 climateprediction.net