Name | hadcm3n_z8qa_1960_40_008282307_1 |
Workunit | 8433442 |
Created | 19 Mar 2013, 6:39:34 UTC |
Sent | 19 Mar 2013, 6:39:55 UTC |
Report deadline | 18 Jun 2013, 14:07:06 UTC |
Received | 8 Apr 2013, 17:49:29 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 985512 |
Run time | 18 days 6 hours 52 min 57 sec |
CPU time | 17 days 2 hours 58 min 44 sec |
Validate state | Invalid |
Credit | 10,264.32 |
Device peak FLOPS | 2.52 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 08:24:37 (2992): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:24:38 (2992): No heartbeat from core client for 30 sec - exiting 08:24:39 (2992): No heartbeat from core client for 30 sec - exiting 08:24:40 (2992): No heartbeat from core client for 30 sec - exiting 08:24:41 (2992): No heartbeat from core client for 30 sec - exiting 08:24:42 (2992): No heartbeat from core client for 30 sec - exiting 08:24:43 (2992): No heartbeat from core client for 30 sec - exiting 08:24:44 (2992): No heartbeat from core client for 30 sec - exiting 08:24:45 (2992): No heartbeat from core client for 30 sec - exiting 08:24:46 (2992): No heartbeat from core client for 30 sec - exiting 08:24:47 (2992): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 13:49:09 (8104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:49:10 (8104): No heartbeat from core client for 30 sec - exiting 13:49:11 (8104): No heartbeat from core client for 30 sec - exiting 13:49:12 (8104): No heartbeat from core client for 30 sec - exiting 13:49:13 (8104): No heartbeat from core client for 30 sec - exiting 13:49:14 (8104): No heartbeat from core client for 30 sec - exiting 13:49:15 (8104): No heartbeat from core client for 30 sec - exiting 13:49:16 (8104): No heartbeat from core client for 30 sec - exiting 13:49:17 (8104): No heartbeat from core client for 30 sec - exiting 13:49:18 (8104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 00:22:06 (12044): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:48:14 (8496): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 19:26:15 (8004): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:26:23 (3344): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:57:33 (12380): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 06:04:02 (10080): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:04:42 (10080): No heartbeat from core client for 30 sec - exiting 06:04:43 (10080): No heartbeat from core client for 30 sec - exiting 06:04:44 (10080): No heartbeat from core client for 30 sec - exiting 06:04:45 (10080): No heartbeat from core client for 30 sec - exiting 06:04:46 (10080): No heartbeat from core client for 30 sec - exiting 06:04:47 (10080): No heartbeat from core client for 30 sec - exiting 06:04:48 (10080): No heartbeat from core client for 30 sec - exiting 06:04:49 (10080): No heartbeat from core client for 30 sec - exiting 06:04:50 (10080): No heartbeat from core client for 30 sec - exiting 06:04:51 (10080): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 06:11:31 (13548): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:53:39 (11972): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:10:08 (16752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:13:58 (14936): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:20:15 (18224): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:24:13 (6224): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:36:08 (6172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:41:59 (452): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:42:00 (452): No heartbeat from core client for 30 sec - exiting 10:00:00 (7232): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:20:27 (16976): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:22:10 (10428): No heartbeat from core client for 30 sec - exiting 10:22:11 (10428): No heartbeat from core client for 30 sec - exiting 10:22:12 (10428): No heartbeat from core client for 30 sec - exiting 10:22:13 (10428): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:24:28 (17900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:26:11 (5944): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:32:10 (6372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:34:37 (3012): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:40:36 (9192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:40:37 (9192): No heartbeat from core client for 30 sec - exiting 10:40:38 (9192): No heartbeat from core client for 30 sec - exiting 10:40:39 (9192): No heartbeat from core client for 30 sec - exiting 10:40:40 (9192): No heartbeat from core client for 30 sec - exiting 10:40:41 (9192): No heartbeat from core client for 30 sec - exiting 10:40:42 (9192): No heartbeat from core client for 30 sec - exiting 10:40:43 (9192): No heartbeat from core client for 30 sec - exiting 10:40:44 (9192): No heartbeat from core client for 30 sec - exiting 10:57:05 (14596): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:13:03 (3916): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:13:54 (3652): No heartbeat from core client for 30 sec - exiting 11:13:55 (3652): No heartbeat from core client for 30 sec - exiting 11:13:56 (3652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:29:53 (16332): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:32:04 (13108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:35:03 (1608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:39:39 (16304): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:51:12 (12480): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:01:16 (9404): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:01:17 (9404): No heartbeat from core client for 30 sec - exiting 12:01:18 (9404): No heartbeat from core client for 30 sec - exiting 12:01:19 (9404): No heartbeat from core client for 30 sec - exiting 12:01:20 (9404): No heartbeat from core client for 30 sec - exiting 12:01:21 (9404): No heartbeat from core client for 30 sec - exiting 12:01:22 (9404): No heartbeat from core client for 30 sec - exiting 12:01:23 (9404): No heartbeat from core client for 30 sec - exiting 12:01:24 (9404): No heartbeat from core client for 30 sec - exiting 12:01:25 (9404): No heartbeat from core client for 30 sec - exiting 12:01:26 (9404): No heartbeat from core client for 30 sec - exiting 23:19:23 (8868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6616, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
08 Apr 2013 14:50:25 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 855,360 | 1,513,478 | 1.7694 |
08 Apr 2013 01:39:26 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 829,440 | 1,467,127 | 1.7688 |
07 Apr 2013 12:34:07 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 803,520 | 1,421,228 | 1.7688 |
06 Apr 2013 21:03:33 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 777,600 | 1,375,024 | 1.7683 |
06 Apr 2013 07:48:15 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 751,680 | 1,328,951 | 1.7680 |
04 Apr 2013 03:03:29 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 725,760 | 1,283,233 | 1.7681 |
03 Apr 2013 12:21:59 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 699,840 | 1,238,043 | 1.7690 |
02 Apr 2013 23:06:38 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 673,920 | 1,191,766 | 1.7684 |
02 Apr 2013 09:48:58 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 648,000 | 1,145,730 | 1.7681 |
01 Apr 2013 20:28:04 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 622,080 | 1,099,541 | 1.7675 |
01 Apr 2013 07:08:53 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 596,160 | 1,053,551 | 1.7672 |
31 Mar 2013 17:21:23 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 570,240 | 1,007,108 | 1.7661 |
31 Mar 2013 04:02:54 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 544,320 | 961,133 | 1.7657 |
30 Mar 2013 14:40:26 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 518,400 | 915,229 | 1.7655 |
30 Mar 2013 01:26:59 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 492,480 | 869,578 | 1.7657 |
29 Mar 2013 12:27:17 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 466,560 | 824,419 | 1.7670 |
28 Mar 2013 23:00:47 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 440,640 | 778,882 | 1.7676 |
28 Mar 2013 09:54:02 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 414,720 | 733,397 | 1.7684 |
27 Mar 2013 20:46:55 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 388,800 | 687,477 | 1.7682 |
27 Mar 2013 07:21:49 | 985512 | 15671780 | hadcm3n_z8qa_1960_40_008282307_1 | 362,880 | 642,044 | 1.7693 |
©2024 climateprediction.net