|
Name | hadcm3n_4get_1940_40_008302929_0 |
Workunit | 8454064 |
Created | 6 Feb 2013, 20:55:48 UTC |
Sent | 6 Feb 2013, 20:57:12 UTC |
Report deadline | 9 May 2013, 4:24:23 UTC |
Received | 2 Mar 2013, 15:00:55 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1105487 |
Run time | 16 days 18 hours 23 min |
CPU time | 16 days 5 hours 42 min 44 sec |
Validate state | Invalid |
Credit | 10,264.32 |
Device peak FLOPS | 3.05 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 06:57:31 (3908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:57:32 (3908): No heartbeat from core client for 30 sec - exiting 06:57:33 (3908): No heartbeat from core client for 30 sec - exiting 06:57:34 (3908): No heartbeat from core client for 30 sec - exiting 06:57:35 (3908): No heartbeat from core client for 30 sec - exiting 06:57:36 (3908): No heartbeat from core client for 30 sec - exiting 06:57:37 (3908): No heartbeat from core client for 30 sec - exiting 06:57:38 (3908): No heartbeat from core client for 30 sec - exiting 06:57:39 (3908): No heartbeat from core client for 30 sec - exiting 06:57:40 (3908): No heartbeat from core client for 30 sec - exiting 06:57:41 (3908): No heartbeat from core client for 30 sec - exiting 20:32:03 (2132): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:27:14 (3864): No heartbeat from core client for 30 sec - exiting 17:27:15 (3864): No heartbeat from core client for 30 sec - exiting 17:27:16 (3864): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... 04:07:16 (4720): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:07:17 (4720): No heartbeat from core client for 30 sec - exiting 04:07:18 (4720): No heartbeat from core client for 30 sec - exiting 04:07:19 (4720): No heartbeat from core client for 30 sec - exiting 04:07:20 (4720): No heartbeat from core client for 30 sec - exiting 04:07:21 (4720): No heartbeat from core client for 30 sec - exiting 04:07:22 (4720): No heartbeat from core client for 30 sec - exiting 04:07:23 (4720): No heartbeat from core client for 30 sec - exiting 04:07:25 (4720): No heartbeat from core client for 30 sec - exiting 04:07:26 (4720): No heartbeat from core client for 30 sec - exiting 07:46:27 (2868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:25:18 (5708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:00:52 (2396): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:38:33 (1384): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:58:16 (1428): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:58:18 (1428): No heartbeat from core client for 30 sec - exiting 10:58:19 (1428): No heartbeat from core client for 30 sec - exiting 10:58:20 (1428): No heartbeat from core client for 30 sec - exiting 10:58:21 (1428): No heartbeat from core client for 30 sec - exiting 13:53:42 (768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:35:33 (4740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:13:10 (4880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:12:55 (5116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:14:08 (2820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:59:15 (3704): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:59:16 (3704): No heartbeat from core client for 30 sec - exiting 21:59:17 (3704): No heartbeat from core client for 30 sec - exiting 21:59:18 (3704): No heartbeat from core client for 30 sec - exiting 21:59:19 (3704): No heartbeat from core client for 30 sec - exiting 03:10:50 (3852): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:10:51 (3852): No heartbeat from core client for 30 sec - exiting 03:10:52 (3852): No heartbeat from core client for 30 sec - exiting 03:10:53 (3852): No heartbeat from core client for 30 sec - exiting 13:54:00 (5184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 02:47:47 (5912): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:47:48 (5912): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 05:57:58 (5904): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:57:59 (5904): No heartbeat from core client for 30 sec - exiting 05:58:00 (5904): No heartbeat from core client for 30 sec - exiting 05:58:01 (5904): No heartbeat from core client for 30 sec - exiting 05:58:02 (5904): No heartbeat from core client for 30 sec - exiting 05:58:03 (5904): No heartbeat from core client for 30 sec - exiting 05:58:04 (5904): No heartbeat from core client for 30 sec - exiting 05:58:05 (5904): No heartbeat from core client for 30 sec - exiting 05:58:06 (5904): No heartbeat from core client for 30 sec - exiting 05:58:07 (5904): No heartbeat from core client for 30 sec - exiting 05:58:08 (5904): No heartbeat from core client for 30 sec - exiting 11:15:43 (1552): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:06:13 (5332): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:15:32 (5736): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 15:37:07 (1896): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:37:08 (1896): No heartbeat from core client for 30 sec - exiting 15:37:09 (1896): No heartbeat from core client for 30 sec - exiting 19:12:58 (5360): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:23:36 (3456): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:51:34 (4416): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:51:35 (4416): No heartbeat from core client for 30 sec - exiting 08:51:36 (4416): No heartbeat from core client for 30 sec - exiting 08:51:37 (4416): No heartbeat from core client for 30 sec - exiting 08:51:38 (4416): No heartbeat from core client for 30 sec - exiting 08:51:39 (4416): No heartbeat from core client for 30 sec - exiting 08:51:40 (4416): No heartbeat from core client for 30 sec - exiting 08:51:41 (4416): No heartbeat from core client for 30 sec - exiting 08:51:42 (4416): No heartbeat from core client for 30 sec - exiting Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
02 Mar 2013 06:47:26 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 855,360 | 1,395,591 | 1.6316 |
01 Mar 2013 20:16:18 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 829,440 | 1,356,015 | 1.6349 |
01 Mar 2013 08:02:18 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 803,520 | 1,315,977 | 1.6378 |
28 Feb 2013 20:53:30 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 777,600 | 1,276,818 | 1.6420 |
28 Feb 2013 09:02:49 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 751,680 | 1,234,953 | 1.6429 |
27 Feb 2013 21:18:57 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 725,760 | 1,193,631 | 1.6447 |
27 Feb 2013 09:37:00 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 699,840 | 1,152,602 | 1.6470 |
26 Feb 2013 22:19:03 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 673,920 | 1,112,371 | 1.6506 |
26 Feb 2013 11:36:09 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 648,000 | 1,071,731 | 1.6539 |
25 Feb 2013 23:52:12 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 622,080 | 1,031,390 | 1.6580 |
25 Feb 2013 11:55:50 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 596,160 | 990,021 | 1.6607 |
24 Feb 2013 23:20:49 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 570,240 | 947,845 | 1.6622 |
24 Feb 2013 11:11:22 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 544,320 | 906,105 | 1.6647 |
23 Feb 2013 23:31:33 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 518,400 | 865,643 | 1.6698 |
23 Feb 2013 11:22:40 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 492,480 | 824,177 | 1.6735 |
23 Feb 2013 00:10:14 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 466,560 | 784,089 | 1.6806 |
22 Feb 2013 12:07:38 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 440,640 | 742,773 | 1.6857 |
22 Feb 2013 00:35:13 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 414,720 | 701,792 | 1.6922 |
21 Feb 2013 12:45:08 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 388,800 | 660,093 | 1.6978 |
21 Feb 2013 00:47:53 | 1105487 | 15588410 | hadcm3n_4get_1940_40_008302929_0 | 362,880 | 618,503 | 1.7044 |
©2024 climateprediction.net