Name | hadcm3n_o38k_1980_40_008386053_1 |
Workunit | 8536912 |
Created | 24 Jul 2013, 3:15:30 UTC |
Sent | 24 Jul 2013, 3:29:40 UTC |
Report deadline | 23 Oct 2013, 10:56:51 UTC |
Received | 14 Aug 2013, 18:45:15 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1100639 |
Run time | 10 days 8 hours 29 min 44 sec |
CPU time | 10 days 0 hours 25 min 37 sec |
Validate state | Invalid |
Credit | 6,220.80 |
Device peak FLOPS | 2.68 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 02:55:45 (11186540): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 21:55:09 (908624): No heartbeat from core client for 30 sec - exiting 21:55:10 (908624): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 03:01:10 (3975184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 03:06:36 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:37 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:38 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:39 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:40 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:41 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:42 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:43 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:45 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:46 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:47 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:48 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:49 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:50 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:51 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:52 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:53 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:54 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:55 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:57 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:58 (4275236): No heartbeat from core client for 30 sec - exiting 03:06:59 (4275236): No heartbeat from core client for 30 sec - exiting 03:07:00 (4275236): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/o38kko.pji5c10 Error converting file to netcdf: dataout/o38kko.pii5c10 Error converting file to netcdf: dataout/o38kko.pfi5c10 Error converting file to netcdf: dataout/o38kka.phi5c10 Error converting file to netcdf: dataout/o38kka.pgi5c10 Error converting file to netcdf: dataout/o38kka.pei5c10 Error converting file to netcdf: dataout/o38kka.pdi5c10 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:43:13 (4620192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:45:54 (4621016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:53:37 (4631360): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 16:58:47 (6025828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 17:07:18 (6025888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:54:08 (6131808): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 21:47:59 (6251352): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:48:37 (6253524): No heartbeat from core client for 30 sec - exiting 21:48:38 (6253524): No heartbeat from core client for 30 sec - exiting 21:48:39 (6253524): No heartbeat from core client for 30 sec - exiting 21:48:41 (6253524): No heartbeat from core client for 30 sec - exiting 21:48:42 (6253524): No heartbeat from core client for 30 sec - exiting 21:48:43 (6253524): No heartbeat from core client for 30 sec - exiting 21:48:44 (6253524): No heartbeat from core client for 30 sec - exiting 21:48:45 (6253524): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:49:57 (7071376): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 15:09:35 (7347380): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:11:16 (7343476): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 15:19:39 (7357968): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:31:43 (8760632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:33:45 (8761052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 15:08:54 (8862460): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... zip error: Could not create output file (was replacing the original zip file) cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o38k_1980_40_008386053/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o38k_1980_40_008386053/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o38k_1980_40_008386053/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o38k_1980_40_008386053/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o38k_1980_40_008386053/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o38k_1980_40_008386053/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o38k_1980_40_008386053/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o38k_1980_40_008386053/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o38k_1980_40_008386053/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o38k_1980_40_008386053/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o38k_1980_40_008386053/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o38k_1980_40_008386053/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
14 Aug 2013 18:48:07 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 518,400 | 865,592 | 1.6697 |
14 Aug 2013 18:48:07 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 492,480 | 821,442 | 1.6680 |
14 Aug 2013 18:48:07 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 466,560 | 778,001 | 1.6675 |
14 Aug 2013 18:48:07 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 440,640 | 734,942 | 1.6679 |
14 Aug 2013 18:48:07 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 414,720 | 691,964 | 1.6685 |
14 Aug 2013 18:48:07 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 388,800 | 649,148 | 1.6696 |
14 Aug 2013 18:48:07 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 362,880 | 605,953 | 1.6698 |
14 Aug 2013 18:48:07 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 336,960 | 562,630 | 1.6697 |
14 Aug 2013 18:48:07 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 311,040 | 519,171 | 1.6691 |
14 Aug 2013 18:48:07 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 285,120 | 475,679 | 1.6683 |
14 Aug 2013 18:48:07 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 259,200 | 432,402 | 1.6682 |
14 Aug 2013 18:48:07 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 233,280 | 389,585 | 1.6700 |
30 Jul 2013 09:46:33 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 207,360 | 346,023 | 1.6687 |
29 Jul 2013 14:17:49 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 181,440 | 302,913 | 1.6695 |
29 Jul 2013 14:17:49 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 155,520 | 259,648 | 1.6695 |
29 Jul 2013 14:17:49 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 129,600 | 216,598 | 1.6713 |
29 Jul 2013 14:17:49 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 103,680 | 173,204 | 1.6706 |
29 Jul 2013 14:17:49 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 77,760 | 129,934 | 1.6710 |
26 Jul 2013 15:30:56 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 51,840 | 86,750 | 1.6734 |
26 Jul 2013 02:34:59 | 1100639 | 15905189 | hadcm3n_o38k_1980_40_008386053_1 | 25,920 | 43,376 | 1.6735 |
©2024 climateprediction.net