|
Name | hadcm3n_o255_2140_40_008269032_1 |
Workunit | 8424156 |
Created | 28 Dec 2012, 11:00:40 UTC |
Sent | 28 Dec 2012, 11:22:36 UTC |
Report deadline | 29 Mar 2013, 18:49:47 UTC |
Received | 13 Jan 2013, 0:28:54 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1210909 |
Run time | 13 days 3 hours 20 min 22 sec |
CPU time | 9 days 21 hours 21 min 51 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 2.74 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.60</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:43:03 (3308): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... MainError: 08:20:22 AM No files match the supplied pattern. MainError: 08:20:22 AM No files match the supplied pattern. MainError: 07:25:23 PM No files match the supplied pattern. MainError: 07:25:23 PM No files match the supplied pattern. MainError: 06:35:18 AM No files match the supplied pattern. MainError: 06:35:18 AM No files match the supplied pattern. MainError: 05:46:24 PM No files match the supplied pattern. MainError: 05:46:24 PM No files match the supplied pattern. CPDN Monitor - Quit request from BOINC... 11:13:56 (3872): No heartbeat from core client for 30 sec - exiting 11:13:57 (3872): No heartbeat from core client for 30 sec - exiting 11:13:59 (3872): No heartbeat from core client for 30 sec - exiting 11:14:00 (3872): No heartbeat from core client for 30 sec - exiting 11:14:01 (3872): No heartbeat from core client for 30 sec - exiting 11:14:02 (3872): No heartbeat from core client for 30 sec - exiting 11:14:03 (3872): No heartbeat from core client for 30 sec - exiting 11:14:04 (3872): No heartbeat from core client for 30 sec - exiting 11:14:05 (3872): No heartbeat from core client for 30 sec - exiting 11:14:06 (3872): No heartbeat from core client for 30 sec - exiting 11:14:07 (3872): No heartbeat from core client for 30 sec - exiting 11:14:08 (3872): No heartbeat from core client for 30 sec - exiting 11:14:09 (3872): No heartbeat from core client for 30 sec - exiting 11:14:11 (3872): No heartbeat from core client for 30 sec - exiting 11:14:12 (3872): No heartbeat from core client for 30 sec - exiting 11:14:13 (3872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:49:09 (3256): No heartbeat from core client for 30 sec - exiting 12:49:10 (3256): No heartbeat from core client for 30 sec - exiting 12:49:11 (3256): No heartbeat from core client for 30 sec - exiting 12:49:12 (3256): No heartbeat from core client for 30 sec - exiting 12:49:13 (3256): No heartbeat from core client for 30 sec - exiting 12:49:14 (3256): No heartbeat from core client for 30 sec - exiting 12:49:15 (3256): No heartbeat from core client for 30 sec - exiting 12:49:16 (3256): No heartbeat from core client for 30 sec - exiting 12:49:17 (3256): No heartbeat from core client for 30 sec - exiting 12:49:18 (3256): No heartbeat from core client for 30 sec - exiting 12:49:20 (3256): No heartbeat from core client for 30 sec - exiting 12:49:21 (3256): No heartbeat from core client for 30 sec - exiting 12:49:22 (3256): No heartbeat from core client for 30 sec - exiting 12:49:23 (3256): No heartbeat from core client for 30 sec - exiting 12:49:24 (3256): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... MainError: 05:10:40 AM No files match the supplied pattern. MainError: 05:10:40 AM No files match the supplied pattern. MainError: 03:59:20 PM No files match the supplied pattern. MainError: 03:59:20 PM No files match the supplied pattern. MainError: 02:56:56 AM No files match the supplied pattern. MainError: 02:56:56 AM No files match the supplied pattern. Suspended CPDN Monitor - Suspend request from BOINC... MainError: 01:56:03 PM No files match the supplied pattern. MainError: 01:56:03 PM No files match the supplied pattern. MainError: 12:54:55 AM No files match the supplied pattern. MainError: 12:54:55 AM No files match the supplied pattern. MainError: 11:52:42 AM No files match the supplied pattern. MainError: 11:52:42 AM No files match the supplied pattern. Error converting file to netcdf: dataout/o255ka.ph11c10 Error converting file to netcdf: dataout/o255ka.pg11c10 Error converting file to netcdf: dataout/o255ka.pe11c10 MainError: 10:54:52 PM No files match the supplied pattern. MainError: 10:54:52 PM No files match the supplied pattern. BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
12 Jan 2013 23:32:54 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 777,600 | 1,116,842 | 1.4363 |
12 Jan 2013 12:35:06 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 751,680 | 1,077,663 | 1.4337 |
12 Jan 2013 01:48:00 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 725,760 | 1,038,636 | 1.4311 |
11 Jan 2013 14:35:31 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 699,840 | 999,586 | 1.4283 |
11 Jan 2013 03:58:29 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 673,920 | 960,544 | 1.4253 |
10 Jan 2013 16:00:48 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 648,000 | 921,566 | 1.4222 |
10 Jan 2013 06:13:14 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 622,080 | 882,857 | 1.4192 |
09 Jan 2013 18:41:31 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 596,160 | 844,090 | 1.4159 |
09 Jan 2013 07:29:59 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 570,240 | 804,889 | 1.4115 |
08 Jan 2013 20:15:05 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 544,320 | 765,683 | 1.4067 |
08 Jan 2013 09:02:06 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 518,400 | 726,709 | 1.4018 |
07 Jan 2013 21:54:18 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 492,480 | 687,780 | 1.3966 |
07 Jan 2013 10:56:29 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 466,560 | 648,816 | 1.3906 |
06 Jan 2013 23:49:06 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 440,640 | 609,883 | 1.3841 |
06 Jan 2013 13:02:00 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 414,720 | 572,073 | 1.3794 |
06 Jan 2013 02:54:50 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 388,800 | 534,455 | 1.3746 |
05 Jan 2013 16:31:01 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 362,880 | 498,513 | 1.3738 |
05 Jan 2013 06:55:23 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 336,960 | 462,808 | 1.3735 |
04 Jan 2013 21:11:41 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 311,040 | 427,147 | 1.3733 |
03 Jan 2013 19:45:07 | 1210909 | 15514843 | hadcm3n_o255_2140_40_008269032_1 | 285,120 | 391,466 | 1.3730 |
©2024 climateprediction.net