|
Name | hadcm3n_7wbr_1980_40_008453178_4 |
Workunit | 8604034 |
Created | 1 Jan 2014, 20:07:29 UTC |
Sent | 1 Jan 2014, 20:07:37 UTC |
Report deadline | 3 Apr 2014, 3:34:48 UTC |
Received | 15 Jan 2014, 4:03:54 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1042468 |
Run time | 10 days 6 hours 54 min 38 sec |
CPU time | 9 days 17 hours 11 min 52 sec |
Validate state | Invalid |
Credit | 6,531.84 |
Device peak FLOPS | 2.50 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 08:05:17 (9156): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6376, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6376, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6376, iMonCtr=1 Model crash detected, will try to restart... 08:35:55 (6376): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:34:47 (5936): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:33:41 (1740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:28:57 (6876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:25:32 (6920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:23:13 (7604): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:20:52 (7984): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:16:09 (2444): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4312, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4312, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4312, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
12 Jan 2014 00:11:22 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 544,320 | 838,123 | 1.5398 |
11 Jan 2014 12:23:32 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 518,400 | 797,735 | 1.5388 |
11 Jan 2014 00:39:21 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 492,480 | 757,340 | 1.5378 |
10 Jan 2014 17:05:13 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 466,560 | 717,262 | 1.5373 |
10 Jan 2014 01:10:06 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 440,640 | 676,672 | 1.5357 |
09 Jan 2014 13:28:21 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 414,720 | 636,656 | 1.5351 |
09 Jan 2014 01:55:17 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 388,800 | 596,886 | 1.5352 |
08 Jan 2014 14:13:02 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 362,880 | 556,791 | 1.5344 |
08 Jan 2014 02:44:03 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 336,960 | 515,435 | 1.5297 |
07 Jan 2014 14:11:07 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 311,040 | 475,191 | 1.5277 |
07 Jan 2014 01:56:12 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 285,120 | 434,513 | 1.5240 |
06 Jan 2014 13:16:23 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 259,200 | 393,167 | 1.5168 |
06 Jan 2014 01:01:29 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 233,280 | 352,877 | 1.5127 |
05 Jan 2014 13:30:58 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 207,360 | 313,257 | 1.5107 |
05 Jan 2014 02:23:36 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 181,440 | 274,299 | 1.5118 |
04 Jan 2014 14:58:37 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 155,520 | 234,446 | 1.5075 |
04 Jan 2014 03:48:41 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 129,600 | 195,593 | 1.5092 |
03 Jan 2014 16:39:21 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 103,680 | 156,330 | 1.5078 |
03 Jan 2014 05:29:54 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 77,760 | 117,142 | 1.5065 |
02 Jan 2014 18:21:55 | 1042468 | 16195812 | hadcm3n_7wbr_1980_40_008453178_4 | 51,840 | 77,993 | 1.5045 |
©2024 climateprediction.net