Name | hadcm3n_sccz_1940_40_009113398_0 |
Workunit | 9243734 |
Created | 22 Oct 2014, 15:17:01 UTC |
Sent | 23 Oct 2014, 14:54:43 UTC |
Report deadline | 22 Jan 2015, 22:21:54 UTC |
Received | 10 Dec 2014, 13:02:04 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1169024 |
Run time | 18 days 16 hours 38 min 37 sec |
CPU time | 18 days 13 hours 38 min 59 sec |
Validate state | Valid |
Credit | 12,441.60 |
Device peak FLOPS | 2.43 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6976, iMonCtr=1 Model crash detected, will try to restart... 09:00:53 (6652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:55:09 (6900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:48:14 (5028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6932, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6932, iMonCtr=1 Model crash detected, will try to restart... 08:53:49 (6424): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:37:14 (6476): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:00:05 (7048): No heartbeat from core client for 30 sec - exiting 08:00:06 (7048): No heartbeat from core client for 30 sec - exiting 08:00:07 (7048): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6916, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6916, iMonCtr=1 Model crash detected, will try to restart... 07:22:52 (6604): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6448, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6448, iMonCtr=1 Model crash detected, will try to restart... 07:27:38 (3884): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 13:54:19 (5448): No heartbeat from core client for 30 sec - exiting 13:54:20 (5448): No heartbeat from core client for 30 sec - exiting 13:54:21 (5448): No heartbeat from core client for 30 sec - exiting 13:54:22 (5448): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:22:51 (6084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6836, iMonCtr=1 Model crash detected, will try to restart... 07:53:17 (3648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:57:19 (5048): No heartbeat from core client for 30 sec - exiting 07:57:20 (5048): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6644, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6852, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6756, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6928, iMonCtr=1 Model crash detected, will try to restart... 08:40:44 (5924): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4656, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 08:39:50 (6996): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:37:56 (6404): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
10 Dec 2014 11:55:50 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 1,036,800 | 1,603,707 | 1.5468 |
09 Dec 2014 14:45:57 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 1,010,880 | 1,563,387 | 1.5466 |
08 Dec 2014 17:37:18 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 984,960 | 1,523,068 | 1.5463 |
07 Dec 2014 20:03:49 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 959,040 | 1,482,697 | 1.5460 |
06 Dec 2014 20:32:06 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 933,120 | 1,442,188 | 1.5456 |
06 Dec 2014 09:18:34 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 907,200 | 1,401,855 | 1.5453 |
05 Dec 2014 11:15:47 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 881,280 | 1,361,596 | 1.5450 |
04 Dec 2014 11:19:15 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 855,360 | 1,321,406 | 1.5449 |
03 Dec 2014 13:49:59 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 829,440 | 1,281,240 | 1.5447 |
02 Dec 2014 16:12:47 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 803,520 | 1,241,096 | 1.5446 |
01 Dec 2014 17:57:52 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 777,600 | 1,200,893 | 1.5444 |
30 Nov 2014 20:41:42 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 751,680 | 1,167,236 | 1.5528 |
30 Nov 2014 09:20:22 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 725,760 | 1,126,728 | 1.5525 |
29 Nov 2014 10:58:03 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 699,840 | 1,086,331 | 1.5523 |
28 Nov 2014 07:13:30 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 673,920 | 1,046,141 | 1.5523 |
27 Nov 2014 11:46:26 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 648,000 | 1,005,695 | 1.5520 |
26 Nov 2014 15:12:33 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 622,080 | 965,277 | 1.5517 |
25 Nov 2014 18:22:51 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 596,160 | 924,869 | 1.5514 |
24 Nov 2014 21:45:09 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 570,240 | 884,495 | 1.5511 |
24 Nov 2014 10:45:46 | 1169024 | 17255369 | hadcm3n_sccz_1940_40_009113398_0 | 544,320 | 844,262 | 1.5510 |
©2024 climateprediction.net