Name | hadcm3n_p29m_1900_40_007220522_0 |
Workunit | 7418762 |
Created | 26 Apr 2011, 15:20:26 UTC |
Sent | 2 May 2011, 20:35:22 UTC |
Report deadline | 2 Aug 2011, 4:02:33 UTC |
Received | 3 Jun 2011, 17:17:40 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1089900 |
Run time | 18 days 13 hours 44 min 37 sec |
CPU time | 18 days 11 hours 58 min 29 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 2.32 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 10:28:46 (4380): No heartbeat from core client for 30 sec - exiting 10:28:47 (4380): No heartbeat from core client for 30 sec - exiting 10:28:48 (4380): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1096, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... 
Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1096, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3648, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3648, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3648, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3648, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
03 Jun 2011 11:49:16 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 777,600 | 1,588,827 | 2.0432 |
02 Jun 2011 21:07:16 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 751,680 | 1,536,059 | 2.0435 |
02 Jun 2011 06:26:42 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 725,760 | 1,483,468 | 2.0440 |
01 Jun 2011 14:21:18 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 699,840 | 1,430,364 | 2.0438 |
31 May 2011 23:25:58 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 673,920 | 1,377,503 | 2.0440 |
31 May 2011 08:33:28 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 648,000 | 1,324,203 | 2.0435 |
30 May 2011 17:56:48 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 622,080 | 1,271,470 | 2.0439 |
30 May 2011 03:12:08 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 596,160 | 1,218,765 | 2.0444 |
29 May 2011 12:29:01 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 570,240 | 1,165,831 | 2.0445 |
28 May 2011 17:07:03 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 544,320 | 1,112,857 | 2.0445 |
28 May 2011 02:27:47 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 518,400 | 1,060,488 | 2.0457 |
27 May 2011 09:45:27 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 492,480 | 1,007,861 | 2.0465 |
26 May 2011 19:13:17 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 466,560 | 955,556 | 2.0481 |
26 May 2011 04:29:00 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 440,640 | 902,744 | 2.0487 |
25 May 2011 12:55:40 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 414,720 | 849,883 | 2.0493 |
24 May 2011 20:47:11 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 388,800 | 796,538 | 2.0487 |
23 May 2011 11:59:29 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 362,880 | 742,947 | 2.0474 |
22 May 2011 20:47:21 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 336,960 | 688,507 | 2.0433 |
21 May 2011 10:53:21 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 311,040 | 634,947 | 2.0414 |
20 May 2011 20:10:50 | 1089900 | 12820966 | hadcm3n_p29m_1900_40_007220522_0 | 285,120 | 582,148 | 2.0418 |
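
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. This is inferred from the figures themselves and is not stated on the page. The sketch below recomputes it for the four most recent rows and also checks the spacing between consecutive trickles; the function and variable names are illustrative, not part of BOINC or CPDN.

```python
# Four most recent rows copied from the trickle table:
# (timestep, cpu_time_sec, reported_average_sec_per_ts)
TRICKLES = [
    (777_600, 1_588_827, 2.0432),
    (751_680, 1_536_059, 2.0435),
    (725_760, 1_483_468, 2.0440),
    (699_840, 1_430_364, 2.0438),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep


def recomputed_averages(rows):
    """Recompute the average for each row, rounded to 4 decimals like the table."""
    return [round(sec_per_timestep(cpu, ts), 4) for ts, cpu, _ in rows]


def timestep_gaps(rows):
    """Timesteps elapsed between consecutive trickles (rows are newest first)."""
    steps = [ts for ts, _, _ in rows]
    return [newer - older for newer, older in zip(steps, steps[1:])]


if __name__ == "__main__":
    for (ts, cpu, reported), calc in zip(TRICKLES, recomputed_averages(TRICKLES)):
        print(f"timestep={ts:>7}  reported={reported:.4f}  recomputed={calc:.4f}")
    print("gaps between trickles:", timestep_gaps(TRICKLES))
```

For these rows the recomputed values agree with the reported column to four decimal places, and each trickle is 25,920 timesteps after the previous one.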