Name | hadcm3n_8c6n_1980_40_008725210_3 |
Workunit | 8871188 |
Created | 1 May 2014, 15:20:01 UTC |
Sent | 1 May 2014, 15:30:33 UTC |
Report deadline | 31 Jul 2014, 22:57:44 UTC |
Received | 22 Sep 2014, 14:36:39 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 851413 |
Run time | 34 days 19 hours 7 min 28 sec |
CPU time | 20 days 18 hours 21 min 27 sec |
Validate state | Invalid |
Credit | 8,087.04 |
Device peak FLOPS | 2.17 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.3.11</core_client_version> <![CDATA[ <message> Zařízení nezná tento příkaz. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 17:32:30 (5768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4880, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2352, iMonCtr=1 Model crash detected, will try to restart... 16:58:13 (5060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:01:16 (3600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:01:17 (3600): No heartbeat from core client for 30 sec - exiting 17:10:25 (5740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:16:28 (1116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:29:04 (1140): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:32:06 (3928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:14:24 (1432): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:26:27 (5796): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2884, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3692, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CSuspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2208, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2168, iMonCtr=1 Model crasSuspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4728, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4872, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4492, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CSuspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4216, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4136, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CSuspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3956, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3408, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3604, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4852, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3108, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3440, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4624, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2564, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4576, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3384, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4772, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... zip error: Could not create output file (was replacing the original zip file) Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2876, iMonCtr=1 Model crash detected, will try to restart... CSuspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4688, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3448, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3020, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2064, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4332, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4332, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4332, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4332, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4332, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4332, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
21 Sep 2014 11:09:18 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 673,920 | 1,778,205 | 2.6386 |
17 Sep 2014 13:18:38 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 648,000 | 1,711,321 | 2.6409 |
14 Sep 2014 19:11:43 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 622,080 | 1,644,943 | 2.6443 |
10 Sep 2014 19:55:44 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 596,160 | 1,577,650 | 2.6464 |
07 Sep 2014 12:35:51 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 570,240 | 1,510,686 | 2.6492 |
03 Sep 2014 13:40:47 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 544,320 | 1,443,769 | 2.6524 |
31 Aug 2014 19:09:16 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 518,400 | 1,377,291 | 2.6568 |
20 Aug 2014 15:55:53 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 492,480 | 1,309,749 | 2.6595 |
14 Aug 2014 15:49:09 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 466,560 | 1,242,748 | 2.6636 |
31 Jul 2014 19:23:18 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 440,640 | 1,173,395 | 2.6629 |
23 Jul 2014 16:44:52 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 414,720 | 1,105,850 | 2.6665 |
12 Jul 2014 10:44:02 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 388,800 | 1,037,934 | 2.6696 |
08 Jul 2014 20:41:05 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 362,880 | 970,215 | 2.6737 |
03 Jul 2014 13:51:57 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 336,960 | 902,639 | 2.6788 |
29 Jun 2014 18:33:05 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 311,040 | 835,118 | 2.6849 |
26 Jun 2014 18:52:52 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 285,120 | 767,037 | 2.6902 |
21 Jun 2014 14:16:24 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 259,200 | 698,656 | 2.6954 |
16 Jun 2014 15:49:00 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 233,280 | 629,615 | 2.6990 |
12 Jun 2014 15:20:22 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 207,360 | 560,720 | 2.7041 |
07 Jun 2014 07:47:16 | 851413 | 16606351 | hadcm3n_8c6n_1980_40_008725210_3 | 181,440 | 491,878 | 2.7110 |
©2024 cpdn.org