Name | hadcm3n_zdz4_1880_40_008244949_3 |
Workunit | 8400073 |
Created | 16 Feb 2013, 6:55:42 UTC |
Sent | 16 Feb 2013, 6:55:56 UTC |
Report deadline | 18 May 2013, 14:23:07 UTC |
Received | 21 Mar 2013, 17:24:52 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1260863 |
Run time | 30 days 12 hours 36 min 24 sec |
CPU time | 29 days 20 hours 43 min 10 sec |
Validate state | Invalid |
Credit | 8,398.08 |
Device peak FLOPS | 2.65 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 02:41:56 (405028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:55:08 (487124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:36:55 (868256): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:14:42 (1104788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Called boinc_finish 10:44:23 (4172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 22:46:54 (94940): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 02:59:49 (306352): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=306692, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=306692, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=306692, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=306692, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=306692, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=306692, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
04 Mar 2013 06:32:52 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 699,840 | 1,359,032 | 1.9419 |
03 Mar 2013 16:25:49 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 673,920 | 1,308,525 | 1.9417 |
03 Mar 2013 02:24:21 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 648,000 | 1,258,187 | 1.9416 |
02 Mar 2013 12:14:34 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 622,080 | 1,207,270 | 1.9407 |
01 Mar 2013 20:52:04 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 596,160 | 1,156,074 | 1.9392 |
01 Mar 2013 06:41:57 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 570,240 | 1,105,952 | 1.9395 |
28 Feb 2013 16:25:40 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 544,320 | 1,055,268 | 1.9387 |
28 Feb 2013 02:26:06 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 518,400 | 1,005,723 | 1.9401 |
27 Feb 2013 12:02:33 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 492,480 | 954,394 | 1.9379 |
26 Feb 2013 21:28:21 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 466,560 | 902,956 | 1.9353 |
26 Feb 2013 07:09:49 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 440,640 | 852,372 | 1.9344 |
25 Feb 2013 16:42:10 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 414,720 | 800,664 | 1.9306 |
25 Feb 2013 02:12:22 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 388,800 | 750,147 | 1.9294 |
24 Feb 2013 11:31:29 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 362,880 | 698,340 | 1.9244 |
23 Feb 2013 20:59:31 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 336,960 | 646,866 | 1.9197 |
23 Feb 2013 06:46:44 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 311,040 | 596,390 | 1.9174 |
22 Feb 2013 16:20:44 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 285,120 | 544,993 | 1.9115 |
22 Feb 2013 02:05:33 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 259,200 | 494,550 | 1.9080 |
21 Feb 2013 11:45:18 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 233,280 | 443,683 | 1.9019 |
20 Feb 2013 21:46:49 | 1260863 | 15610019 | hadcm3n_zdz4_1880_40_008244949_3 | 207,360 | 393,554 | 1.8979 |
©2024 cpdn.org