Name | hadcm3n_t1pj_1940_40_007447107_0 |
Workunit | 7644610 |
Created | 9 Sep 2011, 17:14:37 UTC |
Sent | 16 Sep 2011, 14:19:34 UTC |
Report deadline | 16 Dec 2011, 21:46:45 UTC |
Received | 14 Oct 2011, 7:36:51 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1166597 |
Run time | 6 days 6 hours 2 min 15 sec |
CPU time | 5 days 15 hours 3 min 55 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 2.36 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5788, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5340, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:39:24 (5852): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:39:25 (5852): No heartbeat from core client for 30 sec - exiting 08:39:26 (5852): No heartbeat from core client for 30 sec - exiting 08:39:27 (5852): No heartbeat from core client for 30 sec - exiting 08:39:28 (5852): No heartbeat from core client for 30 sec - exiting 08:39:29 (5852): No heartbeat from core client for 30 sec - exiting 08:39:30 (5852): No heartbeat from core client for 30 sec - exiting 08:39:31 (5852): No heartbeat from core client for 30 sec - exiting 08:39:32 (5852): No heartbeat from core client for 30 sec - exiting 08:39:33 (5852): No heartbeat from core client for 30 sec - exiting 08:39:34 (5852): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 08:45:28 (5660): No heartbeat from core client for 30 sec - exiting 08:45:29 (5660): No heartbeat from core client for 30 sec - exiting 08:45:30 (5660): No heartbeat from core client for 30 sec - exiting 08:45:31 (5660): No heartbeat from core client for 30 sec - exiting 08:45:32 (5660): No heartbeat from core client for 30 sec - exiting 08:45:33 (5660): No heartbeat from core client for 30 sec - exiting 08:45:34 (5660): No heartbeat from core client for 30 sec - exiting 08:45:36 (5660): No heartbeat from core client for 30 sec - exiting 08:45:37 (5660): No heartbeat from core client for 30 sec - exiting 08:45:38 (5660): No heartbeat from core client for 30 sec - exiting 08:45:39 (5660): No heartbeat from core client for 30 sec - exiting 08:45:40 (5660): No heartbeat from core client for 30 sec - exiting 08:45:41 (5660): No heartbeat from core client for 30 sec - exiting 08:45:42 (5660): No heartbeat from core client for 30 sec - exiting 08:45:43 (5660): No heartbeat from core client for 30 sec - exiting 08:45:44 (5660): No heartbeat from core client for 30 sec - exiting 08:45:45 (5660): No heartbeat from core client for 30 sec - exiting 08:45:46 (5660): No heartbeat from core client for 30 sec - exiting 08:45:48 (5660): No heartbeat from core client for 30 sec - exiting 08:45:49 (5660): No heartbeat from core client for 30 sec - exiting 08:45:50 (5660): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2564, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5420, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5420, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6052, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CSuspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5960, iMonCtr=1 Model crash detected, will try to restart... 08:59:37 (5056): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4716, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4664, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
14 Oct 2011 07:38:09 | 1166597 | 13358466 | hadcm3n_t1pj_1940_40_007447107_0 | 259,200 | 486,231 | 1.8759 |
11 Oct 2011 10:24:11 | 1166597 | 13358466 | hadcm3n_t1pj_1940_40_007447107_0 | 233,280 | 436,022 | 1.8691 |
07 Oct 2011 07:19:10 | 1166597 | 13358466 | hadcm3n_t1pj_1940_40_007447107_0 | 207,360 | 387,003 | 1.8663 |
06 Oct 2011 02:06:41 | 1166597 | 13358466 | hadcm3n_t1pj_1940_40_007447107_0 | 181,440 | 338,935 | 1.8680 |
05 Oct 2011 10:01:17 | 1166597 | 13358466 | hadcm3n_t1pj_1940_40_007447107_0 | 155,520 | 288,201 | 1.8531 |
03 Oct 2011 17:51:06 | 1166597 | 13358466 | hadcm3n_t1pj_1940_40_007447107_0 | 129,600 | 240,609 | 1.8566 |
30 Sep 2011 14:16:04 | 1166597 | 13358466 | hadcm3n_t1pj_1940_40_007447107_0 | 103,680 | 192,554 | 1.8572 |
28 Sep 2011 13:49:19 | 1166597 | 13358466 | hadcm3n_t1pj_1940_40_007447107_0 | 77,760 | 144,144 | 1.8537 |
26 Sep 2011 10:56:05 | 1166597 | 13358466 | hadcm3n_t1pj_1940_40_007447107_0 | 51,840 | 97,230 | 1.8756 |
22 Sep 2011 12:31:06 | 1166597 | 13358466 | hadcm3n_t1pj_1940_40_007447107_0 | 25,920 | 48,647 | 1.8768 |
©2024 cpdn.org