Name | hadcm3n_ymfz_1900_40_007523515_1 |
Workunit | 7720990 |
Created | 28 Oct 2011, 13:22:55 UTC |
Sent | 31 Oct 2011, 11:58:34 UTC |
Report deadline | 30 Jan 2012, 19:25:45 UTC |
Received | 7 Dec 2011, 11:10:17 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1142323 |
Run time | 12 days 20 hours 20 min 39 sec |
CPU time | 11 days 18 hours 30 min 2 sec |
Validate state | Invalid |
Credit | 6,220.80 |
Device peak FLOPS | 2.16 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.60</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7836, iMonCtr=1 Model crash detected, will try to restart... 09:39:55 (4516): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4856, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4516, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5800, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6564, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5188, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3224, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5836, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=156, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 06:28:09 (5524): No heartbeat from core client for 30 sec - exiting 06:28:10 (5524): No heartbeat from core client for 30 sec - exiting 06:28:11 (5524): No heartbeat from core client for 30 sec - exiting 06:28:12 (5524): No heartbeat from core client for 30 sec - exiting 06:28:13 (5524): No heartbeat from core client for 30 sec - exiting 06:28:14 (5524): No heartbeat from core client for 30 sec - exiting 06:28:15 (5524): No heartbeat from core client for 30 sec - exiting 06:28:16 (5524): No heartbeat from core client for 30 sec - exiting 06:28:17 (5524): No heartbeat from core client for 30 sec - exiting 06:28:18 (5524): No heartbeat from core client for 30 sec - exiting 06:28:19 (5524): No heartbeat from core client for 30 sec - exiting 06:28:20 (5524): No heartbeat from core client for 30 sec - exiting 06:28:21 (5524): No heartbeat from core client for 30 sec - exiting 06:28:22 (5524): No heartbeat from core client for 30 sec - exiting 06:28:23 (5524): No heartbeat from core client for 30 sec - exiting 06:28:24 (5524): No heartbeat from core client for 30 sec - exiting 06:28:25 (5524): No heartbeat from core client for 30 sec - exiting 06:28:26 (5524): No heartbeat from core client for 30 sec - exiting 06:28:27 (5524): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5844, iMonCtr=1 Model crash detected, will try to restart... 08:21:09 (3536): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5860, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 20:33:10 (4384): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4824, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:51:56 (6396): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 20:59:59 (5436): No heartbeat from core client for 30 sec - exiting 21:00:00 (5436): No heartbeat from core client for 30 sec - exiting 21:00:01 (5436): No heartbeat from core client for 30 sec - exiting 21:00:02 (5436): No heartbeat from core client for 30 sec - exiting 21:00:03 (5436): No heartbeat from core client for 30 sec - exiting 21:00:04 (5436): No heartbeat from core client for 30 sec - exiting 21:00:05 (5436): No heartbeat from core client for 30 sec - exiting 21:00:06 (5436): No heartbeat from core client for 30 sec - exiting 21:00:07 (5436): No heartbeat from core client for 30 sec - exiting 21:00:08 (5436): No heartbeat from core client for 30 sec - exiting 21:00:09 (5436): No heartbeat from core client for 30 sec - exiting 21:00:10 (5436): No heartbeat from core client for 30 sec - exiting 21:00:11 (5436): No heartbeat from core client for 30 sec - exiting 21:00:12 (5436): No heartbeat from core client for 30 sec - exiting 21:00:13 (5436): No heartbeat from core client for 30 sec - exiting 21:00:14 (5436): No heartbeat from core client for 30 sec - exiting 21:00:15 (5436): No heartbeat from core client for 30 sec - exiting 21:00:16 (5436): No heartbeat from core client for 30 sec - exiting 21:00:17 (5436): No heartbeat from core client for 30 sec - exiting 21:00:18 (5436): No heartbeat from core client for 30 sec - exiting 21:00:19 (5436): No heartbeat from core client for 30 sec - exiting 21:00:20 (5436): No heartbeat from core client for 30 sec - exiting 21:00:21 (5436): No heartbeat from core client for 30 sec - exiting 21:00:22 (5436): No heartbeat from core client for 30 sec - exiting 21:00:23 (5436): No heartbeat from core client for 30 sec - exiting 21:00:24 (5436): No heartbeat from core client for 30 sec - exiting 21:00:25 (5436): No heartbeat from core client for 30 sec - exiting 21:00:26 (5436): No heartbeat from core client for 30 sec - exiting 21:00:27 (5436): No heartbeat from core client for 30 sec - exiting 21:00:28 (5436): No heartbeat from core client for 30 sec - exiting 21:00:29 (5436): No heartbeat from core client for 30 sec - exiting 21:00:30 (5436): No heartbeat from core client for 30 sec - exiting 21:00:31 (5436): No heartbeat from core client for 30 sec - exiting 21:00:32 (5436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:23:17 (5032): No heartbeat from core client for 30 sec - exiting 06:23:18 (5032): No heartbeat from core client for 30 sec - exiting 06:23:19 (5032): No heartbeat from core client for 30 sec - exiting 06:23:20 (5032): No heartbeat from core client for 30 sec - exiting 06:23:21 (5032): No heartbeat from core client for 30 sec - exiting 06:23:22 (5032): No heartbeat from core client for 30 sec - exiting 06:23:23 (5032): No heartbeat from core client for 30 sec - exiting 06:23:24 (5032): No heartbeat from core client for 30 sec - exiting 06:23:25 (5032): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:28:42 (5936): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:29:49 (5872): No heartbeat from core client for 30 sec - exiting 20:29:51 (5872): No heartbeat from core client for 30 sec - exiting 20:29:52 (5872): No heartbeat from core client for 30 sec - exiting 20:29:53 (5872): No heartbeat from core client for 30 sec - exiting 20:29:54 (5872): No heartbeat from core client for 30 sec - exiting 20:29:55 (5872): No heartbeat from core client for 30 sec - exiting 20:29:56 (5872): No heartbeat from core client for 30 sec - exiting 20:29:57 (5872): No heartbeat from core client for 30 sec - exiting 20:29:58 (5872): No heartbeat from core client for 30 sec - exiting 20:29:59 (5872): No heartbeat from core client for 30 sec - exiting 20:30:00 (5872): No heartbeat from core client for 30 sec - exiting 20:30:01 (5872): No heartbeat from core client for 30 sec - exiting 20:30:02 (5872): No heartbeat from core client for 30 sec - exiting 20:30:03 (5872): No heartbeat from core client for 30 sec - exiting 20:30:04 (5872): No heartbeat from core client for 30 sec - exiting 20:30:05 (5872): No heartbeat from core client for 30 sec - exiting 20:30:06 (5872): No heartbeat from core client for 30 sec - exiting 20:30:07 (5872): No heartbeat from core client for 30 sec - exiting 20:30:08 (5872): No heartbeat from core client for 30 sec - exiting 20:30:09 (5872): No heartbeat from core client for 30 sec - exiting 20:30:10 (5872): No heartbeat from core client for 30 sec - exiting 20:30:11 (5872): No heartbeat from core client for 30 sec - exiting 20:30:12 (5872): No heartbeat from core client for 30 sec - exiting 20:30:13 (5872): No heartbeat from core client for 30 sec - exiting 20:30:14 (5872): No heartbeat from core client for 30 sec - exiting 20:30:15 (5872): No heartbeat from core client for 30 sec - exiting 20:30:16 (5872): No heartbeat from core client for 30 sec - exiting 20:30:17 (5872): No heartbeat from core client for 30 sec - exiting 20:30:18 (5872): No heartbeat from core client for 30 sec - exiting 20:30:19 (5872): No heartbeat from core client for 30 sec - exiting 20:30:20 (5872): No heartbeat from core client for 30 sec - exiting 20:30:21 (5872): No heartbeat from core client for 30 sec - exiting 20:30:22 (5872): No heartbeat from core client for 30 sec - exiting 20:30:23 (5872): No heartbeat from core client for 30 sec - exiting 20:30:24 (5872): No heartbeat from core client for 30 sec - exiting 20:30:25 (5872): No heartbeat from core client for 30 sec - exiting 20:30:26 (5872): No heartbeat from core client for 30 sec - exiting 20:30:27 (5872): No heartbeat from core client for 30 sec - exiting 20:30:28 (5872): No heartbeat from core client for 30 sec - exiting 20:30:29 (5872): No heartbeat from core client for 30 sec - exiting 20:30:30 (5872): No heartbeat from core client for 30 sec - exiting 20:30:31 (5872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 06:12:37 (2788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:12:38 (2788): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4156, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 06:08:08 (6968): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 20:16:36 (8108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8188, iMonCtr=1 Model crash detected, will try to restart... 06:17:46 (4832): No heartbeat from core client for 30 sec - exiting 06:17:47 (4832): No heartbeat from core client for 30 sec - exiting 06:17:48 (4832): No heartbeat from core client for 30 sec - exiting 06:17:49 (4832): No heartbeat from core client for 30 sec - exiting 06:17:50 (4832): No heartbeat from core client for 30 sec - exiting 06:17:51 (4832): No heartbeat from core client for 30 sec - exiting 06:17:52 (4832): No heartbeat from core client for 30 sec - exiting 06:17:53 (4832): No heartbeat from core client for 30 sec - exiting 06:17:54 (4832): No heartbeat from core client for 30 sec - exiting 06:17:55 (4832): No heartbeat from core client for 30 sec - exiting 06:17:56 (4832): No heartbeat from core client for 30 sec - exiting 06:17:57 (4832): No heartbeat from core client for 30 sec - exiting 06:17:58 (4832): No heartbeat from core client for 30 sec - exiting 06:17:59 (4832): No heartbeat from core client for 30 sec - exiting 06:18:00 (4832): No heartbeat from core client for 30 sec - exiting 06:18:01 (4832): No heartbeat from core client for 30 sec - exiting 06:18:02 (4832): No heartbeat from core client for 30 sec - exiting 06:18:03 (4832): No heartbeat from core client for 30 sec - exiting 06:18:04 (4832): No heartbeat from core client for 30 sec - exiting 06:18:05 (4832): No heartbeat from core client for 30 sec - exiting 06:18:06 (4832): No heartbeat from core client for 30 sec - exiting 06:18:07 (4832): No heartbeat from core client for 30 sec - exiting 06:18:08 (4832): No heartbeat from core client for 30 sec - exiting 06:18:09 (4832): No heartbeat from core client for 30 sec - exiting 06:18:10 (4832): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:18:11 (4832): No heartbeat from core client for 30 sec - exiting 06:21:17 (1716): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 22:20:40 (4420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
06 Dec 2011 11:10:37 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 518,400 | 1,016,980 | 1.9618 |
05 Dec 2011 12:42:00 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 492,480 | 967,595 | 1.9647 |
04 Dec 2011 09:30:08 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 466,560 | 918,354 | 1.9684 |
02 Dec 2011 11:02:33 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 440,640 | 869,191 | 1.9726 |
01 Dec 2011 20:38:29 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 414,720 | 819,237 | 1.9754 |
29 Nov 2011 08:21:13 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 388,800 | 764,545 | 1.9664 |
27 Nov 2011 01:58:41 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 362,880 | 710,199 | 1.9571 |
23 Nov 2011 21:18:06 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 336,960 | 656,267 | 1.9476 |
22 Nov 2011 11:09:21 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 311,040 | 605,473 | 1.9466 |
20 Nov 2011 09:02:51 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 285,120 | 554,993 | 1.9465 |
16 Nov 2011 21:29:59 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 259,200 | 505,669 | 1.9509 |
15 Nov 2011 21:19:07 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 233,280 | 457,472 | 1.9610 |
15 Nov 2011 21:19:07 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 207,360 | 408,775 | 1.9713 |
15 Nov 2011 21:19:07 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 181,440 | 355,133 | 1.9573 |
15 Nov 2011 21:19:07 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 155,520 | 300,881 | 1.9347 |
07 Nov 2011 23:13:38 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 129,600 | 251,149 | 1.9379 |
06 Nov 2011 10:45:24 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 103,680 | 200,786 | 1.9366 |
04 Nov 2011 14:29:30 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 77,760 | 150,911 | 1.9407 |
03 Nov 2011 01:55:30 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 51,840 | 101,799 | 1.9637 |
02 Nov 2011 10:52:45 | 1142323 | 13552730 | hadcm3n_ymfz_1900_40_007523515_1 | 25,920 | 51,398 | 1.9829 |
©2024 cpdn.org