Name | hadcm3n_006x_1980_40_008085614_2 |
Workunit | 8240728 |
Created | 23 Jul 2012, 20:43:15 UTC |
Sent | 23 Jul 2012, 20:51:40 UTC |
Report deadline | 23 Oct 2012, 4:18:51 UTC |
Received | 8 Aug 2012, 18:44:11 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1203142 |
Run time | 8 days 5 hours 43 min 25 sec |
CPU time | 6 days 7 hours 22 min 48 sec |
Validate state | Invalid |
Credit | 3,421.44 |
Device peak FLOPS | 2.11 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... 22:41:42 (1268): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6020, iMonCtr=1 Model crash detected, will try to restart... 19:50:44 (6060): No heartbeat from core client for 30 sec - exiting 19:50:45 (6060): No heartbeat from core client for 30 sec - exiting 19:50:46 (6060): No heartbeat from core client for 30 sec - exiting 19:50:47 (6060): No heartbeat from core client for 30 sec - exiting 19:50:48 (6060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:02:05 (2608): No heartbeat from core client for 30 sec - exiting 14:02:06 (2608): No heartbeat from core client for 30 sec - exiting 14:02:07 (2608): No heartbeat from core client for 30 sec - exiting 14:02:08 (2608): No heartbeat from core client for 30 sec - exiting 14:02:09 (2608): No heartbeat from core client for 30 sec - exiting 14:02:10 (2608): No heartbeat from core client for 30 sec - exiting 14:02:11 (2608): No heartbeat from core client for 30 sec - exiting 14:02:12 (2608): No heartbeat from core client for 30 sec - exiting 14:02:13 (2608): No heartbeat from core client for 30 sec - exiting 14:02:14 (2608): No heartbeat from core client for 30 sec - exiting 14:02:15 (2608): No heartbeat from core client for 30 sec - exiting 14:02:16 (2608): No heartbeat from core client for 30 sec - exiting 14:02:17 (2608): No heartbeat from core client for 30 sec - exiting 14:02:18 (2608): No heartbeat from core client for 30 sec - exiting 14:02:19 (2608): No heartbeat from core client for 30 sec - exiting 14:02:20 (2608): No heartbeat from core client for 30 sec - exiting 14:02:21 (2608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5708, iMonCtr=1 Model crash detected, will try to restart... 20:50:14 (1640): No heartbeat from core client for 30 sec - exiting 20:50:15 (1640): No heartbeat from core client for 30 sec - exiting 20:50:16 (1640): No heartbeat from core client for 30 sec - exiting 20:50:17 (1640): No heartbeat from core client for 30 sec - exiting 20:50:18 (1640): No heartbeat from core client for 30 sec - exiting 20:50:19 (1640): No heartbeat from core client for 30 sec - exiting 20:50:20 (1640): No heartbeat from core client for 30 sec - exiting 20:50:21 (1640): No heartbeat from core client for 30 sec - exiting 20:50:22 (1640): No heartbeat from core client for 30 sec - exiting 20:50:23 (1640): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:26:01 (3564): No heartbeat from core client for 30 sec - exiting 17:26:03 (3564): No heartbeat from core client for 30 sec - exiting 17:26:04 (3564): No heartbeat from core client for 30 sec - exiting 17:26:05 (3564): No heartbeat from core client for 30 sec - exiting 17:26:06 (3564): No heartbeat from core client for 30 sec - exiting 17:26:07 (3564): No heartbeat from core client for 30 sec - exiting 17:26:08 (3564): No heartbeat from core client for 30 sec - exiting 17:26:09 (3564): No heartbeat from core client for 30 sec - exiting 17:26:10 (3564): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:04:21 (3732): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 02:35:38 (4216): No heartbeat from core client for 30 sec - exiting 02:35:39 (4216): No heartbeat from core client for 30 sec - exiting 02:35:40 (4216): No heartbeat from core client for 30 sec - exiting 02:35:41 (4216): No heartbeat from core client for 30 sec - exiting 02:35:42 (4216): No heartbeat from core client for 30 sec - exiting 02:35:43 (4216): No heartbeat from core client for 30 sec - exiting 02:35:44 (4216): No heartbeat from core client for 30 sec - exiting 02:35:45 (4216): No heartbeat from core client for 30 sec - exiting 02:35:46 (4216): No heartbeat from core client for 30 sec - exiting 02:35:47 (4216): No heartbeat from core client for 30 sec - exiting 02:35:48 (4216): No heartbeat from core client for 30 sec - exiting 02:35:49 (4216): No heartbeat from core client for 30 sec - exiting 02:35:50 (4216): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4188, iMonCtr=1 Model crash detected, will try to restart... 18:05:14 (5064): No heartbeat from core client for 30 sec - exiting 18:05:15 (5064): No heartbeat from core client for 30 sec - exiting 18:05:16 (5064): No heartbeat from core client for 30 sec - exiting 18:05:17 (5064): No heartbeat from core client for 30 sec - exiting 18:05:18 (5064): No heartbeat from core client for 30 sec - exiting 18:05:19 (5064): No heartbeat from core client for 30 sec - exiting 18:05:20 (5064): No heartbeat from core client for 30 sec - exiting 18:05:21 (5064): No heartbeat from core client for 30 sec - exiting 18:05:22 (5064): No heartbeat from core client for 30 sec - exiting 18:05:23 (5064): No heartbeat from core client for 30 sec - exiting 18:05:24 (5064): No heartbeat from core client for 30 sec - exiting 18:05:25 (5064): No heartbeat from core client for 30 sec - exiting 18:05:26 (5064): No heartbeat from core client for 30 sec - exiting 18:05:27 (5064): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:05:28 (5064): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 18:23:04 (4996): No heartbeat from core client for 30 sec - exiting 18:23:05 (4996): No heartbeat from core client for 30 sec - exiting 18:23:06 (4996): No heartbeat from core client for 30 sec - exiting 18:23:07 (4996): No heartbeat from core client for 30 sec - exiting 18:23:08 (4996): No heartbeat from core client for 30 sec - exiting 18:23:09 (4996): No heartbeat from core client for 30 sec - exiting 18:23:10 (4996): No heartbeat from core client for 30 sec - exiting 18:23:11 (4996): No heartbeat from core client for 30 sec - exiting 18:23:12 (4996): No heartbeat from core client for 30 sec - exiting 18:23:13 (4996): No heartbeat from core client for 30 sec - exiting 18:23:14 (4996): No heartbeat from core client for 30 sec - exiting 18:23:15 (4996): No heartbeat from core client for 30 sec - exiting 18:23:16 (4996): No heartbeat from core client for 30 sec - exiting 18:23:17 (4996): No heartbeat from core client for 30 sec - exiting 18:23:18 (4996): No heartbeat from core client for 30 sec - exiting 18:23:19 (4996): No heartbeat from core client for 30 sec - exiting 18:23:20 (4996): No heartbeat from core client for 30 sec - exiting 18:23:21 (4996): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
08 Aug 2012 08:41:30 | 1203142 | 14985726 | hadcm3n_006x_1980_40_008085614_2 | 285,120 | 523,787 | 1.8371 |
07 Aug 2012 02:33:32 | 1203142 | 14985726 | hadcm3n_006x_1980_40_008085614_2 | 259,200 | 476,853 | 1.8397 |
05 Aug 2012 00:25:15 | 1203142 | 14985726 | hadcm3n_006x_1980_40_008085614_2 | 233,280 | 430,877 | 1.8470 |
04 Aug 2012 07:50:25 | 1203142 | 14985726 | hadcm3n_006x_1980_40_008085614_2 | 207,360 | 384,931 | 1.8563 |
02 Aug 2012 21:24:08 | 1203142 | 14985726 | hadcm3n_006x_1980_40_008085614_2 | 181,440 | 339,592 | 1.8716 |
31 Jul 2012 22:17:59 | 1203142 | 14985726 | hadcm3n_006x_1980_40_008085614_2 | 155,520 | 294,527 | 1.8938 |
30 Jul 2012 19:43:34 | 1203142 | 14985726 | hadcm3n_006x_1980_40_008085614_2 | 129,600 | 248,092 | 1.9143 |
29 Jul 2012 14:35:33 | 1203142 | 14985726 | hadcm3n_006x_1980_40_008085614_2 | 103,680 | 199,770 | 1.9268 |
28 Jul 2012 12:34:47 | 1203142 | 14985726 | hadcm3n_006x_1980_40_008085614_2 | 77,760 | 148,068 | 1.9042 |
26 Jul 2012 05:39:46 | 1203142 | 14985726 | hadcm3n_006x_1980_40_008085614_2 | 51,840 | 96,256 | 1.8568 |
25 Jul 2012 13:53:16 | 1203142 | 14985726 | hadcm3n_006x_1980_40_008085614_2 | 25,920 | 49,109 | 1.8946 |
©2024 cpdn.org